I've got an FC10 LAMP box that's relatively inactive (we use it for SVN, TWiki, and Bugzilla for a team of 2 people). We recently upgraded the RAM from 8gb to 32GB. The system will run fine for several hours using ~17mb for cache, then suddenly grab all the free RAM for cache. Even though the pattern seems odd (it doesn't seem to happen according to any schedule), my understanding is it's not out-of-the-norm.
Before:
Code:
vmstat
procs -----------memory---------- ---swap-- -----io---- --system-- -----cpu------
r b swpd free buff cache si so bi bo in cs us sy id wa st
2 0 0 32725328 176724 153280 0 0 1 6 30 41 0 0 99 0 0
After:
Code:
vmstat
procs -----------memory---------- ---swap-- -----io---- --system-- -----cpu------
r b swpd free buff cache si so bi bo in cs us sy id wa st
1 0 68 460028 54268 32491640 0 0 9608 0 3443 3795 0 5 95 0 0
The trouble is, within an hour of grabbing all the RAM, we'll get a burst of "Out of memory: kill process xxx score xxx or a child. Killed process xxx" errors and the machine will die. It doesn't seem to be any specific process that's setting it off (I can scroll back in the console and see it's killed 3 or 4 before completely freaking out).
We did run Memtest86 for a dozen or so passes after upgrading the RAM without fail, so I'm assuming this is a memory management issue rather than actual bad RAM.
Any ideas what I can do to stave off these crashes (worst case, I could make a cron job that calls
sync; echo 3 > /proc/sys/vm/drop_caches every 8 hours or so).
Linux version 2.6.27.5-117.fc10.i686.PAE (mockbuild@x86-7.fedora.phx.redhat.com) (gcc version 4.3.2 20081105 (Red Hat 4.3.2-7) (GCC) ) #1 SMP Tue Nov 18 12:08:10 EST 2008