Linux server memory check

by on December 11, 2006 · 6 comments· LAST UPDATED January 7, 2008

in , ,

If your server crashes regularly it could be a buggy kernel, a driver, power supply or any other hardware part. Memory (RAM) is one of the critical server parts. Bad memory can cause various problems such as random Linux server restart or program segfaults.

Generally, I recommend using memtester command. It is an effective userspace tester for stress-testing the memory subsystem. It is very effective at finding intermittent and non deterministic faults under Linux.

Recently Rahul shah email me another interesting method for testing memory. His idea is based upon md5 checksum and dd command.

First find out memory site using free command.
$ free
Output:

 total       used       free     shared    buffers     cached
Mem:        768304     555616     212688          0      22012     270996
-/+ buffers/cache:     262608     505696
Swap:       979956          0     979956

In above example my server has 768304K memory. Now use dd command as follows:
$ dd if=/dev/urandom bs=768304 of=/tmp/memtest count=1050
$ md5sum /tmp/memtest; md5sum /tmp/memtest; md5sum /tmp/memtest

According to him if the checksums do not match, you have faulty memory guaranteed. Read dd command man page to understand all options. dd will create /tmp/memtest file. It will cache data in memory by filling up all memory during read operation. Using md5sum command you are reading same data from memory (as it was cached).

Look like a good hack to me. However I still recommend using memtester userland program. Another option is to use memtest86 program ISO. Download ISO, burn the same on a CD, reboot your system with it test it (it may take more time). From project home page:
Memtest86 is thorough, stand alone memory test for x86 architecture computers. BIOS based memory tests are a quick, cursory check and often miss many of the failures that are detected by Memtest86.

TwitterFacebookGoogle+PDF versionFound an error/typo on this page? Help us!

{ 6 comments… read them below or add one }

1 Darren Griffith December 22, 2006 at 8:41 am

md5sum does *not* keep its entire target in memory. The md5 algorithm, as I understand it, uses the hash value of the previous block of data to compute the hash value of the next block. So only the previous hash value and the current block of data need to be in memory at a given time. So the above test is useless. It will *not* exercise all of your machine’s memory.

You can observe md5sum’s memory usage by looking at the output of ps or top.

Reply

2 Bob Glassberg February 16, 2007 at 5:38 pm

‘Free’ command is not in Solaris. Nor does ‘Vmstat’ on Linux give equivalent info for ‘swap memory’ as ‘vmstat’ gives on Solaris. Do you know of any command common to both systems that would give me info on swap memory in fairly similar format?

Reply

3 Magare September 15, 2008 at 10:38 pm

Darren,
md5check actually uses shutil.copyfileobj to copy the file (which reads and writes 16KB chunks at a time), and then seeks back to the start of the source file, reads the whole thing into memory in one go, and computes its MD5. So it is not useless.
And you use it in single user mode.
cheers

Reply

4 Unihost Brasil - Servidores de hospedagem November 16, 2010 at 9:45 am

There’s another way to test the memory using a shell script:
http://people.redhat.com/~dledford/memtest.shtml

Alex M. – Tech Support – Unihost Brasil

Reply

5 Joseph Bloe March 1, 2014 at 1:01 pm

wow…
“According to him if the checksums do not match, you have faulty memory guaranteed.”

If you drive has write issues (meaning it can’t write all of the same data each time), it will also cause the above behavior.

Saying such definite things is very misleading to those who don’t know any better.

Reply

6 Joseph Bloe March 1, 2014 at 1:02 pm

Someone should also take a stab at the idea of “testing memory” while a kernel is loaded :/

It’s one reason things like memtest are supposed to be executed at boot, not after a kernel has loaded (since you can’t possibly test the RAM the kernel occupies).

Reply

Leave a Comment

Tagged as: , , , , , , , , ,

Previous post:

Next post: