[lug] Reliable SATA server?
Dan Ferris
dan at usrsbin.com
Sat Apr 21 09:50:57 MDT 2012
Your IO wait time is pretty serious, looks like your server is spending
40-50% of its time waiting for IO to complete. My own personal rule is
that more than 10% of IO wait continuously means you have a problem.
Do you use hardware host bus adapters with write back caching, or are
you using plain sata drives and mdadm?
Dan
On 4/21/2012 9:03 AM, Rob Nagler wrote:
> Over the years we've struggled to find a reliable SATA server for our
> backups. I have tried numerous versions of SATA computers, and they
> all seem to fail under our loads, always. The failure is usually in
> the form of the computer hanging in such a bad state that it requires
> a power cycle.
>
> We do have a fairly stable server these days, but now I'm finding the
> disk slots are failing -- along with the disks. The server isn't that
> old, but it has been relatively reliable (only the occasional hang).
>
> I've attached a vmstat when the server is busy. I can't do much on
> the server at this time, e.g. tab completion of a command takes a few
> seconds. This is very different from a benchmark load so the numbers
> are deceptive.
>
> At this stage, I'm tempted to bite the bullet and go with SAS. We
> have SCSI3 machines running with heavy (but dissimilar) loads without
> a problem.
>
> I've asked this question on this list before, and not gotten a
> satisfactory answer. Nobody seems to run the same type of loads that
> we do, and all their servers run just fine. However, if you do have
> experience with non-sequential writing of large amounts of data to
> on a SATA server, please let me know.
>
> Thanks,
> Rob
>
> # vmstat 1
> procs -----------memory---------- ---swap-- -----io---- --system-- -----cpu------
> r b swpd free buff cache si so bi bo in cs us sy id wa st
> 0 2 144 15672 328852 826992 0 0 300 184 9 3 2 1 91 6 0
> 0 2 144 16136 328436 827052 0 0 3684 0 1544 1361 3 2 55 39 0
> 0 2 144 19764 326164 826964 0 0 10948 0 1270 1253 10 2 52 37 0
> 2 1 144 26640 321224 826268 0 0 27828 2948 1295 2364 21 3 50 26 0
> 0 2 144 26504 320472 827268 0 0 5832 0 1375 1150 10 1 54 35 0
> 0 3 144 29356 319048 826360 0 0 4688 784 1371 984 13 2 52 33 0
> 0 3 144 29232 319048 826528 0 0 32 5624 1114 90 0 0 62 37 0
> 0 2 144 29232 319048 826508 0 0 8 748 1119 72 0 0 54 46 0
> 0 3 144 29232 319052 826532 0 0 8 36 1094 56 0 0 26 74 0
> 0 2 144 29608 318708 826432 0 0 3244 4 1446 1138 3 2 52 43 0
> 0 2 144 29884 318204 826952 0 0 2344 1000 1294 656 2 1 55 42 0
> 1 2 144 32428 316516 826648 0 0 6388 0 1167 732 10 2 49 40 0
> 1 2 144 35916 314220 825756 0 0 7476 0 1205 837 11 2 49 38 0
> 0 2 144 37940 312616 825584 0 0 6484 0 1228 835 10 2 49 40 0
> 0 2 144 39108 311712 825816 0 0 4184 14048 1235 553 5 1 50 44 0
> 0 3 144 39428 310848 826544 0 0 4528 4956 1333 939 6 2 53 40 0
> 0 3 144 39428 310852 826676 0 0 8 784 1131 80 0 0 51 49 0
> 0 2 144 39428 310852 826644 0 0 8 468 1099 78 0 0 50 50 0
> 0 2 144 40328 310620 825876 0 0 912 0 1338 694 0 0 52 48 0
> 0 2 144 39600 310476 826804 0 0 3092 104 1492 1285 1 2 60 37 0
> 0 2 144 40972 309896 826492 0 0 3636 1052 1312 806 3 1 54 42 0
> _______________________________________________
> Web Page: http://lug.boulder.co.us
> Mailing List: http://lists.lug.boulder.co.us/mailman/listinfo/lug
> Join us on IRC: irc.hackingsociety.org port=6667 channel=#hackingsociety
More information about the LUG
mailing list