- Joined
- Apr 10, 2005
- Location
- Philippines
I'm having a PIA issue. Its a bit complex, with lots of elements of failure, so I will be as thorough as possible here. I just built a new array with WD20EARS hard drives (see thread on that for tech details). Here are the highlights:
Windows 2008 x64
24 WD20EARS
8 drives run off a promise ex8350 (in a PCIe 4x slot)
12 drives run off an Areca ARC-1130 (in a PCIe 16x slot)
4 drives off of an Areca ARC-1110 (in a PCIe 16x slot)
Arrays set to GPT
I tested it for a week, as recommended in that thread and it seemed to be doing ok.
So last week I started the big file copy over. I started with about 4TBs of data and left it over the weekend. It finished. Some directories were showing up as corrupt. I rebooted, the partition came up as raw (in file explorer, it asked do you want to format, checked in device manager and it said it was raw but blue colored). Crap! I just lost about 2TBs of stock video that I was using to test the array with the week before. Oh well, I have it all on discs so ok, I iwll just have to reload them at some point. Sucks but not a big deal.
So I took array down and checked each disk individually again for errors (plugged into mobo sata interface). All disks came up fine. I rebuild the arrays and copied over only a couple hundred gigs of data over to see how it goes. All ok. I setup a 1TB data copy and left it. Checked it before I went to sleep, looked ok. I rebooted the server and all ok. So I setup teh rest of the 3TB to copy. Came back a few days later and same problem . Raw partition.
Now let me get a bit more descriptive with this. I was copying data to just 1 of my arrays the 8 drive Promise one. I am dong this because I have a working 8 drive promise array in operation, so I can test this card with a group of working disks if I have issues and I have troubleshooting flexibility. When I come back to the machine, the partition is still readable via a file explorer. However clicking on some directories brings up a "folder is corrupt" error. Trying to run checkdisk on it, checkdisk says its raw. When I reboot, the entire volume is raw and file explorer asks me if I want to format it.
So at this point, I have ruled out the drives being bad but that's about it. So its time to do the troubleshooting mambo. First up, basic hardware competencies. Ramtest, system stress test all come up fine. Next up, testing the disks/array on a working environment and testing the raid card. I move the array that is currently raw to my working and in operation ex8350. In that setup it also comes up raw. Second, I move the working, in operation, 8 drives (1.5TB Samsung) to the ex8350 in the problem machine. Its working fine. I copy 1.2Tbs of data to it (its all the free space on that array). I check it later and its fine (phew, I was worried about loosing data on that array). I swap the disks back (thank heavens for hot swap bays)
Ok after that I am stuck for what to test, so I copy 2Tbs of data to the 12 drive Areca array. It seems to be ok. 3 reboots and no problems. So I do the remaning 2TB copy (of my original 4TB attempt) and that finished today. Raw partition!
I am totally lost. Summing up here's what I know:
1) 24 drives are fine
2) Promise card is fine.
3) Didn't do exhaustive checks but the mobo, ram, video, etc are fine. Passed a system stress test.
4) Data is a constant. I am using the same about 4 TBs of data to copy over and over again
Please help? Ideas, anything at all. What can I do? Its been weeks now and I can't get my LAN back up, its a terrible thing. I can't get back to work, I can't game, my office is a total disaster area as I started another project assuming that I would just have to wait for this to finish copying over, I can't even enjoy my home theater setup. I'm tearing my hair out in frustration!
Thanks!
Edit: More info. Raid is raid 5. Set write cache to write back, sector size is 4k, stripe size is 128KB. I don't really know more than what google tells me about these settings. I've used most of these settings before. I specifically choose the 4k sector size because of the "advanced format" EARS drives.
First windows format was default file allocation, then I changed it to 64k, and after that I changed between default and 32k and 64k to see iof any of these made a difference. Again, not an expert with any of this, just read google.
Is there anything here I should change?
Windows 2008 x64
24 WD20EARS
8 drives run off a promise ex8350 (in a PCIe 4x slot)
12 drives run off an Areca ARC-1130 (in a PCIe 16x slot)
4 drives off of an Areca ARC-1110 (in a PCIe 16x slot)
Arrays set to GPT
I tested it for a week, as recommended in that thread and it seemed to be doing ok.
So last week I started the big file copy over. I started with about 4TBs of data and left it over the weekend. It finished. Some directories were showing up as corrupt. I rebooted, the partition came up as raw (in file explorer, it asked do you want to format, checked in device manager and it said it was raw but blue colored). Crap! I just lost about 2TBs of stock video that I was using to test the array with the week before. Oh well, I have it all on discs so ok, I iwll just have to reload them at some point. Sucks but not a big deal.
So I took array down and checked each disk individually again for errors (plugged into mobo sata interface). All disks came up fine. I rebuild the arrays and copied over only a couple hundred gigs of data over to see how it goes. All ok. I setup a 1TB data copy and left it. Checked it before I went to sleep, looked ok. I rebooted the server and all ok. So I setup teh rest of the 3TB to copy. Came back a few days later and same problem . Raw partition.
Now let me get a bit more descriptive with this. I was copying data to just 1 of my arrays the 8 drive Promise one. I am dong this because I have a working 8 drive promise array in operation, so I can test this card with a group of working disks if I have issues and I have troubleshooting flexibility. When I come back to the machine, the partition is still readable via a file explorer. However clicking on some directories brings up a "folder is corrupt" error. Trying to run checkdisk on it, checkdisk says its raw. When I reboot, the entire volume is raw and file explorer asks me if I want to format it.
So at this point, I have ruled out the drives being bad but that's about it. So its time to do the troubleshooting mambo. First up, basic hardware competencies. Ramtest, system stress test all come up fine. Next up, testing the disks/array on a working environment and testing the raid card. I move the array that is currently raw to my working and in operation ex8350. In that setup it also comes up raw. Second, I move the working, in operation, 8 drives (1.5TB Samsung) to the ex8350 in the problem machine. Its working fine. I copy 1.2Tbs of data to it (its all the free space on that array). I check it later and its fine (phew, I was worried about loosing data on that array). I swap the disks back (thank heavens for hot swap bays)
Ok after that I am stuck for what to test, so I copy 2Tbs of data to the 12 drive Areca array. It seems to be ok. 3 reboots and no problems. So I do the remaning 2TB copy (of my original 4TB attempt) and that finished today. Raw partition!
I am totally lost. Summing up here's what I know:
1) 24 drives are fine
2) Promise card is fine.
3) Didn't do exhaustive checks but the mobo, ram, video, etc are fine. Passed a system stress test.
4) Data is a constant. I am using the same about 4 TBs of data to copy over and over again
Please help? Ideas, anything at all. What can I do? Its been weeks now and I can't get my LAN back up, its a terrible thing. I can't get back to work, I can't game, my office is a total disaster area as I started another project assuming that I would just have to wait for this to finish copying over, I can't even enjoy my home theater setup. I'm tearing my hair out in frustration!
Thanks!
Edit: More info. Raid is raid 5. Set write cache to write back, sector size is 4k, stripe size is 128KB. I don't really know more than what google tells me about these settings. I've used most of these settings before. I specifically choose the 4k sector size because of the "advanced format" EARS drives.
First windows format was default file allocation, then I changed it to 64k, and after that I changed between default and 32k and 64k to see iof any of these made a difference. Again, not an expert with any of this, just read google.
Is there anything here I should change?
Last edited: