• Welcome to Overclockers Forums! Join us to reply in threads, receive reduced ads, and to customize your site experience!

Help with io error and lost wu's

Overclockers is supported by our readers. When you click a link to make a purchase, we may earn a commission. Learn More.

dfonda

Senior Golfer
Joined
Feb 25, 2004
Location
N of splat W of Torin
HFM shows no failed units in the last 6. Since I restarted the client. I have lost the last 2 8101's but completed a few of the smaller 150K wu credits in between.

Any advice for a fix? re install?

Code:
[07:07:05] + No unsent completed units remaining.
[07:07:05] - Autosend completed
[07:12:07] Completed 245000 out of 250000 steps  (98%)
[07:35:09] Completed 247500 out of 250000 steps  (99%)
[07:58:07] Completed 250000 out of 250000 steps  (100%)

Writing final coordinates.

 Average load imbalance: 14.9 %
 Part of the total run time spent waiting due to load imbalance: 3.5 %


    Parallel run - timing based on wallclock.

               NODE (s)   Real (s)      (%)
       Time: 138398.863 138398.863    100.0
                       1d14h26:38
               (Mnbf/s)   (GFlops)   (ns/day)  (hour/ns)
Performance:    785.408     41.460      0.624     38.444

Thanx for Using GROMACS - Have a Nice Day

[07:58:21] DynamicWrapper: Finished Work Unit: sleep=10000
[07:58:31] 
[07:58:31] Finished Work Unit:
[07:58:31] - Reading up to 64340496 from "work/wudata_07.trr": Read 64340496
[07:58:31] trr file hash check passed.
[07:58:31] - Reading up to 31618128 from "work/wudata_07.xtc": Read 31618128
[07:58:31] - Checksum of file (work/wudata_07.xtc) read from disk doesn't match
[07:58:31] 
[COLOR=Red][07:58:31] Folding@home Core Shutdown: FILE_IO_ERROR
[07:58:32] CoreStatus = 75 (117)
[07:58:32] Error opening or reading from a file.
[07:58:32] Deleting current work unit & continuing...[/COLOR]
[07:58:44] Trying to send all finished work units
[07:58:44] + No unsent completed units remaining.
[07:58:44] - Preparing to get new work unit...
[07:58:44] Cleaning up work directory
[07:58:44] + Attempting to get work packet
[07:58:44] Passkey found
[07:58:44] - Will indicate memory of 32233 MB
[07:58:44] - Connecting to assignment server
[07:58:44] Connecting to http://assign.stanford.edu:8080/
[07:58:45] Posted data.
[07:58:45] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[07:58:45] + News From Folding@Home: Welcome to Folding@Home
[07:58:45] Loaded queue successfully.
[07:58:45] Sent data
[07:58:45] Connecting to http://128.143.231.201:8080/
[07:58:51] Posted data.
[07:58:51] Initial: 0000; - Receiving payload (expected size: 30307401)
[07:58:57] - Downloaded at ~4932 kB/s
[07:58:57] - Averaged speed for that direction ~4463 kB/s
[07:58:57] + Received work.
[07:58:57] + Closed connections
[07:59:02] 
[07:59:02] + Processing work unit
[07:59:02] Core required: FahCore_a5.exe
[07:59:02] Core found.
[07:59:02] Working on queue slot 08 [February 12 07:59:02 UTC]
[07:59:02] + Working ...
[07:59:02] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 08 -np 32 -checkpoint 30 -verbose -lifeline 2038 -version 634'

[07:59:02] 
[07:59:02] *------------------------------*
[07:59:02] Folding@Home Gromacs SMP Core
[07:59:02] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[07:59:02] 
[07:59:02] Preparing to commence simulation
[07:59:02] - Looking at optimizations...
[07:59:02] - Created dyn
[07:59:02] - Files status OK
[07:59:06] - Expanded 30306889 -> 33158020 (decompressed 109.4 percent)
[07:59:06] Called DecompressByteArray: compressed_data_size=30306889 data_size=33158020, decompressed_data_size=33158020 diff=0
[07:59:07] - Digital signature verified
[07:59:07] 
[07:59:07] Project: 8101 (Run 26, Clone 1, Gen 131)
[07:59:07] 
[07:59:07] Assembly optimizations on if available.
[07:59:07] Entering M.D.
                         :-)  G  R  O  M  A  C  S  (-:

                   Groningen Machine for Chemical Simulation

                            :-)  VERSION 4.5.3  (-:

        Written by Emile Apol, Rossen Apostolov, Herman J.C. Berendsen,
      Aldert van Buuren, Pär Bjelkmar, Rudi van Drunen, Anton Feenstra, 
        Gerrit Groenhof, Peter Kasson, Per Larsson, Pieter Meulenhoff, 
           Teemu Murtola, Szilard Pall, Sander Pronk, Roland Schulz, 
                Michael Shirts, Alfons Sijbers, Peter Tieleman,

               Berk Hess, David van der Spoel, and Erik Lindahl.

       Copyright (c) 1991-2000, University of Groningen, The Netherlands.
            Copyright (c) 2001-2010, The GROMACS development team at
        Uppsala University & The Royal Institute of Technology, Sweden.
            check out http://www.gromacs.org for more information.


                               :-)  Gromacs  (-:

Reading file work/wudata_08.tpr, VERSION 4.5.5-dev-20120903-d64b9e3 (single precision)
[07:59:14] Mapping NT from 32 to 32 
Starting 32 threads
Making 2D domain decomposition 8 x 4 x 1
starting mdrun 'FP_membrane in water'
33000000 steps, 132000.0 ps (continuing from step 32750000, 131000.0 ps).
[07:59:20] Completed 0 out of 250000 steps  (0%)
[08:22:13] Completed 2500 out of 250000 steps  (1%)
[08:45:09] Completed 5000 out of 250000 steps  (2%)
[09:08:09] Completed 7500 out of 250000 steps  (3%)
[09:31:10] Completed 10000 out of 250000 steps  (4%)
[09:54:02] Completed 12500 out of 250000 steps  (5%)
[10:16:56] Completed 15000 out of 250000 steps  (6%)
[10:39:50] Completed 17500 out of 250000 steps  (7%)
[11:02:45] Completed 20000 out of 250000 steps  (8%)
[11:25:35] Completed 22500 out of 250000 steps  (9%)
[11:48:29] Completed 25000 out of 250000 steps  (10%)
[12:11:22] Completed 27500 out of 250000 steps  (11%)
[12:34:17] Completed 30000 out of 250000 steps  (12%)
[12:57:11] Completed 32500 out of 250000 steps  (13%)
[13:07:05] - Autosending finished units... [February 12 13:07:05 UTC]
[13:07:05] Trying to send all finished work units
[13:07:05] + No unsent completed units remaining.
[13:07:05] - Autosend completed
[13:20:05] Completed 35000 out of 250000 steps  (14%)
[13:42:59] Completed 37500 out of 250000 steps  (15%)
[14:05:54] Completed 40000 out of 250000 steps  (16%)
[14:28:46] Completed 42500 out of 250000 steps  (17%)
[14:51:39] Completed 45000 out of 250000 steps  (18%)
[15:14:34] Completed 47500 out of 250000 steps  (19%)
[15:37:28] Completed 50000 out of 250000 steps  (20%)
[16:00:22] Completed 52500 out of 250000 steps  (21%)
[16:23:10] Completed 55000 out of 250000 steps  (22%)
 
FAH Wiki says:
These errors indicate an I/O hardware problem or perhaps an AV program preventing FAH from writing/reading certain work files.

If you have changed your anti-virus program (or it's settings), recently then disable it or reset the settings, and re-try.

If not, then the HD is the next suspect to swap out and see if that solves it.
 
No av on a 4P. I'll one unit and shut down till I get a new SSD.

That will be it for me Team. I may be down for a while.

I'll check with H and see if that can be shown as a fail in HFM.
 
No av on a 4P. I'll one unit and shut down till I get a new SSD.

That will be it for me Team. I may be down for a while.

I'll check with H and see if that can be shown as a fail in HFM.

Right there is your problem, get a HDD. It most likely reached the max number of times you can record to an SSD. Using a SSD for boot disk IMO is a bad idea.
 
Thanks but I have a spare ssd that's folded 4 years. I'll stick with them . I have just as many 1 year failures with HD's. Plus its premature to think its the SSD at all. This error covers a lot of possibilities.
 
Back