new tinkers and faulty work units.

FizzledFiend

Member
Joined
Jun 18, 2001
Location
Winston Salem NC
Since I added the advmethods flag to the new 5.0 console I have had this weird problem. When I'm working on one of these new fatboy Tinkers and I reboot Windows, the client starts back up, reports a faulty work unit, and downloads a brand new one. Is anybody else seeing this? Surely this isn't normal? I have absolutely no problems with ANYTHING else, so I don't want to suspect my OC, which is wicked crazy.
 
I've lost two so far, but it's on a machine that I just put together this week, so stability issues are a definite possibility. This is the only folder running FAH 5.00, so I can't make a comparison. I use the verbosity flag and it writes a book when the WU fails. I can say, though, that my problems occurred after sending a Gromacs unit and receiving a new one. I had a spare Supertinker so I switched over, and that's what I'm folding now. I was finally able to get a new Gromacs unit downloaded but turned it off so that the Tinker gets the whole processor.
 
--- Opening Log file [August 16 07:31:19]


# Windows Console Edition #####################################################
###############################################################################

Folding@Home Client Version 5.00

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\Program Files\FAH
Executable: C:\Program Files\FAH\FAH4Console.exe
Arguments: -verbosity 9

[07:31:19] - Ask before connecting: No
[07:31:19] - Use IE connection settings: Yes
[07:31:19] - User name: FizzeldFiend (Team 32)
[07:31:19] - User ID: 43BE954A2D897468
[07:31:19] - Machine ID: 1
[07:31:19]
[07:31:19] Loaded queue successfully.
[07:31:19] + Benchmarking ...
[07:31:22] The benchmark result is 9116
[07:31:22]
[07:31:22] + Processing work unit
[07:31:22] Core required: FahCore_65.exe
[07:31:22] Core found.
[07:31:22] - Autosending finished units...
[07:31:22] Trying to send all finished work units
[07:31:22] + No unsent completed units remaining.
[07:31:22] - Autosend completed
[07:31:22] Working on Unit 00 [August 16 07:31:22]
[07:31:22] + Working ...
[07:31:22] - Calling 'FahCore_65.exe -dir work/ -suffix 00 -checkpoint 15 -verbose -lifeline 1544 -version 500'

[07:31:23] Folding@Home Client Core Version 2.53 (June 29, 2004)
[07:31:23]
[07:31:23] Proj: work/wudata_00
[07:31:23] Done: 15905 -> 124800 (decompressed 784.6 percent)
[07:31:23] nsteps: 10000000 dt: 2.000000 dt_dump: 1000.000000 temperature: 296.000000
[07:31:23] xyzfile:
[07:31:23] " 113 p1103_Kiefhaber_4
[07:31:23] 1 CA -182.043695 -98.070312 -55.504606 214 ..."
[07:31:23] keyfile:
[07:31:23] "parameters ./Kiefhaber.prm
[07:31:23] NOVERSION
[07:31:23] ARCHIVE
[07:31:23]
[07:31:23] cutoff 16.0
[07:31:23] taper 1..."
[07:31:23]
[07:31:23] Hashes matched on file work/wudata_00.dyn
[07:31:23] ARC file integrity verified
[07:31:23] Restarting from checkpointed files.
[07:31:23]
[07:31:23] Protein: p1103_Kiefhaber_4
[07:31:23] - Run: 14 (Clone 21, Gen 11)
[07:31:23] - Frames Completed: 2, Remaining: 198
[07:31:23] - Dynamic steps required: 9900000
[07:31:23]
[07:31:23] Writing local files:
[07:31:23]
[07:31:23] parameters work/wudata_00.prm
[07:31:23] - Writing "work/wudata_00.key": (overwrite) successful.
[07:31:23] - Writing "work/wudata_00.xyz": (overwrite) successful.
[07:31:23] - Writing "work/wudata_00.prm": (overwrite) successful.
[07:31:23] - Writing "work/wudata_00.key": (append) successful.
[07:31:24]
[07:31:24] PROJECT="work/wudata_00", NSTEPS=9900000, DT=2.0000, DTDUMP=100.000000, TEMP=296.00
[07:31:24] TINKER: Software Tools for Molecular Design
[07:31:24] Version 3.8 October 2000
[07:31:24] Copyright (c) Jay William Ponder 1990-2000
[07:31:24] portions Copyright (c) Michael Shirts 2001
[07:31:24] portions Copyright (c) Vijay S Pande 2001
[07:31:25]
[07:31:25] Received faulty work unit.
[07:31:35] logfile size: 139264
[07:31:35] - Writing 139776 bytes of core data to disk.
[07:31:35] Done: 139264 -> 11555 (compressed to 8.2 percent)
[07:31:35] end (WriteWorkResults)
[07:31:35]
[07:31:35] Folding@home Core Shutdown: BAD_WORK_UNIT
[07:31:39] CoreStatus = 72 (114)
[07:31:39] Sending work to server


[07:31:39] + Attempting to send results
[07:31:39] - Reading file work/wuresults_00.dat from core
[07:31:39] (Read 12067 bytes from disk)
[07:31:40] - Uploaded at ~12 kB/s
[07:31:40] - Averaged speed for that direction ~11 kB/s
[07:31:40] + Results successfully sent
[07:31:40] Thank you for your contribution to Folding@Home.
[07:31:44] Trying to send all finished work units
[07:31:44] + No unsent completed units remaining.
[07:31:44] - Preparing to get new work unit...
[07:31:44] + Attempting to get work packet
[07:31:44] - Will indicate memory of 511 MB.
[07:31:44] - Connecting to assignment server
[07:31:44] - Successful: assigned to (171.64.122.112).
[07:31:44] + News From Folding@Home: Welcome to Folding@Home
[07:31:44] Loaded queue successfully.
[07:31:45] - Deadline time not received.
[07:31:45] - Receiving payload (expected size: 15861)
[07:31:45] Conversation time very short, giving reduced weight in bandwidth avg
[07:31:45] - Downloaded at ~30 kB/s
[07:31:45] - Averaged speed for that direction ~27 kB/s
[07:31:45] + Received work.
[07:31:45] Trying to send all finished work units
[07:31:45] + No unsent completed units remaining.
[07:31:45] + Closed connections
[07:31:50]
[07:31:50] + Processing work unit
[07:31:50] Core required: FahCore_65.exe
[07:31:50] Core found.
[07:31:50] Working on Unit 01 [August 16 07:31:50]
[07:31:50] + Working ...
[07:31:50] - Calling 'FahCore_65.exe -dir work/ -suffix 01 -checkpoint 15 -verbose -lifeline 1544 -version 500'

[07:31:50] Folding@Home Client Core Version 2.53 (June 29, 2004)
[07:31:50]
[07:31:50] Proj: work/wudata_01
[07:31:50] Done: 15349 -> 123084 (decompressed 801.9 percent)
[07:31:50] nsteps: 10000000 dt: 2.000000 dt_dump: 1000.000000 temperature: 296.000000
[07:31:50] xyzfile:
[07:31:50] " 87 p1101_Kiefhaber_2
[07:31:50] 1 CA -35.537746 146.961058 18.785178 214 ..."
[07:31:50] keyfile:
[07:31:50] "parameters ./Kiefhaber.prm
[07:31:50] NOVERSION
[07:31:50] ARCHIVE
[07:31:50]
[07:31:50] cutoff 16.0
[07:31:50] taper 1..."
[07:31:50]
[07:31:50] - Couldn't get size info for dyn file: work/wudata_01.dyn
[07:31:50] Starting from initial work packet
[07:31:50]
[07:31:50] Protein: p1101_Kiefhaber_2
[07:31:50] - Run: 6 (Clone 20, Gen 16)
[07:31:50] - Frames Completed: 0, Remaining: 200
[07:31:50] - Dynamic steps required: 10000000
[07:31:50]
[07:31:50] Writing local files:
[07:31:50]
[07:31:50] parameters work/wudata_01.prm
[07:31:50] - Writing "work/wudata_01.key": (overwrite) successful.
[07:31:51] - Writing "work/wudata_01.xyz": (overwrite) successful.
[07:31:51] - Writing "work/wudata_01.prm": (overwrite) successful.
[07:31:51] - Writing "work/wudata_01.key": (append) successful.
[07:31:51]
[07:31:51] PROJECT="work/wudata_01", NSTEPS=10000000, DT=2.0000, DTDUMP=100.000000, TEMP=296.00
[07:31:52] TINKER: Software Tools for Molecular Design
[07:31:52] Version 3.8 October 2000
[07:31:52] Copyright (c) Jay William Ponder 1990-2000
[07:31:52] portions Copyright (c) Michael Shirts 2001
[07:31:52] portions Copyright (c) Vijay S Pande 2001
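
For anyone who wants to check how often this happens in their own log, here is a rough Python sketch that pulls out every core shutdown line along with the unit it belonged to. It only relies on the two line formats visible in the log above ("Working on Unit NN" and "Folding@home Core Shutdown: STATUS"). The function name find_shutdowns and the sample list are made up for illustration, and other client versions may word these lines differently.

```python
import re

# Line formats taken from the log above; other client versions may differ.
UNIT_RE = re.compile(r"^\[\d{2}:\d{2}:\d{2}\] Working on Unit (\d+)")
SHUTDOWN_RE = re.compile(
    r"^\[(\d{2}:\d{2}:\d{2})\] Folding@home Core Shutdown: (\w+)"
)


def find_shutdowns(lines):
    """Return a list of (unit, time, status) for every core shutdown line.

    `unit` is the queue slot from the most recent "Working on Unit" line,
    or None if no such line was seen before the shutdown.
    """
    results = []
    unit = None
    for line in lines:
        line = line.strip()
        m = UNIT_RE.match(line)
        if m:
            unit = m.group(1)
            continue
        m = SHUTDOWN_RE.match(line)
        if m:
            results.append((unit, m.group(1), m.group(2)))
    return results


# A few lines copied from the log above, used as sample input.
sample = [
    "[07:31:22] Working on Unit 00 [August 16 07:31:22]",
    "[07:31:25] Received faulty work unit.",
    "[07:31:35] Folding@home Core Shutdown: BAD_WORK_UNIT",
    "[07:31:50] Working on Unit 01 [August 16 07:31:50]",
]

for unit, time, status in find_shutdowns(sample):
    print(f"unit {unit} shut down at {time} with status {status}")
```

To run it against a real log, pass an open file instead of the sample list, for example find_shutdowns(open(path)) where path points at the client's log file in the launch directory (the exact file name is an assumption you would need to check on your own install).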
 
Not exactly the same, but on my plexi farm the server and all clients have been having File-IO-Errors all day with proteins 724, 1024, 731... a lot of checksum errors. And of course the work completed so far was deleted, so nothing was sent. This is on my Knoppix LTSP farm. Sucks going 24 hours and not having good data.

No wonder I'm having a crappy day!

Paps
 
I went back to 4.0 because I had 3 errors across all 3 of my 5.0 systems, and since going back to 4.0 I haven't had one error.
 