• Welcome to Overclockers Forums! Join us to reply in threads, receive reduced ads, and to customize your site experience!

Not getting new WU?

Overclockers is supported by our readers. When you click a link to make a purchase, we may earn a commission. Learn More.

deeppow

Senior Member
Joined
May 10, 2002
Location
Los Alamos, NM
Just noted that my server indicates it has finished its current WU but doesn't seem to have gotten a new WU. Do I have something setup wrong? :beer:
 
Most likely you've done nothing wrong... if running the SMP client it can take a decent amount of time for the WU to finalize, then connect to Stanford, send the WU, and retrieve a new one.

One way or another if you post up your log file (in code tags) we'll be able to tell you what's happening.
 
When things are working correctly, it doesn't take long to get a new WU:

Code:
[23:46:31] Completed 500000 out of 500000 steps  (100 percent)
[23:46:31] Writing final coordinates.
[23:46:31] Past main M.D. loop
[23:46:31] Will end MPI now
[23:47:31] 
[23:47:31] Finished Work Unit:
[23:47:31] - Reading up to 3723552 from "work/wudata_07.arc": Read 3723552
[23:47:32] - Reading up to 1782860 from "work/wudata_07.xtc": Read 1782860
[23:47:32] goefile size: 0
[23:47:32] logfile size: 19514
[23:47:32] Leaving Run
[23:47:34] - Writing 5530326 bytes of core data to disk...
[23:47:34]   ... Done.
[23:47:34] - Shutting down core
[23:47:34] 
[23:47:34] Folding@home Core Shutdown: FINISHED_UNIT
[23:47:40] CoreStatus = 64 (100)
[23:47:40] Unit 7 finished with 80 percent of time to deadline remaining.
[23:47:40] Updated performance fraction: 0.801556
[23:47:40] Sending work to server


[23:47:40] + Attempting to send results
[23:47:40] - Reading file work/wuresults_07.dat from core
[23:47:40]   (Read 5530326 bytes from disk)
[23:47:40] Connecting to http://171.64.65.56:8080/
[23:50:11] Posted data.
[23:50:11] Initial: 0000; - Uploaded at ~35 kB/s
[23:50:13] - Averaged speed for that direction ~47 kB/s
[23:50:13] + Results successfully sent
[23:50:13] Thank you for your contribution to Folding@Home.
[23:50:13] + Number of Units Completed: 406

[23:54:17] - Warning: Could not delete all work unit files (7): Core returned invalid code
[23:54:17] Trying to send all finished work units
[23:54:17] + No unsent completed units remaining.
[23:54:17] - Preparing to get new work unit...
[23:54:17] + Attempting to get work packet
[23:54:17] - Will indicate memory of 1013 MB
[23:54:17] - Connecting to assignment server
[23:54:17] Connecting to http://assign.stanford.edu:8080/
[23:54:17] Posted data.
[23:54:17] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[23:54:17] + News From Folding@Home: Welcome to Folding@Home
[23:54:17] Loaded queue successfully.
[23:54:17] Connecting to http://171.64.65.56:8080/
[23:54:20] Posted data.
[23:54:20] Initial: 0000; - Receiving payload (expected size: 2433664)
[23:54:24] - Downloaded at ~594 kB/s
[23:54:24] - Averaged speed for that direction ~549 kB/s
[23:54:24] + Received work.
[23:54:24] Trying to send all finished work units
[23:54:24] + No unsent completed units remaining.
[23:54:24] + Closed connections
[23:54:24] 
[23:54:24] + Processing work unit
[23:54:24] Core required: FahCore_a1.exe
[23:54:24] Core found.
[23:54:24] Working on Unit 08 [January 5 23:54:24]
[23:54:24] + Working ...
[23:54:24] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 08 -priority 96 -checkpoint 30 -forceasm -verbose -lifeline 706 -version 600'

[23:54:24] 
[23:54:24] *------------------------------*
[23:54:24] Folding@Home Gromacs SMP Core
[23:54:24] Version 1.74 (November 27, 2006)
[23:54:24] 
[23:54:24] Preparing to commence simulation
[23:54:24] - Ensuring status. Please wait.
[23:54:41] - Assembly optimizations manually forced on.
[23:54:41] - Not checking prior termination.
[23:54:42] - Expanded 2433152 -> 12886013 (decompressed 529.6 percent)
[23:54:42] - Starting from initial work packet
[23:54:42] 
[23:54:42] Project: 2605 (Run 17, Clone 345, Gen 16)
[23:54:42] 
[23:54:43] Assembly optimizations on if available.
[23:54:43] Entering M.D.
[23:54:48] Rejecting checkpoint
[23:54:49] Protein: Protein in POPC
[23:54:49] Writing local files
[23:54:50] Extra SSE boost OK.
[23:54:50] Writing local files
[23:54:50] Completed 0 out of 500000 steps  (0 percent)

00:08:19 from 100% of the first WU to the 0% on the second.
 
Last edited:
Ehh... about 8 minutes. But when the WU servers aren't cooperating it can take longer. :D

mpi is probably hung... I would love to hear the correct procedure for dealing with this situation. It's only happened to me a handful of times and every time I've tried (on Linux or Windows) has resulted in the work being lost.

ChasR? - I know you knows what to do. :D
 
Can I add the verbosity=9 in my client.cfg file to gen detailed log file? I start SMP fah as a service.
 
Back