PDA

View Full Version : many many workunits?


TIS
11-16-07, 12:44 AM
check out the number of workunits I turned in today in this screenshot... what is going on? is there a bunch of bad units or something? :confused:

http://74.185.195.244/points.jpg

jonspd
11-16-07, 12:55 AM
I dunno but I was born in mandeville and lived in bogalusa till 11currently working in slidel...

Adak
11-16-07, 01:28 AM
Take a peek at your log file(s) and see what the wu's project #'s are, and what computer is throwing all these EUE's.

It's possible you received a boatload of a bad wu project, but doubtful. If the project #'s are different, then I'd say it was caused by a failure of some component in that computer.

Good luck with your troubleshooting.

NedClocker
11-16-07, 07:44 AM
Did you perchance just add a GPU folder?

dfonda
11-16-07, 08:15 AM
Yeah get that machine reeled in quickly.

ihrsetrdr
11-16-07, 09:43 AM
I know that a GPU folder can spit out a lot of EUEs if left unattended. Some time ago, I had one that pumped out over 2800 in one day. It thought that Stanford has addressed that issue...

NedClocker
11-16-07, 09:59 AM
Maybe they have. I D K. It was just a thought.

TIS
11-16-07, 10:06 AM
ill check the logs of all the ones i have here in the office, but i have a bunch of borgs all over the place too. ill get back with what i find... yeah jonspd, I own professional pc repair here in mandeville on lotus dr. been here for a while now. stop by sometime and we can talk shop eh?

TIS
11-16-07, 10:39 AM
Found the problem child... it was the display computer in my shop. I recently changed the processor to one of those newfangled black edition 5000+'s with the unlocked multiplier. had it clocked up from 2.6 to 3.2. funny, it was folding just fine on the other core but core 1 is apparently much more sensitive. check out the logs. eue's all over the place. i backed it down to 3.0 - still eue, backed it down to 2.8 and finally it didnt eue and started folding properly. so only a 200 mhz overclock :(. the machine seemed to be completely stable at 3.2 - no signs that it was having any problems. :shrug:



07:16:27]
[07:16:27] + Processing work unit
[07:16:27] Core required: FahCore_82.exe
[07:16:27] Core found.
[07:16:27] Working on Unit 05 [November 16 07:16:27]
[07:16:27] + Working ...
[07:16:27]
[07:16:27] *------------------------------*
[07:16:27] Folding@Home PMD Core
[07:16:27] Version 1.03 (September 7, 2005)
[07:16:27]
[07:16:27] Preparing to commence simulation
[07:16:27] - Looking at optimizations...
[07:16:27] - Created dyn
[07:16:27] - Files status OK
[07:16:27]
[07:16:27] Project: 891 (Run 1, Clone 659, Gen 0)
[07:16:27]
[07:16:28] Assembly optimizations on if available.
[07:16:28] Entering M.D.
[07:18:10] Protein: p891_p53wtpeptide_tip3p
[07:18:10]
[07:18:10] Completed 0 out of 500000 steps (0)
[07:18:11] Going to send back what have done.
[07:18:11] logfile size: 6975
[07:18:11] - Writing 7495 bytes of core data to disk...
[07:18:11] Done: 6983 -> 2173 (compressed to 31.1 percent)
[07:18:11] ... Done.
[07:18:11]
[07:18:11] Folding@home Core Shutdown: EARLY_UNIT_END
[07:18:13] CoreStatus = 72 (114)
[07:18:13] Sending work to server


[07:18:13] + Attempting to send results
[07:18:13] + Results successfully sent
[07:18:13] Thank you for your contribution to Folding@Home.
[07:18:17] - Preparing to get new work unit...
[07:18:17] + Attempting to get work packet
[07:18:17] - Connecting to assignment server
[07:18:18] - Successful: assigned to (171.64.122.82).
[07:18:18] + News From Folding@Home: Welcome to Folding@Home
[07:18:18] Loaded queue successfully.
[07:18:21] + Closed connections
[07:18:26]
[07:18:26] + Processing work unit
[07:18:26] Core required: FahCore_81.exe
[07:18:26] Core found.
[07:18:26] Working on Unit 06 [November 16 07:18:26]
[07:18:26] + Working ...
[07:18:26]
[07:18:26] *------------------------------*
[07:18:26] Folding@Home Gromacs Simulated Tempering Core
[07:18:26] Version 1.10 (Oct 4, 2007)
[07:18:26]
[07:18:26] Preparing to commence simulation
[07:18:26] - Looking at optimizations...
[07:18:26] - Created dyn
[07:18:26] - Files status OK
[07:18:26] - Expanded 363718 -> 1797968 (decompressed 494.3 percent)
[07:18:26] - Starting from initial work packet
[07:18:26]
[07:18:26] Project: 3627 (Run 11, Clone 15, Gen 5)
[07:18:26]
[07:18:27] Assembly optimizations on if available.
[07:18:27] Entering M.D.
[07:18:33] Protein: p3627_Seq11_Amber03_Native
[07:18:33]
[07:18:33] Writing local files
[07:19:22] Extra SSE boost OK.
[07:19:23] Writing local files
[07:19:23] Completed 0 out of 1500000 steps (0)
[07:19:30] Quit 101 - Fatal error:
[07:19:30] Step 36, time 0.072 (ps) LINCS WARNING
[07:19:30] relative constraint deviation after LINCS:
[07:19:30] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[07:19:30]
[07:19:30] Simulation instability has been encountered. The run has entered a
[07:19:30] state from which no further progress can be made.
[07:19:30] This may be the correct result of the simulation, however if you
[07:19:30] often see other project units terminating early like this
[07:19:30] too, you may wish to check the stability of your computer (issues
[07:19:30] such as high temperature, overclocking, etc.).
[07:19:30] Going to send back what have done.
[07:19:30] logfile size: 8484
[07:19:30] - Writing 9156 bytes of core data to disk...
[07:19:30] Done: 8644 -> 3262 (compressed to 37.7 percent)
[07:19:30] ... Done.
[07:19:30]
[07:19:30] Folding@home Core Shutdown: EARLY_UNIT_END
[07:19:32] CoreStatus = 72 (114)
[07:19:32] Sending work to server


[07:19:32] + Attempting to send results
[07:19:32] - Error: Could not read results file work/wuresults_06.dat from disk
[07:19:32] - Error: Could not read unit 06 file. Removing from queue.
[07:19:32] - Preparing to get new work unit...
[07:19:32] + Attempting to get work packet
[07:19:32] - Connecting to assignment server
[07:19:32] - Successful: assigned to (171.64.122.82).
[07:19:32] + News From Folding@Home: Welcome to Folding@Home
[07:19:33] Loaded queue successfully.
[07:19:35] + Closed connections
[07:19:40]
[07:19:40] + Processing work unit
[07:19:40] Core required: FahCore_81.exe
[07:19:40] Core found.
[07:19:40] Working on Unit 07 [November 16 07:19:40]
[07:19:40] + Working ...
[07:19:40]
[07:19:40] *------------------------------*
[07:19:40] Folding@Home Gromacs Simulated Tempering Core
[07:19:40] Version 1.10 (Oct 4, 2007)
[07:19:40]
[07:19:40] Preparing to commence simulation
[07:19:40] - Looking at optimizations...
[07:19:40] - Created dyn
[07:19:40] - Files status OK
[07:19:41] - Expanded 363718 -> 1797968 (decompressed 494.3 percent)
[07:19:41] - Starting from initial work packet
[07:19:41]
[07:19:41] Project: 3627 (Run 11, Clone 15, Gen 5)
[07:19:41]
[07:19:41] Assembly optimizations on if available.
[07:19:41] Entering M.D.
[07:19:47] Protein: p3627_Seq11_Amber03_Native
[07:19:47]
[07:19:47] Writing local files
[07:20:41] Extra SSE boost OK.
[07:20:41] Writing local files
[07:20:41] Completed 0 out of 1500000 steps (0)
[07:20:46] Quit 101 - Fatal error:
[07:20:46] Step 39, time 0.078 (ps) LINCS WARNING
[07:20:46] relative constraint deviation after LINCS:
[07:20:46] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[07:20:46]
[07:20:46] Simulation instability has been encountered. The run has entered a
[07:20:46] state from which no further progress can be made.
[07:20:46] This may be the correct result of the simulation, however if you
[07:20:46] often see other project units terminating early like this
[07:20:46] too, you may wish to check the stability of your computer (issues
[07:20:46] such as high temperature, overclocking, etc.).
[07:20:46] Going to send back what have done.
[07:20:46] logfile size: 8484
[07:20:46] - Writing 9156 bytes of core data to disk...
[07:20:46] Done: 8644 -> 3258 (compressed to 37.6 percent)
[07:20:46] ... Done.
[07:20:46]
[07:20:46] Folding@home Core Shutdown: EARLY_UNIT_END
[07:20:48] CoreStatus = 72 (114)
[07:20:48] Sending work to server


[07:20:48] + Attempting to send results
[07:20:48] - Error: Could not read results file work/wuresults_07.dat from disk
[07:20:48] - Error: Could not read unit 07 file. Removing from queue.
[07:20:48] - Preparing to get new work unit...
[07:20:48] + Attempting to get work packet
[07:20:48] - Connecting to assignment server
[07:20:49] - Successful: assigned to (171.64.122.82).
[07:20:49] + News From Folding@Home: Welcome to Folding@Home
[07:20:49] Loaded queue successfully.
[07:20:52] + Closed connections
[07:20:57]
[07:20:57] + Processing work unit
[07:20:57] Core required: FahCore_81.exe
[07:20:57] Core found.
[07:20:57] Working on Unit 08 [November 16 07:20:57]
[07:20:57] + Working ...
[07:20:57]
[07:20:57] *------------------------------*
[07:20:57] Folding@Home Gromacs Simulated Tempering Core
[07:20:57] Version 1.10 (Oct 4, 2007)
[07:20:57]
[07:20:57] Preparing to commence simulation
[07:20:57] - Looking at optimizations...
[07:20:57] - Created dyn
[07:20:57] - Files status OK
[07:20:57] - Expanded 363718 -> 1797968 (decompressed 494.3 percent)
[07:20:57] - Starting from initial work packet
[07:20:57]
[07:20:57] Project: 3627 (Run 11, Clone 15, Gen 5)
[07:20:57]
[07:20:58] Assembly optimizations on if available.
[07:20:58] Entering M.D.
[07:21:04] Protein: p3627_Seq11_Amber03_Native
[07:21:04]
[07:21:04] Writing local files
[07:21:54] Extra SSE boost OK.
[07:21:54] Writing local files
[07:21:54] Completed 0 out of 1500000 steps (0)
[07:22:02] Quit 101 - Fatal error:
[07:22:02] Step 42, time 0.084 (ps) LINCS WARNING
[07:22:02] relative constraint deviation after LINCS:
[07:22:02] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[07:22:02]
[07:22:02] Simulation instability has been encountered. The run has entered a
[07:22:02] state from which no further progress can be made.
[07:22:02] This may be the correct result of the simulation, however if you
[07:22:02] often see other project units terminating early like this
[07:22:02] too, you may wish to check the stability of your computer (issues
[07:22:02] such as high temperature, overclocking, etc.).
[07:22:02] Going to send back what have done.
[07:22:02] logfile size: 8484
[07:22:02] - Writing 9156 bytes of core data to disk...
[07:22:02] Done: 8644 -> 3258 (compressed to 37.6 percent)
[07:22:02] ... Done.
[07:22:02]
[07:22:02] Folding@home Core Shutdown: EARLY_UNIT_END
[07:22:05] CoreStatus = 72 (114)
[07:22:05] Sending work to server


[07:22:05] + Attempting to send results
[07:22:05] - Error: Could not read results file work/wuresults_08.dat from disk
[07:22:05] - Error: Could not read unit 08 file. Removing from queue.
[07:22:05] - Preparing to get new work unit...
[07:22:05] + Attempting to get work packet
[07:22:05] - Connecting to assignment server
[07:22:05] - Successful: assigned to (171.64.122.82).
[07:22:05] + News From Folding@Home: Welcome to Folding@Home
[07:22:05] Loaded queue successfully.
[07:22:08] + Closed connections
[07:22:13]
[07:22:13] + Processing work unit
[07:22:13] Core required: FahCore_81.exe
[07:22:13] Core found.
[07:22:13] Working on Unit 09 [November 16 07:22:13]
[07:22:13] + Working ...
[07:22:13]
[07:22:13] *------------------------------*
[07:22:13] Folding@Home Gromacs Simulated Tempering Core
[07:22:13] Version 1.10 (Oct 4, 2007)
[07:22:13]
[07:22:13] Preparing to commence simulation
[07:22:13] - Looking at optimizations...
[07:22:13] - Created dyn
[07:22:13] - Files status OK
[07:22:14] - Expanded 363718 -> 1797968 (decompressed 494.3 percent)
[07:22:14] - Starting from initial work packet
[07:22:14]
[07:22:14] Project: 3627 (Run 11, Clone 15, Gen 5)
[07:22:14]
[07:22:14] Assembly optimizations on if available.
[07:22:14] Entering M.D.
[07:22:20] Protein: p3627_Seq11_Amber03_Native
[07:22:20]
[07:22:20] Writing local files
[07:23:17] Extra SSE boost OK.
[07:23:17] Writing local files
[07:23:17] Completed 0 out of 1500000 steps (0)
[07:23:17] Quit 101 - Fatal error:
[07:23:17] Step 3, time 0.006 (ps) LINCS WARNING
[07:23:17] relative constraint deviation after LINCS:
[07:23:17] max 0.000000 (between atoms 1 and 2) rms 1.#QNAN0
[07:23:17]
[07:23:17] Simulation instability has been encountered. The run has entered a
[07:23:17] state from which no further progress can be made.
[07:23:17] This may be the correct result of the simulation, however if you
[07:23:17] often see other project units terminating early like this
[07:23:17] too, you may wish to check the stability of your computer (issues
[07:23:17] such as high temperature, overclocking, etc.).
[07:23:17] Going to send back what have done.
[07:23:17] logfile size: 8484
[07:23:17] - Writing 9155 bytes of core data to disk...
[07:23:17] Done: 8643 -> 3257 (compressed to 37.6 percent)
[07:23:17] ... Done.
[07:23:17]
[07:23:17] Folding@home Core Shutdown: EARLY_UNIT_END
[07:23:21] CoreStatus = 72 (114)
[07:23:21] Sending work to server


[07:23:21] + Attempting to send results
[07:23:21] - Error: Could not read results file work/wuresults_09.dat from disk
[07:23:21] - Error: Could not read unit 09 file. Removing from queue.
[07:23:21] - Preparing to get new work unit...
[07:23:21] + Attempting to get work packet
[07:23:21] - Connecting to assignment server
[07:23:22] - Successful: assigned to (171.64.122.72).
[07:23:22] + News From Folding@Home: Welcome to Folding@Home
[07:23:22] Loaded queue successfully.
[07:23:25] + Closed connections
[07:23:30]
[07:23:30] + Processing work unit
[07:23:30] Core required: FahCore_80.exe
[07:23:30] Core found.
[07:23:30] Working on Unit 00 [November 16 07:23:30]
[07:23:30] + Working ...
[07:23:30]
[07:23:30] *------------------------------*
[07:23:30] Folding@Home Gromacs SREM Core
[07:23:30] Version 1.02 (Dec 15, 2006)
[07:23:30]
[07:23:30] Preparing to commence simulation
[07:23:30] - Looking at optimizations...
[07:23:30] - Created dyn
[07:23:30] - Files status OK
[07:23:30] - Expanded 424536 -> 1971821 (decompressed 464.4 percent)
[07:23:30] - Starting from initial work packet
[07:23:30]
[07:23:30] Project: 3675 (Run 96, Clone 18, Gen 1)
[07:23:30]
[07:23:31] Assembly optimizations on if available.
[07:23:31] Entering M.D.
[07:23:37] Protein: p3675_Seq20_Amber03_Native
[07:23:37]
[07:23:37] Writing local files
[07:24:26] Gromacs error.
[07:24:26]
[07:24:26] Folding@home Core Shutdown: UNKNOWN_ERROR
[07:24:30] CoreStatus = 79 (121)
[07:24:30] Client-core communications error: ERROR 0x79
[07:24:30] Deleting current work unit & continuing...
[07:24:50] - Preparing to get new work unit...
[07:24:50] + Attempting to get work packet
[07:24:50] - Connecting to assignment server
[07:24:50] - Successful: assigned to (171.64.122.72).
[07:24:50] + News From Folding@Home: Welcome to Folding@Home
[07:24:50] Loaded queue successfully.
[07:24:54] + Closed connections
[07:24:59]
[07:24:59] + Processing work unit
[07:24:59] Core required: FahCore_80.exe
[07:24:59] Core found.
[07:24:59] Working on Unit 01 [November 16 07:24:59]
[07:24:59] + Working ...
[07:24:59]
[07:24:59] *------------------------------*
[07:24:59] Folding@Home Gromacs SREM Core
[07:24:59] Version 1.02 (Dec 15, 2006)
[07:24:59]
[07:24:59] Preparing to commence simulation
[07:24:59] - Looking at optimizations...
[07:24:59] - Created dyn
[07:24:59] - Files status OK
[07:24:59] - Expanded 424536 -> 1971821 (decompressed 464.4 percent)
[07:24:59] - Starting from initial work packet
[07:24:59]
[07:24:59] Project: 3675 (Run 96, Clone 18, Gen 1)
[07:24:59]
[07:25:00] Assembly optimizations on if available.
[07:25:00] Entering M.D.
[07:25:06] Protein: p3675_Seq20_Amber03_Native
[07:25:06]
[07:25:06] Writing local files
[07:25:55] Gromacs error.
[07:25:55]
[07:25:55] Folding@home Core Shutdown: UNKNOWN_ERROR
[07:25:59] CoreStatus = 79 (121)
[07:25:59] Client-core communications error: ERROR 0x79
[07:25:59] Deleting current work unit & continuing...
[07:26:19] - Preparing to get new work unit...
[07:26:19] + Attempting to get work packet
[07:26:19] - Connecting to assignment server
[07:26:19] - Successful: assigned to (171.64.122.72).
[07:26:19] + News From Folding@Home: Welcome to Folding@Home
[07:26:19] Loaded queue successfully.
[07:26:22] + Closed connections
[07:26:27]
[07:26:27] + Processing work unit
[07:26:27] Core required: FahCore_80.exe
[07:26:27] Core found.
[07:26:27] Working on Unit 02 [November 16 07:26:27]
[07:26:27] + Working ...
[07:26:28]
[07:26:28] *------------------------------*
[07:26:28] Folding@Home Gromacs SREM Core
[07:26:28] Version 1.02 (Dec 15, 2006)
[07:26:28]
[07:26:28] Preparing to commence simulation
[07:26:28] - Looking at optimizations...
[07:26:28] - Created dyn
[07:26:28] - Files status OK
[07:26:28] - Expanded 424536 -> 1971821 (decompressed 464.4 percent)
[07:26:28] - Starting from initial work packet
[07:26:28]
[07:26:28] Project: 3675 (Run 96, Clone 18, Gen 1)
[07:26:28]
[07:26:29] Assembly optimizations on if available.
[07:26:29] Entering M.D.
[07:26:35] Protein: p3675_Seq20_Amber03_Native
[07:26:35]
[07:26:35] Writing local files
[07:27:23] Gromacs error.
[07:27:23]
[07:27:23] Folding@home Core Shutdown: UNKNOWN_ERROR
[07:27:26] CoreStatus = 79 (121)
[07:27:26] Client-core communications error: ERROR 0x79
[07:27:26]
Folding@Home will go to sleep for 1 day as there have been 5 consecutive Cores executed which failed to complete a work unit.
[07:27:26] (To wake it up early, quit the application and restart it.)
[07:27:26] If problems persist, please visit our website at http://folding.stanford.edu for help.
[07:27:26] + Sleeping...

Adak
11-16-07, 02:28 PM
Good job! Always sweet getting a C2D Blacky, back in the fold. :soda: :soda: