gottcha
01-12-10, 09:10 PM
Help what does this mean? I'm in a contest and i can't get any work!:cry::shrug::temper:
ebug122
01-12-10, 09:16 PM
Uninstalling and re-installing the SMP client worked for me.
I dunno if that's the only solution, I was going to re-install it the other day anyway.
gottcha
01-12-10, 09:21 PM
really that suxs. anybody else with a solution? i will have to do something by tonight.
gottcha
01-12-10, 09:43 PM
ok uninstalled and did a reinstall and i get the same thing???????
Some times the servers do run out of prepared WUs, and it might be other problems, like the server may be overloaded.
If you still aren't getting work, you could restart the client about every half hour. It checks several times quickly, then spaces out the requests and waits longer between retries. Restarting the client resets the cycle.
However, if it is because the server is overloaded, you would actually be making it worse by retrying more often. :shrug:
I've got an SMP WU finishing up in the next 10 minutes or so. It is a linux client, so it may be hitting a different server. But I'll let you know if it picks up a new WU.
gottcha
01-12-10, 09:57 PM
Cool thanx for the help. I'll try not to keep restarting the client i don't want to make it worse.
gottcha
01-12-10, 09:59 PM
I tried the restart thing twice and now it downloaded a WU! Thank for the help i guess you were right i think there servers were overloaded...... that means were doing good right!
Jolly-Swagman
01-12-10, 10:21 PM
Bookmark this Link Server Status
http://fah-web.stanford.edu/serverstat.html
No need to reinstall the client.
Just stop the client, and restart it, to reinitialize the clients attempt to connect cycle, like Torin posted.
dylskee
01-13-10, 09:02 AM
I had a -bigadv WU that was 50% done and I woke up this morning to see the ETA was up to 19 days or so. I checked the log file and there seemed to be some type of error but I was running late for work so I didn't have much time to look into it. I shut down the client and restarted it and it deleted the work and started a new WU. Hopefully this one goes well.
THe -bigadv WUs have been rock solid on stable machines. If you're having trouble, turn down the OC. If you drop 2 WUs in 10, you get no bonus at all, so squeezing the last few MHz on the OC does you much less than no good.
Porvalsh
01-13-10, 11:55 AM
I get that error when my internet connection is down. For some reason my wireless adapter hiccups randomly and I have to restart to get it working again. Annoying to say the least.
In this case, the AS reports "no appropriate work server was available". It wasn't a connection problem. The client was actually functioning normally, making the connection, but all the work servers for that particular configuration were busy. Another possibile cause of that message is a configuration error, I don't think that's the case here though. In the old days, if you configured for deadlineless WUs and there weren't any, you got the message. Now days, you'd have to enter less than 64 MB in the memory setting of config.
I get that error when my internet connection is down. For some reason my wireless adapter hiccups randomly and I have to restart to get it working again. Annoying to say the least.
+1.
I get the same identical log if my network connection craps out on me, which happens a lot on the VMplayer :cry:
dylskee
01-13-10, 04:08 PM
THe -bigadv WUs have been rock solid on stable machines. If you're having trouble, turn down the OC. If you drop 2 WUs in 10, you get no bonus at all, so squeezing the last few MHz on the OC does you much less than no good.
I've been folding 24/7 for a month now completely stable @4GHz, I already clocked down from 4.2GHz and that seemed to help. This is the error I saw this morning.
[10:25:22] Project: 2681 (Run 0, Clone 2, Gen 64)
[10:25:22] - Error: Could not get length of results file work/wuresults_00.dat
[10:25:22] - Error: Could not read unit 00 file. Removing from queue.
[10:25:22] Trying to send all finished work units
[10:25:22] + No unsent completed units remaining.
[10:25:22] - Preparing to get new work unit...
Got home form work tonight and found the SMP client working away. But a check of the log showed after it finished a WU this morning, it took 8 hours to get another one. Grrr. :rain:
Attempt to get work failed, and no other work to do. Waiting before retry.
augie1111
01-17-10, 02:59 PM
Got home form work tonight and found the SMP client working away. But a check of the log showed after it finished a WU this morning, it took 8 hours to get another one. Grrr. :rain:
Same here Giz, I restarted the client but no go. I'll restart VMWare Player as I have nothing to lose after 15 retries. :cry:
Hmm, my retry. No idea of what's happening, hope the WU went through. This on top of my 260 getting half points. sigh That's beta far ya!:D
1:02:25] + News From Folding@Home: Welcome to Folding@Home
[21:02:25] Loaded queue successfully.
[21:02:25] Connecting to http://171.67.108.24:8080/
[21:02:26] Posted data.
[21:02:26] Initial: 0000; + Could not connect to Work Server
[21:02:26] - Attempt #5 to get work failed, and no other work to do.
Waiting before retry.
[21:03:48] + Attempting to get work packet
[21:03:48] - Will indicate memory of 979 MB
[21:03:48] - Connecting to assignment server
[21:03:48] Connecting to http://assign.stanford.edu:8080/
[21:06:57] - Couldn't send HTTP request to server
[21:06:57] + Could not connect to Assignment Server
[21:06:57] Connecting to http://assign2.stanford.edu:80/
[21:06:57] Posted data.
[21:06:57] Initial: 40AB; - Successful: assigned to (171.64.65.56).
[21:06:57] + News From Folding@Home: Welcome to Folding@Home
[21:06:57] Loaded queue successfully.
[21:06:57] Connecting to http://171.64.65.56:80/
[21:07:04] Posted data.
[21:07:04] Initial: 0000; - Receiving payload (expected size: 4835967)
[21:07:12] - Downloaded at ~590 kB/s
[21:07:12] - Averaged speed for that direction ~685 kB/s
[21:07:12] + Received work.
[21:07:12] Trying to send all finished work units
[21:07:12] + No unsent completed units remaining.
[21:07:12] + Closed connections
[21:07:17]
[21:07:17] + Processing work unit
[21:07:17] At least 4 processors must be requested.Core required: FahCore_a2.exe
[21:07:17] Core found.
[21:07:17] Working on queue slot 02 [January 17 21:07:17 UTC]
[21:07:17] + Working ...
[21:07:17] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 02 -checkpoint 15 -verbose -lifeline 27700 -version 624'
[21:07:17]
[21:07:17] *------------------------------*
[21:07:17] Folding@Home Gromacs SMP Core
[21:07:17] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[21:07:17]
[21:07:17] Preparing to commence simulation
[21:07:17] - Ensuring status. Please wait.
[21:07:26] - Looking at optimizations...
[21:07:26] - Working with standard loops on this execution.
[21:07:26] - Files status OK
[21:07:28] - Expanded 4835455 -> 23979129 (decompressed 495.9 percent)
[21:07:28] Called DecompressByteArray: compressed_data_size=4835455 data_size=23979129, decompressed_data_size=23979129 diff=0
[21:07:28] - Digital signature verified
[21:07:28]
[21:07:28] Project: 2669 (Run 15, Clone 21, Gen 63)
[21:07:28]
[21:07:28] Entering M.D.
[21:07:34] Using Gromacs checkpoints
NNODES=4, MYRANK=0, HOSTNAME=ubuntu
NODEID=0 argc=23
Reading file work/wudata_02.tpr, VERSION 3.3.99_development_20070618 (single precision)
NNODES=4, MYRANK=1, HOSTNAME=ubuntu
NODEID=1 argc=23
NNODES=4, MYRANK=2, HOSTNAME=ubuntu
NODEID=2 argc=23
NNODES=4, MYRANK=3, HOSTNAME=ubuntu
NODEID=3 argc=23
Note: tpx file_version 48, software version 68
Reading checkpoint file work/wudata_02.cpt generated: Sat Jan 16 08:06:27 2010
-------------------------------------------------------
Program mdrun, VERSION 4.0.99_development_20090605
Source code file: checkpoint.c, line: 1196
Fatal error:
Checkpoint file is for a system of 147237 atoms, while the current system consists of 146706 atoms
For more information and tips for trouble shooting please check the GROMACS Wiki at
http://wiki.gromacs.org/index.php/Errors
-------------------------------------------------------
Thanx for Using GROMACS - Have a Nice Day
Error on node 0, will try to stop all the nodes
Halting parallel program mdrun on CPU 0 out of 4
gcq#0: Thanx for Using GROMACS - Have a Nice Day
application called MPI_Abort(MPI_COMM_WORLD, -1) - process 0
[0]0:Return code = 255
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Quit
[21:07:42] CoreStatus = FF (255)
[21:07:42] Sending work to server
[21:07:42] Project: 2669 (Run 15, Clone 21, Gen 63)
[21:07:42] - Error: Could not get length of results file work/wuresults_02.dat
[21:07:42] - Error: Could not read unit 02 file. Removing from queue.
[21:07:42] Trying to send all finished work units
[21:07:42] + No unsent completed units remaining.
[21:07:42] - Preparing to get new work unit...
[21:07:42] + Attempting to get work packet
[21:07:42] - Will indicate memory of 979 MB
[21:07:42] - Connecting to assignment server
[21:07:42] Connecting to http://assign.stanford.edu:8080/
[21:07:43] Posted data.
[21:07:43] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[21:07:43] + News From Folding@Home: Welcome to Folding@Home
[21:07:43] Loaded queue successfully.
[21:07:43] Connecting to http://171.67.108.24:8080/
[21:07:44] Posted data.
[21:07:44] Initial: 0000; + Could not connect to Work Server
[21:07:44] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[21:08:03] + Attempting to get work packet
[21:08:03] - Will indicate memory of 979 MB
[21:08:03] - Connecting to assignment server
[21:08:03] Connecting to http://assign.stanford.edu:8080/
[21:08:03] Posted data.
[21:08:03] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[21:08:03] + News From Folding@Home: Welcome to Folding@Home
[21:08:03] Loaded queue successfully.
[21:08:03] Connecting to http://171.67.108.24:8080/
[21:08:10] Posted data.
[21:08:10] Initial: 0000; - Receiving payload (expected size: 4836575)
[21:08:16] - Downloaded at ~787 kB/s
[21:08:16] - Averaged speed for that direction ~705 kB/s
[21:08:16] + Received work.
[21:08:16] Trying to send all finished work units
[21:08:16] + No unsent completed units remaining.
[21:08:16] + Closed connections
[21:08:21]
[21:08:21] + Processing work unit
[21:08:21] At least 4 processors must be requested.Core required: FahCore_a2.exe
[21:08:21] Core found.
[21:08:21] Working on queue slot 03 [January 17 21:08:21 UTC]
[21:08:21] + Working ...
[21:08:21] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 03 -checkpoint 15 -verbose -lifeline 27700 -version 624'
[21:08:22]
[21:08:22] *------------------------------*
[21:08:22] Folding@Home Gromacs SMP Core
[21:08:22] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[21:08:22]
[21:08:22] Preparing to commence simulation
[21:08:22] - Ensuring status. Please wait.
[21:08:31] - Looking at optimizations...
[21:08:31] - Working with standard loops on this execution.
[21:08:31] - Files status OK
[21:08:32] - Expanded 4836063 -> 24033949 (decompressed 496.9 percent)
[21:08:32] Called DecompressByteArray: compressed_data_size=4836063 data_size=24033949, decompressed_data_size=24033949 diff=0
[21:08:33] - Digital signature verified
[21:08:33]
[21:08:33] Project: 2671 (Run 32, Clone 76, Gen 191)
[21:08:33]
[21:08:33] Entering M.D.
[21:08:39] Using Gromacs checkpoints
NNODES=4, MYRANK=0, HOSTNAME=ubuntu
NODEID=0 argc=23
NNODES=4, MYRANK=1, HOSTNAME=ubuntu
NODEID=1 argc=23
NNODES=4, MYRANK=2, HOSTNAME=ubuntu
NODEID=2 argc=23
NNODES=4, MYRANK=3, HOSTNAME=ubuntu
NODEID=3 argc=23
Reading file work/wudata_03.tpr, VERSION 3.3.99_development_20070618 (single precision)
Note: tpx file_version 48, software version 68
Reading checkpoint file work/wudata_03.cpt generated: Sat Dec 19 07:42:45 2009
-------------------------------------------------------
Program mdrun, VERSION 4.0.99_development_20090605
Source code file: checkpoint.c, line: 1196
Fatal error:
Checkpoint file is for a system of 148758 atoms, while the current system consists of 147051 atoms
For more information and tips for trouble shooting please check the GROMACS Wiki at
http://wiki.gromacs.org/index.php/Errors
-------------------------------------------------------
Thanx for Using GROMACS - Have a Nice Day
Error on node 0, will try to stop all the nodes
Halting parallel program mdrun on CPU 0 out of 4
gcq#0: Thanx for Using GROMACS - Have a Nice Day
application called MPI_Abort(MPI_COMM_WORLD, -1) - process 0
[0]0:Return code = 255
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Quit
[21:08:46] CoreStatus = FF (255)
[21:08:46] Sending work to server
[21:08:46] Project: 2671 (Run 32, Clone 76, Gen 191)
[21:08:46] - Error: Could not get length of results file work/wuresults_03.dat
[21:08:46] - Error: Could not read unit 03 file. Removing from queue.
[21:08:46] Trying to send all finished work units
[21:08:46] + No unsent completed units remaining.
[21:08:46] - Preparing to get new work unit...
[21:08:46] + Attempting to get work packet
[21:08:46] - Will indicate memory of 979 MB
[21:08:46] - Connecting to assignment server
[21:08:46] Connecting to http://assign.stanford.edu:8080/
[21:08:47] Posted data.
[21:08:47] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[21:08:47] + News From Folding@Home: Welcome to Folding@Home
[21:08:47] Loaded queue successfully.
[21:08:47] Connecting to http://171.67.108.24:8080/
[21:08:53] Posted data.
[21:08:53] Initial: 0000; - Receiving payload (expected size: 4836575)
[21:08:59] - Downloaded at ~787 kB/s
[21:08:59] - Averaged speed for that direction ~721 kB/s
[21:08:59] + Received work.
[21:08:59] Trying to send all finished work units
[21:08:59] + No unsent completed units remaining.
[21:08:59] + Closed connections
[21:09:04]
[21:09:04] + Processing work unit
[21:09:04] At least 4 processors must be requested.Core required: FahCore_a2.exe
[21:09:04] Core found.
[21:09:04] Working on queue slot 04 [January 17 21:09:04 UTC]
[21:09:04] + Working ...
[21:09:04] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 04 -checkpoint 15 -verbose -lifeline 27700 -version 624'
[21:09:04]
[21:09:04] *------------------------------*
[21:09:04] Folding@Home Gromacs SMP Core
[21:09:04] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[21:09:04]
[21:09:04] Preparing to commence simulation
[21:09:04] - Ensuring status. Please wait.
[21:09:14] - Looking at optimizations...
[21:09:14] - Working with standard loops on this execution.
[21:09:14] - Files status OK
[21:09:15] - Expanded 4836063 -> 24033949 (decompressed 496.9 percent)
[21:09:15] Called DecompressByteArray: compressed_data_size=4836063 data_size=24033949, decompressed_data_size=24033949 diff=0
[21:09:15] - Digital signature verified
[21:09:15]
[21:09:15] Project: 2671 (Run 32, Clone 76, Gen 191)
[21:09:15]
[21:09:15] Entering M.D.
[21:09:21] Using Gromacs checkpoints
NNODES=4, MYRANK=0, HOSTNAME=ubuntu
NODEID=0 argc=23
NNODES=4, MYRANK=2, HOSTNAME=ubuntu
NODEID=2 argc=23
NNODES=4, MYRANK=1, HOSTNAME=ubuntu
NODEID=1 argc=23
NNODES=4, MYRANK=3, HOSTNAME=ubuntu
NODEID=3 argc=23
Reading file work/wudata_04.tpr, VERSION 3.3.99_development_20070618 (single precision)
Note: tpx file_version 48, software version 68
Reading checkpoint file work/wudata_04.cpt generated: Tue Dec 15 07:38:32 2009
-------------------------------------------------------
Program mdrun, VERSION 4.0.99_development_20090605
Source code file: checkpoint.c, line: 1196
Fatal error:
Checkpoint file is for a system of 146706 atoms, while the current system consists of 147051 atoms
For more information and tips for trouble shooting please check the GROMACS Wiki at
http://wiki.gromacs.org/index.php/Errors
-------------------------------------------------------
Thanx for Using GROMACS - Have a Nice Day
Error on node 0, will try to stop all the nodes
Halting parallel program mdrun on CPU 0 out of 4
gcq#0: Thanx for Using GROMACS - Have a Nice Day
application called MPI_Abort(MPI_COMM_WORLD, -1) - process 0
[0]0:Return code = 255
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Quit
[21:09:29] CoreStatus = FF (255)
[21:09:29] Sending work to server
[21:09:29] Project: 2671 (Run 32, Clone 76, Gen 191)
[21:09:29] - Error: Could not get length of results file work/wuresults_04.dat
[21:09:29] - Error: Could not read unit 04 file. Removing from queue.
[21:09:29] Trying to send all finished work units
[21:09:29] + No unsent completed units remaining.
[21:09:29] - Preparing to get new work unit...
[21:09:29] + Attempting to get work packet
[21:09:29] - Will indicate memory of 979 MB
[21:09:29] - Connecting to assignment server
[21:09:29] Connecting to http://assign.stanford.edu:8080/
[21:09:29] Posted data.
[21:09:29] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[21:09:29] + News From Folding@Home: Welcome to Folding@Home
[21:09:30] Loaded queue successfully.
[21:09:30] Connecting to http://171.67.108.24:8080/
[21:09:30] Posted data.
[21:09:30] Initial: 0000; - Error: Bad packet type from server, expected work assignment
[21:09:31] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[21:09:50] + Attempting to get work packet
[21:09:50] - Will indicate memory of 979 MB
[21:09:50] - Connecting to assignment server
[21:09:50] Connecting to http://assign.stanford.edu:8080/
[21:09:51] Posted data.
[21:09:51] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[21:09:51] + News From Folding@Home: Welcome to Folding@Home
[21:09:51] Loaded queue successfully.
[21:09:51] Connecting to http://171.67.108.24:8080/
[21:09:58] Posted data.
[21:09:58] Initial: 0000; - Receiving payload (expected size: 4835587)
[21:10:05] - Downloaded at ~674 kB/s
[21:10:05] - Averaged speed for that direction ~712 kB/s
[21:10:05] + Received work.
[21:10:05] Trying to send all finished work units
[21:10:05] + No unsent completed units remaining.
[21:10:05] + Closed connections
[21:10:10]
[21:10:10] + Processing work unit
[21:10:10] At least 4 processors must be requested.Core required: FahCore_a2.exe
[21:10:10] Core found.
[21:10:10] Working on queue slot 05 [January 17 21:10:10 UTC]
[21:10:10] + Working ...
[21:10:10] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 05 -checkpoint 15 -verbose -lifeline 27700 -version 624'
[21:10:10]
[21:10:10] *------------------------------*
[21:10:10] Folding@Home Gromacs SMP Core
[21:10:10] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[21:10:10]
[21:10:10] Preparing to commence simulation
[21:10:10] - Ensuring status. Please wait.
[21:10:20] - Looking at optimizations...
[21:10:20] - Working with standard loops on this execution.
[21:10:20] - Files status OK
[21:10:21] - Expanded 4835075 -> 24033949 (decompressed 497.0 percent)
[21:10:21] Called DecompressByteArray: compressed_data_size=4835075 data_size=24033949, decompressed_data_size=24033949 diff=0
[21:10:21] - Digital signature verified
[21:10:21]
[21:10:21] Project: 2671 (Run 32, Clone 73, Gen 191)
[21:10:21]
[21:10:21] Entering M.D.
[21:10:27] Using Gromacs checkpoints
NNODES=4, MYRANK=1, HOSTNAME=ubuntu
NNODES=4, MYRANK=2, HOSTNAME=ubuntu
NNODES=4, MYRANK=3, HOSTNAME=ubuntu
NNODES=4, MYRANK=0, HOSTNAME=ubuntu
NODEID=0 argc=23
Reading file work/wudata_05.tpr, VERSION 3.3.99_development_20070618 (single precision)
NODEID=1 argc=23
NODEID=2 argc=23
NODEID=3 argc=23
Note: tpx file_version 48, software version 68
Reading checkpoint file work/wudata_05.cpt generated: Wed Dec 23 07:54:11 2009
-------------------------------------------------------
Program mdrun, VERSION 4.0.99_development_20090605
Source code file: checkpoint.c, line: 1196
Fatal error:
Checkpoint file is for a system of 147090 atoms, while the current system consists of 147051 atoms
For more information and tips for trouble shooting please check the GROMACS Wiki at
http://wiki.gromacs.org/index.php/Errors
-------------------------------------------------------
Thanx for Using GROMACS - Have a Nice Day
Error on node 0, will try to stop all the nodes
Halting parallel program mdrun on CPU 0 out of 4
gcq#0: Thanx for Using GROMACS - Have a Nice Day
application called MPI_Abort(MPI_COMM_WORLD, -1) - process 0
[0]0:Return code = 255
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Quit
[21:10:35] CoreStatus = FF (255)
[21:10:35] Sending work to server
[21:10:35] Project: 2671 (Run 32, Clone 73, Gen 191)
[21:10:35] - Error: Could not get length of results file work/wuresults_05.dat
[21:10:35] - Error: Could not read unit 05 file. Removing from queue.
[21:10:35] Trying to send all finished work units
[21:10:35] + No unsent completed units remaining.
[21:10:35] - Preparing to get new work unit...
[21:10:35] + Attempting to get work packet
[21:10:35] - Will indicate memory of 979 MB
[21:10:35] - Connecting to assignment server
[21:10:35] Connecting to http://assign.stanford.edu:8080/
[21:10:35] Posted data.
[21:10:35] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[21:10:35] + News From Folding@Home: Welcome to Folding@Home
[21:10:35] Loaded queue successfully.
[21:10:35] Connecting to http://171.67.108.24:8080/
[21:10:41] Posted data.
[21:10:41] Initial: 0000; - Receiving payload (expected size: 4835587)
[21:10:48] - Downloaded at ~674 kB/s
[21:10:48] - Averaged speed for that direction ~704 kB/s
[21:10:48] + Received work.
[21:10:48] Trying to send all finished work units
[21:10:48] + No unsent completed units remaining.
[21:10:48] + Closed connections
[21:10:53]
[21:10:53] + Processing work unit
[21:10:53] At least 4 processors must be requested.Core required: FahCore_a2.exe
[21:10:53] Core found.
[21:10:53] Working on queue slot 06 [January 17 21:10:53 UTC]
[21:10:53] + Working ...
[21:10:53] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -suffix 06 -checkpoint 15 -verbose -lifeline 27700 -version 624'
[21:10:53]
[21:10:53] *------------------------------*
[21:10:53] Folding@Home Gromacs SMP Core
[21:10:53] Version 2.10 (Sun Aug 30 03:43:28 CEST 2009)
[21:10:53]
[21:10:53] Preparing to commence simulation
[21:10:53] - Ensuring status. Please wait.
[21:11:02] - Looking at optimizations...
[21:11:02] - Working with standard loops on this execution.
[21:11:02] - Files status OK
[21:11:03] - Expanded 4835075 -> 24033949 (decompressed 497.0 percent)
[21:11:03] Called DecompressByteArray: compressed_data_size=4835075 data_size=24033949, decompressed_data_size=24033949 diff=0
[21:11:04] - Digital signature verified
[21:11:04]
[21:11:04] Project: 2671 (Run 32, Clone 73, Gen 191)
[21:11:04]
[21:11:04] Entering M.D.
[21:11:10] Using Gromacs checkpoints
NNODES=4, MYRANK=0, HOSTNAME=ubuntu
NODEID=0 argc=23
NNODES=4, MYRANK=3, HOSTNAME=ubuntu
NNODES=4, MYRANK=2, HOSTNAME=ubuntu
NODEID=2 argc=23
Reading file work/wudata_06.tpr, VERSION 3.3.99_development_20070618 (single precision)
NODEID=3 argc=23
NNODES=4, MYRANK=1, HOSTNAME=ubuntu
NODEID=1 argc=23
Note: tpx file_version 48, software version 68
Reading checkpoint file work/wudata_06.cpt generated: Wed Dec 16 07:51:00 2009
-------------------------------------------------------
Program mdrun, VERSION 4.0.99_development_20090605
Source code file: checkpoint.c, line: 1196
Fatal error:
Checkpoint file is for a system of 147170 atoms, while the current system consists of 147051 atoms
For more information and tips for trouble shooting please check the GROMACS Wiki at
http://wiki.gromacs.org/index.php/Errors
-------------------------------------------------------
Thanx for Using GROMACS - Have a Nice Day
Error on node 0, will try to stop all the nodes
Halting parallel program mdrun on CPU 0 out of 4
gcq#0: Thanx for Using GROMACS - Have a Nice Day
application called MPI_Abort(MPI_COMM_WORLD, -1) - process 0
[0]0:Return code = 255
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Quit
[21:11:18] CoreStatus = FF (255)
[21:11:18] Sending work to server
[21:11:18] Project: 2671 (Run 32, Clone 73, Gen 191)
[21:11:18] - Error: Could not get length of results file work/wuresults_06.dat
[21:11:18] - Error: Could not read unit 06 file. Removing from queue.
[21:11:18] Trying to send all finished work units
[21:11:18] + No unsent completed units remaining.
[21:11:18] - Preparing to get new work unit...
[21:11:18] + Attempting to get work packet
[21:11:18] - Will indicate memory of 979 MB
[21:11:18] - Connecting to assignment server
[21:11:18] Connecting to http://assign.stanford.edu:8080/
[21:11:18] Posted data.
[21:11:18] Initial: 43AB; - Successful: assigned to (171.67.108.24).
[21:11:18] + News From Folding@Home: Welcome to Folding@Home
[21:11:18] Loaded queue successfully.
[21:11:18] Connecting to http://171.67.108.24:8080/
[21:11:19] Posted data.
[21:11:19] Initial: 0000; - Error: Bad packet type from server, expected work assignment
[21:11:19] - Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
Thanks as always good folks.:)
You need to stop the client and delete the Work dir and queue.dat and see if that helps.
vBulletin® v3.8.7, Copyright ©2000-2012, vBulletin Solutions, Inc.