• Welcome to Overclockers Forums! Join us to reply in threads, receive reduced ads, and to customize your site experience!

Couple problems

Overclockers is supported by our readers. When you click a link to make a purchase, we may earn a commission. Learn More.

Psykoikonov

Member
Joined
Dec 16, 2003
Location
ON, CA
I'm running an a v7 SMP client using Win7 x64->VMWare->Ubuntu 10.10 and receive the following errors in the logs from time to time. The second problem is one missed QRB :(, I think because the correct work server went down and work unit was received from another (best guess). Any insight greatly appreciated.

12:39:33:SocketDevice::read(): End of stream
12:39:33:Server connection id=2 on 0.0.0.0:36330 from 127.0.0.1
12:39:33:Started thread 12 on PID 1420
12:39:33:Server connection id=1 ended
12:40:59:SocketDevice::write(): Send error: 32: Broken pipe
12:40:59:SocketDevice::write(): Send error: 32: Broken pipe
12:40:59:Server connection id=3 on 0.0.0.0:36330 from 127.0.0.1
12:40:59:Started thread 13 on PID 1420
12:40:59:SocketDevice::write(): Send error: 32: Broken pipe
12:40:59:Server connection id=2 ended
12:47:45:SocketDevice::read(): End of stream
12:47:45:Server connection id=4 on 0.0.0.0:36330 from 127.0.0.1
12:47:45:Started thread 14 on PID 1420
12:47:45:Server connection id=3 ended
12:50:45:Unit 00:Completed 300000 out of 1500000 steps (20%)
13:03:58:Unit 00:Completed 315000 out of 1500000 steps (21%)
13:07:55:SocketDevice::read(): End of stream
13:07:56:Server connection id=5 on 0.0.0.0:36330 from 127.0.0.1
13:07:56:Started thread 15 on PID 1420
13:07:56:Server connection id=4 ended
13:09:23:SocketDevice::write(): Send error: 32: Broken pipe
13:09:23:SocketDevice::write(): Send error: 32: Broken pipe
13:09:23:Server connection id=6 on 0.0.0.0:36330 from 127.0.0.1
13:09:23:Started thread 16 on PID 1420
13:09:24:SocketDevice::write(): Send error: 32: Broken pipe
13:09:24:Server connection id=5 ended

15:04:12:Unit 01:Completed 500000 out of 500000 steps (100%)
15:04:13:Unit 01:DynamicWrapper: Finished Work Unit: sleep=10000
15:04:13:Connecting to assign3.stanford.edu:8080
15:04:13:News: Welcome to Folding@Home
15:04:13:Assigned to work server 171.64.65.99
15:04:13:Requesting new work unit for slot 00: RUNNING smp:8 from 171.64.65.99
15:04:13:Connecting to 171.64.65.99:8080
15:04:14:Slot 00: Downloading 1.96MiB
15:04:20:Slot 00: 92.49%
15:04:20:Slot 00: Download complete
15:04:20:Received Unit: id:00 state:DOWNLOAD error:OK project:7808 run:7 clone:245 gen:20 core:0xa4 unit:0x0000002e0a3b1e874e30ff15a3d727f2
15:04:23:Unit 01:
15:04:23:Unit 01:Finished Work Unit:
15:04:23:Unit 01:- Reading up to 24721224 from "01/wudata_01.trr": Read 24721224
15:04:23:Unit 01:trr file hash check passed.
15:04:23:Unit 01:edr file hash check passed.
15:04:23:Unit 01:logfile size: 31479
15:04:23:Unit 01:Leaving Run
15:04:24:Unit 01:- Writing 24760059 bytes of core data to disk...
15:04:28:Unit 01:Done: 24759547 -> 19666129 (compressed to 79.4 percent)
15:04:28:Unit 01: ... Done.
15:04:33:Unit 01:- Shutting down core
15:04:33:Unit 01:
15:04:33:Unit 01:Folding@home Core Shutdown: FINISHED_UNIT
15:04:33:FahCore, running Unit 01, returned: FINISHED_UNIT (100 = 0x64)
15:04:33:Sending unit results: id:01 state:SEND error:OK project:7905 run:69 clone:43 gen:1 core:0xa4 unit:0x0000000100ac9c234ecffbb9eef4af88
15:04:33:Unit 01: Uploading 18.76MiB to 128.113.12.163
15:04:33:Connecting to 128.113.12.163:8080
15:04:33:Starting Unit 00
15:04:33:Running core: /var/lib/fahclient/cores/www.stanford.edu/~pande/Linux/AMD64/Core_a4.fah/FahCore_a4 -dir 00 -suffix 01 -lifeline 1515 -version 701 -checkpoint 3 -np 8
15:04:33:Started core on PID 9309
15:04:33:Started thread 14 on PID 1515
15:04:33:FahCore 0xa4 started
15:04:34:Unit 00:
15:04:34:Unit 00:*------------------------------*
15:04:34:Unit 00:Folding@Home Gromacs GB Core
15:04:34:Unit 00:Version 2.27 (Dec. 15, 2010)
15:04:34:Unit 00:
15:04:34:Unit 00:preparing to commence simulation
15:04:34:Unit 00:- Looking at optimizations...
15:04:34:Unit 00:- Created dyn
15:04:34:Unit 00:- Files status OK
15:04:34:Unit 00:- Expanded 2054309 -> 5365960 (decompressed 261.2 percent)
15:04:34:Unit 00:Called DecompressByteArray: compressed_data_size=2054309 data_size=5365960, decompressed_data_size=5365960 diff=0
15:04:34:Unit 00:- Digital signature verified
15:04:34:Unit 00:
15:04:34:Unit 00:project: 7808 (Run 7, Clone 245, Gen 20)
15:04:34:Unit 00:
15:04:34:Unit 00:Assembly optimizations on if available.
15:04:34:Unit 00:Entering M.D.
15:04:39:Unit 01: 2.56%
15:04:41:Unit 00:Completed 0 out of 1500000 steps (0%)
15:04:45:Unit 01: 4.48%
15:04:51:Unit 01: 6.39%
15:04:57:Unit 01: 8.33%
15:05:03:Unit 01: 10.25%
15:05:09:Unit 01: 12.23%
15:05:15:Unit 01: 14.14%
15:05:48:Unit 01: 15.22%
15:06:18:Unit 01: 15.25%
15:06:48:Unit 01: 15.27%
15:07:18:Unit 01: 15.29%
15:07:48:Unit 01: 15.31%
15:08:18:Unit 01: 15.31%
15:08:18:SocketDevice::write(): Send of size 4096 failed. Sent 3560 bytes
15:08:48:Unit 01: 15.31%
15:08:48:SocketDevice::write(): Send of size 4096 failed. Sent 0 bytes
15:09:18:SocketDevice::write(): Send of size 4096 failed. Sent 0 bytes
15:09:18:SocketDevice::write(): Socket not open
[93m15:09:18:WARNING: Exception: Failed to send results to work server: Upload failed[0m
15:09:18:Trying to send results to collection server
15:09:18:Unit 01: Uploading 18.76MiB to 129.74.85.16
15:09:18:Connecting to 129.74.85.16:8080
15:09:24:Unit 01: 2.56%
15:09:30:Unit 01: 4.50%
15:09:36:Unit 01: 6.37%
15:09:42:Unit 01: 8.41%
15:09:48:Unit 01: 10.37%
15:09:54:Unit 01: 12.27%
15:10:00:Unit 01: 14.22%
15:10:06:Unit 01: 16.16%
15:10:12:Unit 01: 18.12%
15:10:18:Unit 01: 20.06%
15:10:24:Unit 01: 21.99%
15:10:30:Unit 01: 23.95%
15:10:36:Unit 01: 25.87%
15:10:42:Unit 01: 27.80%
15:10:48:Unit 01: 29.74%
15:10:54:Unit 01: 31.68%
15:11:00:Unit 01: 33.62%
15:11:06:Unit 01: 35.57%
15:11:12:Unit 01: 37.49%
15:11:18:Unit 01: 39.45%
15:11:24:Unit 01: 41.38%
15:11:30:Unit 01: 43.32%
15:11:36:Unit 01: 45.26%
15:11:42:Unit 01: 47.19%
15:11:48:Unit 01: 49.11%
15:11:54:Unit 01: 51.07%
15:12:00:Unit 01: 53.01%
15:12:06:Unit 01: 54.94%
15:12:12:Unit 01: 56.88%
15:12:18:Unit 01: 58.84%
15:12:24:Unit 01: 60.75%
15:12:30:Unit 01: 62.69%
15:12:36:Unit 01: 64.65%
15:12:42:Unit 01: 66.58%
15:12:48:Unit 01: 68.52%
15:12:49:Unit 00:Completed 15000 out of 1500000 steps (1%)
15:12:54:Unit 01: 70.46%
15:13:00:Unit 01: 72.40%
15:13:06:Unit 01: 74.33%
15:13:12:Unit 01: 76.29%
15:13:18:Unit 01: 78.06%
15:13:24:Unit 01: 80.14%
15:13:30:Unit 01: 82.10%
15:13:36:Unit 01: 83.95%
15:13:42:Unit 01: 85.97%
15:13:48:Unit 01: 87.91%
15:13:54:Unit 01: 89.66%
15:14:00:Unit 01: 91.76%
15:14:06:Unit 01: 93.72%
15:14:12:Unit 01: 95.60%
15:14:18:Unit 01: 97.60%
15:14:24:Unit 01: 99.55%
15:14:30:Unit 01: Upload complete
15:14:30:Server responded WORK_ACK (400)
15:14:30:Final credit estimate, 487.00 points
15:14:30:Cleaning up Unit 01
 
is this your first wu in VM? also, try to use 6.34 instead of 7 in linux. have you tried the command flag -send x (where 'x' is the wu number that is completed; 01, 02, 03, or so on) thats what I do when a WU gets stuck, works like a charm; except "-send all" does not work... I normally have to specify the WU.

edit: ensure your VM is set up for a bridged connection too.
 
This is my first couple units with new processor (BD) in VM, I have folded a number of units with another processor (1090T) and not noticed this before. I'll check for bridged NAT.
 
What version of v7 are you using? 7.1.38?

Also, to get the most mileage out of what you're seeing you should report this on FF.org.
 
Yes, v7.1.38. I'll post at the folding forum to. I have changed to bridge NAT now, will see if that helps.
 
Back