Im getting Client-core communications error: ERROR 0xc0000005 on both my gpu clients, i have already lost 4 WUs to it!!
im running this in arch linux using the foldingathome-gpu-nvidia package from the AUR .
It completes the work unit, tries to upload it, then gives me the error, deletes the unit and downloads a new one.
I have stoped this service on both the gpus as there is no point in me doing the units if they dont upload.
Any Pointers?
Thanks, Mark
Im praying that it doesnt happen to the smp client on this machine, i havnt completed a smp unit yet, so i wont know, (current one is at 95% so a little while and i will know)
im running this in arch linux using the foldingathome-gpu-nvidia package from the AUR .
It completes the work unit, tries to upload it, then gives me the error, deletes the unit and downloads a new one.
I have stoped this service on both the gpus as there is no point in me doing the units if they dont upload.
Code:
# Windows GPU Console Edition #################################################
###############################################################################
Folding@Home Client Version 6.30r1
http://folding.stanford.edu
###############################################################################
###############################################################################
Launch directory: Z:\opt\fah-gpu\alpha
Executable: Z:\opt\fah-gpu\[email protected]
Arguments: -forcegpu nvidia_g80 -gpu 0 -verbosity 9
[14:19:09] - Ask before connecting: No
[14:19:09] - User name: markp1989 (Team 32)
[14:19:09] - User ID: 171CE6BC1C1CEECA
[14:19:09] - Machine ID: 2
[14:19:09]
[14:19:09] Gpu species not recognized.
[14:19:09] Loaded queue successfully.
[14:19:09]
[14:19:09] + Processing work unit
[14:19:09] Core required: FahCore_11.exe
[14:19:09] Core found.
[14:19:09] - Autosending finished units... [October 20 14:19:09 UTC]
[14:19:09] Trying to send all finished work units
[14:19:09] + No unsent completed units remaining.
[14:19:09] - Autosend completed
[14:19:09] Working on queue slot 02 [October 20 14:19:09 UTC]
[14:19:09] + Working ...
[14:19:09] - Calling '.\FahCore_11.exe -dir work/ -suffix 02 -nice 19 -checkpoint 15 -verbose -lifeline 8 -version 630'
[14:19:09]
[14:19:09] *------------------------------*
[14:19:09] Folding@Home GPU Core
[14:19:09] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[14:19:09]
[14:19:09] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[14:19:09] Build host: amoeba
[14:19:09] Board Type: Nvidia
[14:19:09] Core :
[14:19:09] Preparing to commence simulation
[14:19:09] - Looking at optimizations...
[14:19:09] DeleteFrameFiles: successfully deleted file=work/wudata_02.ckp
[14:19:09] - Created dyn
[14:19:09] - Files status OK
[14:19:09] Error: Missing work file=<>
[14:19:09]
[14:19:09] Folding@home Core Shutdown: MISSING_WORK_FILES
[14:19:13] CoreStatus = 74 (116)
[14:19:13] The core could not find the work files specified. Removing from queue
[14:19:13] Deleting current work unit & continuing...
[14:19:17] Trying to send all finished work units
[14:19:17] + No unsent completed units remaining.
[14:19:17] - Preparing to get new work unit...
[14:19:17] Cleaning up work directory
[14:19:17] + Attempting to get work packet
[14:19:17] Passkey found
[14:19:17] - Will indicate memory of 3952 MB
[14:19:17] Gpu species not recognized.
[14:19:17] - Detect CPU. Vendor: GenuineIntel, Family: 6, Model: 14, Stepping: 5
[14:19:17] - Connecting to assignment server
[14:19:17] Connecting to http://assign-GPU.stanford.edu:8080/
[14:19:18] Posted data.
[14:19:18] Initial: 40AB; - Successful: assigned to (171.64.65.61).
[14:19:18] + News From Folding@Home: Welcome to Folding@Home
[14:19:18] Loaded queue successfully.
[14:19:18] Gpu species not recognized.
[14:19:18] Sent data
[14:19:18] Connecting to http://171.64.65.61:8080/
[14:19:19] Posted data.
[14:19:19] Initial: 0000; - Receiving payload (expected size: 74322)
[14:19:20] - Downloaded at ~72 kB/s
[14:19:20] - Averaged speed for that direction ~81 kB/s
[14:19:20] + Received work.
[14:19:20] + Closed connections
[14:19:25]
[14:19:25] + Processing work unit
[14:19:25] Core required: FahCore_11.exe
[14:19:25] Core found.
[14:19:25] Working on queue slot 03 [October 20 14:19:25 UTC]
[14:19:25] + Working ...
[14:19:25] - Calling '.\FahCore_11.exe -dir work/ -suffix 03 -nice 19 -checkpoint 15 -verbose -lifeline 8 -version 630'
[14:19:26]
[14:19:26] *------------------------------*
[14:19:26] Folding@Home GPU Core
[14:19:26] Version 1.31 (Tue Sep 15 10:57:42 PDT 2009)
[14:19:26]
[14:19:26] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[14:19:26] Build host: amoeba
[14:19:26] Board Type: Nvidia
[14:19:26] Core :
[14:19:26] Preparing to commence simulation
[14:19:26] - Looking at optimizations...
[14:19:26] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[14:19:26] - Created dyn
[14:19:26] - Files status OK
[14:19:26] - Expanded 73810 -> 383588 (decompressed 519.6 percent)
[14:19:26] Called DecompressByteArray: compressed_data_size=73810 data_size=383588, decompressed_data_size=383588 diff=0
[14:19:26] - Digital signature verified
[14:19:26]
[14:19:26] Project: 6606 (Run 8, Clone 514, Gen 360)
[14:19:26]
[14:19:26] Assembly optimizations on if available.
[14:19:26] Entering M.D.
[14:19:32] Tpr hash work/wudata_03.tpr: 2699855297 767549063 1027283909 3647827944 842538790
[14:19:32]
[14:19:32] Calling fah_main args: 14 usage=100
[14:19:32]
[14:19:32] Working on Protein
[14:19:33] Client config found, loading data.
[14:19:33] Starting GUI Server
[14:21:12] Completed 1%
[14:22:54] Completed 2%
[14:24:33] Completed 3%
[14:26:06] Completed 4%
[14:27:39] Completed 5%
[14:29:13] Completed 6%
[14:30:47] Completed 7%
[14:32:20] Completed 8%
[14:33:53] Completed 9%
[14:35:27] Completed 10%
[14:37:00] Completed 11%
[14:38:33] Completed 12%
[14:40:07] Completed 13%
[14:41:41] Completed 14%
[14:43:15] Completed 15%
[14:44:50] Completed 16%
[14:46:27] Completed 17%
[14:48:02] Completed 18%
[14:49:41] Completed 19%
[14:51:18] Completed 20%
[14:52:53] Completed 21%
[14:54:29] Completed 22%
[14:56:04] Completed 23%
[14:57:41] Completed 24%
[14:59:15] Completed 25%
[15:00:50] Completed 26%
[15:02:24] Completed 27%
[15:03:58] Completed 28%
[15:05:31] Completed 29%
[15:07:04] Completed 30%
[15:08:38] Completed 31%
[15:10:12] Completed 32%
[15:11:46] Completed 33%
[15:13:19] Completed 34%
[15:14:52] Completed 35%
[15:16:27] Completed 36%
[15:18:00] Completed 37%
[15:19:34] Completed 38%
[15:21:12] Completed 39%
[15:22:54] Completed 40%
[15:24:36] Completed 41%
[15:26:19] Completed 42%
[15:27:53] Completed 43%
[15:29:32] Completed 44%
[15:31:08] Completed 45%
[15:32:47] Completed 46%
[15:34:29] Completed 47%
[15:36:05] Completed 48%
[15:37:43] Completed 49%
[16:39:25] Completed 50%
[16:40:59] Completed 51%
[16:42:45] Completed 52%
[16:44:18] Completed 53%
[16:45:53] Completed 54%
[16:47:29] Completed 55%
[16:49:06] Completed 56%
[16:50:41] Completed 57%
[16:52:15] Completed 58%
[16:53:48] Completed 59%
[16:55:21] Completed 60%
[16:56:55] Completed 61%
[16:58:34] Completed 62%
[17:00:15] Completed 63%
[17:01:49] Completed 64%
[17:03:24] Completed 65%
[17:05:01] Completed 66%
[17:06:42] Completed 67%
[17:08:24] Completed 68%
[17:10:05] Completed 69%
[17:11:41] Completed 70%
[17:13:16] Completed 71%
[17:14:54] Completed 72%
[17:16:29] Completed 73%
[17:18:14] Completed 74%
[17:19:51] Completed 75%
[17:21:24] Completed 76%
[17:22:58] Completed 77%
[17:24:31] Completed 78%
[17:26:05] Completed 79%
[17:27:38] Completed 80%
[17:29:12] Completed 81%
[17:30:44] Completed 82%
[17:32:17] Completed 83%
[17:33:50] Completed 84%
[17:35:24] Completed 85%
[17:36:57] Completed 86%
[17:38:30] Completed 87%
[17:40:03] Completed 88%
[17:41:36] Completed 89%
[17:43:08] Completed 90%
[17:44:41] Completed 91%
[17:46:15] Completed 92%
[17:47:48] Completed 93%
[17:49:21] Completed 94%
[17:50:54] Completed 95%
[17:52:26] Completed 96%
[17:53:59] Completed 97%
[17:55:32] Completed 98%
[17:57:05] Completed 99%
[17:58:38] Completed 100%
[17:58:38] Successful run
[17:58:38] DynamicWrapper: Finished Work Unit: sleep=10000
[17:58:48] Reserved 85704 bytes for xtc file; Cosm status=0
[17:58:48] Allocated 85704 bytes for xtc file
[17:58:48] - Reading up to 85704 from "work/wudata_03.xtc": Read 85704
[17:58:48] Read 85704 bytes from xtc file; available packet space=786344760
[17:58:48] xtc file hash check passed.
[17:58:48] Reserved 25248 25248 786344760 bytes for arc file=<work/wudata_03.trr> Cosm status=0
[17:58:48] Allocated 25248 bytes for arc file
[17:58:48] - Reading up to 25248 from "work/wudata_03.trr": Read 25248
[17:58:48] Read 25248 bytes from arc file; available packet space=786319512
[17:58:48] trr file hash check passed.
[17:58:48] Allocated 560 bytes for edr file
[17:58:48] Read bedfile
[17:58:48] edr file hash check passed.
[17:58:48] Allocated 31162 bytes for logfile
[17:58:48] Read logfile
[17:58:48] GuardedRun: success in DynamicWrapper
[17:58:48] GuardedRun: done
[17:58:48] Run: GuardedRun completed.
[17:58:49] + Opened results file
[17:58:49] - Writing 143186 bytes of core data to disk...
[17:58:49] Done: 142674 -> 119850 (compressed to 84.0 percent)
[17:58:49] ... Done.
[17:58:49] DeleteFrameFiles: successfully deleted file=work/wudata_03.ckp
[17:58:49] Shutting down core
[17:58:49]
[17:58:49] Folding@home Core Shutdown: FINISHED_UNIT
[18:09:25] CoreStatus = C0000005 (-1073741819)
[18:09:25] Client-core communications error: ERROR 0xc0000005
[18:09:25] This is a sign of more serious problems, shutting down.
Any Pointers?
Thanks, Mark
Im praying that it doesnt happen to the smp client on this machine, i havnt completed a smp unit yet, so i wont know, (current one is at 95% so a little while and i will know)