• Welcome to Overclockers Forums! Join us to reply in threads, receive reduced ads, and to customize your site experience!

Folding hangup. Help?

Overclockers is supported by our readers. When you click a link to make a purchase, we may earn a commission. Learn More.

NV

Member
Joined
Sep 18, 2009
Hey guys, power flickered here [again] and my computer went down for a little. Something happened to the WU I was crunching. Here's what I get when I run the CPU client:

Code:
Note: Please read the license agreement (fah6 -license). Further 
use of this software requires that you have read and accepted this agreement.

8 cores detected


--- Opening Log file [October 17 23:50:50 UTC] 


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.24R3

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/folding
Executable: ./fah6
Arguments: -smp 8 -bigadv -verbosity 9 

[23:50:50] - Ask before connecting: No
[23:50:50] - User name: NV (Team 32)
[23:50:50] - User ID: 19832F305562F41B
[23:50:50] - Machine ID: 1
[23:50:50] 
[23:50:50] Loaded queue successfully.
[23:50:50] 
[23:50:50] - Autosending finished units... [October 17 23:50:50 UTC]
[23:50:50] + Processing work unit
[23:50:50] Trying to send all finished work units
[23:50:50] Core required: FahCore_a2.exe
[23:50:50] + No unsent completed units remaining.
[23:50:50] - Autosend completed
[23:50:50] Core found.
[23:50:50] Working on queue slot 00 [October 17 23:50:50 UTC]
[23:50:50] + Working ...
[23:50:50] - Calling './mpiexec -np 8 -host 127.0.0.1 ./FahCore_a2.exe -dir work/ -nice 19 -suffix 00 -priority 96 -checkpoint 15 -verbose -lifeline 8549 -version 624'

[0]0:Return code = 0, signaled with Quit
[0]1:Return code = 0, signaled with Quit
[0]2:Return code = 0, signaled with Quit
[0]3:Return code = 0, signaled with Quit
[0]4:Return code = 0, signaled with Quit
[0]5:Return code = 0, signaled with Quit
[0]6:Return code = 0, signaled with Quit
[0]7:Return code = 117
[23:51:13] CoreStatus = 75 (117)
[23:51:13] Error opening or reading from a file.
[23:51:13] Deleting current work unit & continuing...

Dunno what exactly this means, although I'm assuming I've lost the 70+% of that bigWU I had :(. Yes, I did already try restarting the client.

Thanks in advance guys.
 
Yes... unfortunately the WU is gone. :(

The flicker must have hit while the client was updating one of its data files... thus leaving it in an unexpected state when restarting the client.

My best advice, invest in a quality UPS (I like APC) to help alleviate those power flicker issues. Your WUs will thank you for it. :)

I just bought one of these recently...

http://www.provantage.com/~7AMPU06W.htm

...or this one is a little more powerful for not much more.

http://www.provantage.com/~7AMPU06X.htm
 
It's not an actual power issue, its just idiots coming into my room and flipping my circuit breaker thinking its a lightswitch. Even if I did drop the money for a UPS, I don't know where I'd put it.

I appreciate the thought and will look into getting one, since they're pretty important to have anyway, but my more immediate concern is how I get the client crunching again. It just hangs on that error, so right now my i7 is doing... nothing. Doesn't seem normal to me, but I could just be missing something simple.
 
Try deleting the Work directory and queue.dat. But befor you do that, you could try to recover the WU using qfix. Sometimes the queue gets broken rather than the WU. These aren't exactly the right instructions as your's didn't hang at completion but the general instructions are correct and the download is linked.

http://foldingforum.org/viewtopic.php?f=44&t=3889
 
I ran the -delete argument for a couple hours, but it never deleted the unit from the queue. Deleted the work folder and queue.dat, all systems appear to be go once again. Thanks for the help.
 
Back