PDA

View Full Version : help with 2nd client on c2d


Ichelo351
11-19-07, 08:08 AM
Hey all,
on my c2d, my 2nd client isn't folding, here is the log file, any help you could give me would be great. the first client is folding along nicely, but this one just stops right there at the end, i've tried deleting everything and rerunning the console and it doesn't the same thing, task manager shows the core running and using cpu, but it doesn't do anything.


--- Opening Log file [November 19 13:29:37]


# Windows Console Edition ################################################## ###
################################################## #############################

Folding@Home Client Version 5.04beta

http://folding.stanford.edu

################################################## #############################
################################################## #############################

Launch directory: C:\Program Files\fah\fah2
Service: C:\Program Files\fah\fah2\FAH504-Console.exe
Arguments: -svcstart

Launched as a service.
Entered C:\Program Files\fah\fah2 to do work.

[13:29:37] - Ask before connecting: No
[13:29:37] - User name: ichelo351 (Team 32)
[13:29:37] - User ID: 25CC3C1C1C20BBB6
[13:29:37] - Machine ID: 2
[13:29:37]
[13:29:37] Work directory not found. Creating...
[13:29:37] Could not open work queue, generating new queue...
[13:29:37] + Benchmarking ...
[13:29:40] - Preparing to get new work unit...
[13:29:40] + Attempting to get work packet
[13:29:40] - Connecting to assignment server
[13:29:41] - Couldn't send HTTP request to server
[13:29:41] + Could not connect to Assignment Server
[13:29:42] - Successful: assigned to (171.64.122.136).
[13:29:42] + News From Folding@Home: Welcome to Folding@Home
[13:29:42] Loaded queue successfully.
[13:29:43] + Closed connections
[13:29:43]
[13:29:43] + Processing work unit
[13:29:43] Core required: FahCore_7a.exe
[13:29:43] Core not found.
[13:29:43] - Core is not present or corrupted.
[13:29:43] - Attempting to download new core...
[13:29:43] + Downloading new core: FahCore_7a.exe
[13:29:43] + 10240 bytes downloaded
[13:29:44] + 20480 bytes downloaded
[13:29:44] + 30720 bytes downloaded
[13:29:44] + 40960 bytes downloaded
[13:29:44] + 51200 bytes downloaded
[13:29:44] + 61440 bytes downloaded
[13:29:44] + 71680 bytes downloaded
[13:29:44] + 81920 bytes downloaded
[13:29:44] + 92160 bytes downloaded
[13:29:44] + 102400 bytes downloaded
[13:29:44] + 112640 bytes downloaded
[13:29:44] + 122880 bytes downloaded
[13:29:44] + 133120 bytes downloaded
[13:29:44] + 143360 bytes downloaded
[13:29:44] + 153600 bytes downloaded
[13:29:44] + 163840 bytes downloaded
[13:29:44] + 174080 bytes downloaded
[13:29:44] + 184320 bytes downloaded
[13:29:44] + 194560 bytes downloaded
[13:29:44] + 204800 bytes downloaded
[13:29:44] + 215040 bytes downloaded
[13:29:44] + 225280 bytes downloaded
[13:29:44] + 235520 bytes downloaded
[13:29:44] + 245760 bytes downloaded
[13:29:44] + 256000 bytes downloaded
[13:29:44] + 266240 bytes downloaded
[13:29:44] + 276480 bytes downloaded
[13:29:44] + 286720 bytes downloaded
[13:29:44] + 296960 bytes downloaded
[13:29:44] + 307200 bytes downloaded
[13:29:44] + 317440 bytes downloaded
[13:29:44] + 327680 bytes downloaded
[13:29:44] + 337920 bytes downloaded
[13:29:44] + 348160 bytes downloaded
[13:29:44] + 358400 bytes downloaded
[13:29:44] + 368640 bytes downloaded
[13:29:44] + 378880 bytes downloaded
[13:29:44] + 389120 bytes downloaded
[13:29:44] + 399360 bytes downloaded
[13:29:44] + 409600 bytes downloaded
[13:29:44] + 419840 bytes downloaded
[13:29:44] + 430080 bytes downloaded
[13:29:44] + 440320 bytes downloaded
[13:29:44] + 450560 bytes downloaded
[13:29:44] + 460800 bytes downloaded
[13:29:44] + 471040 bytes downloaded
[13:29:44] + 481280 bytes downloaded
[13:29:45] + 491520 bytes downloaded
[13:29:45] + 501760 bytes downloaded
[13:29:45] + 512000 bytes downloaded
[13:29:45] + 522240 bytes downloaded
[13:29:45] + 532480 bytes downloaded
[13:29:45] + 542720 bytes downloaded
[13:29:45] + 552960 bytes downloaded
[13:29:45] + 563200 bytes downloaded
[13:29:45] + 573440 bytes downloaded
[13:29:45] + 583680 bytes downloaded
[13:29:45] + 593920 bytes downloaded
[13:29:45] + 604160 bytes downloaded
[13:29:45] + 614400 bytes downloaded
[13:29:45] + 624640 bytes downloaded
[13:29:45] + 634880 bytes downloaded
[13:29:45] + 645120 bytes downloaded
[13:29:45] + 655360 bytes downloaded
[13:29:45] + 665600 bytes downloaded
[13:29:45] + 675840 bytes downloaded
[13:29:45] + 686080 bytes downloaded
[13:29:45] + 696320 bytes downloaded
[13:29:45] + 706560 bytes downloaded
[13:29:45] + 716800 bytes downloaded
[13:29:45] + 727040 bytes downloaded
[13:29:45] + 737280 bytes downloaded
[13:29:46] + 747520 bytes downloaded
[13:29:46] + 757760 bytes downloaded
[13:29:46] + 768000 bytes downloaded
[13:29:46] + 778240 bytes downloaded
[13:29:46] + 788480 bytes downloaded
[13:29:46] + 798720 bytes downloaded
[13:29:46] + 808960 bytes downloaded
[13:29:46] + 819200 bytes downloaded
[13:29:46] + 829440 bytes downloaded
[13:29:46] + 839680 bytes downloaded
[13:29:46] + 849920 bytes downloaded
[13:29:46] + 853452 bytes downloaded
[13:29:46] Verifying core Core_7a.fah...
[13:29:46] Signature is VALID
[13:29:46]
[13:29:46] Trying to unzip core FahCore_7a.exe
[13:29:46] Decompressed FahCore_7a.exe (2514944 bytes) successfully
[13:29:46] + Core successfully engaged
[13:29:51]
[13:29:51] + Processing work unit
[13:29:51] Core required: FahCore_7a.exe
[13:29:51] Core found.
[13:29:51] Working on Unit 01 [November 19 13:29:51]
[13:29:51] + Working ...
[13:29:51]
[13:29:51] *------------------------------*
[13:29:51] Folding@Home GB Gromacs Core
[13:29:51] Version 1.94 (March 9, 2007)
[13:29:51]
[13:29:51] Preparing to commence simulation
[13:29:51] - Looking at optimizations...
[13:29:51] - Created dyn
[13:29:51] - Files status OK
[13:29:51] - Expanded 10908 -> 142699 (decompressed 1308.2 percent)
[13:29:51] - Starting from initial work packet
[13:29:52]
[13:29:52] Project: 2096 (Run 119, Clone 41, Gen 21)
[13:29:52]
[13:29:52] Assembly optimizations on if available.
[13:29:52] Entering M.D.
[13:29:58] Protein: p2096_A21_agbnp_amber99
[13:29:58]
[13:29:58] Writing local files
[13:29:58] Extra SSE boost OK.



also my client.cfg
[settings]
username=ichelo351
team=32
asknet=no
machineid=2
local=2

[http]
active=no
host=localhost
port=8080
usereg=no

[power]
battery=yes

[clienttype]
type=3

ChasR
11-19-07, 08:24 AM
While we work on figuring this out, add -forceasm -verbosity 9 -local to the service registy key FAH@C:+Program Files+fah+fah2+FAH504-Console.exe\ImagePath. That will cause the client to write the maximum info to the log and perhaps we'll see what is happening when it stalls. -local is required if you have v5.03 installed on the machine and ensures you use the files in the right directory. forceasm forces the client to use optimizations (SSE). I assume instance 1 has machine ID set to 1 and is in the folder fah/fah1.

Ichelo351
11-19-07, 08:46 AM
I assume instance 1 has machine ID set to 1 and is in the folder fah/fah1.
yes

keys added, stoped and restarted,


# Windows Console Edition ################################################## ###
################################################## #############################

Folding@Home Client Version 5.04beta

http://folding.stanford.edu

################################################## #############################
################################################## #############################

Launch directory: C:\Program Files\fah\fah2
Service: C:\Program Files\fah\fah2\FAH504-Console.exe
Arguments: -svcstart -forceasm -verbosity 9 -local

Launched as a service.
Entered C:\Program Files\fah\fah2 to do work.

Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.

[14:43:46] - Ask before connecting: No
[14:43:46] - User name: ichelo351 (Team 32)
[14:43:46] - User ID: 25CC3C1C1C20BBB6
[14:43:46] - Machine ID: 2
[14:43:46]
[14:43:46] Loaded queue successfully.
[14:43:46] + Benchmarking ...
[14:43:49] The benchmark result is 9620
[14:43:49]
[14:43:49] + Processing work unit
[14:43:49] Core required: FahCore_7a.exe
[14:43:49] - Autosending finished units...
[14:43:49] Trying to send all finished work units
[14:43:49] + No unsent completed units remaining.
[14:43:49] - Autosend completed
[14:43:49] Core found.
[14:43:49] Working on Unit 01 [November 19 14:43:49]
[14:43:49] + Working ...
[14:43:49] - Calling 'FahCore_7a.exe -dir work/ -suffix 01 -checkpoint 15 -service -forceasm -verbose -lifeline 740 -version 504'

[14:43:49]
[14:43:49] *------------------------------*
[14:43:49] Folding@Home GB Gromacs Core
[14:43:49] Version 1.94 (March 9, 2007)
[14:43:49]
[14:43:49] Preparing to commence simulation
[14:43:49] - Assembly optimizations manually forced on.
[14:43:49] - Not checking prior termination.
[14:43:49] - Expanded 10908 -> 142699 (decompressed 1308.2 percent)
[14:43:49]
[14:43:49] Project: 2096 (Run 119, Clone 41, Gen 21)
[14:43:49]
[14:43:49] Assembly optimizations on if available.
[14:43:49] Entering M.D.
[14:43:55] Protein: p2096_A21_agbnp_amber99
[14:43:55]
[14:43:55] Writing local files
[14:43:55] Extra SSE boost OK.

Hack30
11-19-07, 04:24 PM
stop first then add keys then restart

Adak
11-19-07, 05:02 PM
Is this running on a laptop? Desktop's shouldn't have an entry in their client.cfg file of:

[power]
battery=yes

If it is a laptop, it should stay as "yes". Batteries are no match for Folding@Home! :D

and did you add the set up info that ChasR asked you to, to the other FAH client.cfg file, (the #1 machine ID), as well as this one?

It's been a long time since I've seen an amber WU! This used to be a VERY common set of WU's, some years ago.

Ichelo351
11-19-07, 05:25 PM
it's my work laptop so that is why the battery thing is in, and i kinda need the battery to last.
yes the client started up with the keys added
Arguments: -svcstart -forceasm -verbosity 9 -local

and i added it to the first instance, and restarted and it kept going fine.

Adak
11-19-07, 05:50 PM
The first install is folding fine, only the second one is bonkers.

I've seen this happen when the first install was copied over to a new directory. Since it is a service install, it has directory info written into the registry. After being moved or copied, the info in the registry is not correct for the 2nd installed Fah client.

Were any of this 2nd clients files copied over from the first installed folder?

Ichelo351
11-19-07, 05:57 PM
no, it was a fresh install for both, i got new laptop at work on friday, and i made both directories, put the client in each directory ran them, etc. checked this morning and 1 wasn't working. deleted everything in directory in second, re downloaded client ran it, and still nothing.
so i'm kinda stumped...

Adak
11-19-07, 09:25 PM
Do you get more info when the 2nd instance stops, now that you have the verbosity 9 added to it on start up?

How's the memory and other programs on this laptop? Where yours quit previously is very close or at the point where the client needs to claim some free memory.

I'm wondering if you stopped the first Fah client, and just ran the 2nd one, if the 2nd client would then run OK?

Perhaps there is another program which has an affinity set to one core only, and that program is causing Fah to "step aside", since Fah is designed to run only on spare cpu cycles?

This is a stumper! < You can tell I'm reaching on this, at the moment >.

ChasR
11-19-07, 09:31 PM
Try running the 2nd instance from the command line instead of as a service. The client writes far more info to the screen than it does to FAHlog.txt. Stop the fah2 service and disable or set the service to manual and run the client with the flags from the command line (or run). See what happens.

Ichelo351
11-20-07, 05:31 AM
as far as the laptop its a c2d t7300 w/ 2gig ram, currently 1.3 gigs are showing as available, so as far as specs it should be a pretty nice folder (for a laptop) but its not working, when i shut down the first client, and ran the second from a command line, i got
C:\Program Files\fah\fah2>FAH504-Console.exe -forceasm -verbosity 9 -local

Note: Please read the license agreement (FAH504-Console.exe -license). Further
use of this software requires that you have read and accepted this agreement.

Using local directory for work files


--- Opening Log file [November 20 11:11:22]


# Windows Console Edition ################################################## ###
################################################## #############################

Folding@Home Client Version 5.04beta

http://folding.stanford.edu

################################################## #############################
################################################## #############################

Launch directory: C:\Program Files\fah\fah2
Executable: FAH504-Console.exe
Arguments: -forceasm -verbosity 9 -local

Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.

[11:11:22] - Ask before connecting: No
[11:11:22] - User name: ichelo351 (Team 32)
[11:11:22] - User ID: 25CC3C1C1C20BBB6
[11:11:22] - Machine ID: 2
[11:11:22]
[11:11:23] Loaded queue successfully.
[11:11:23] + Benchmarking ...
[11:11:26] The benchmark result is 10176
[11:11:26]
[11:11:26] - Autosending finished units...
[11:11:26] + Processing work unit
[11:11:26] Trying to send all finished work units
[11:11:26] Core required: FahCore_7a.exe
[11:11:26] + No unsent completed units remaining.
[11:11:26] - Autosend completed
[11:11:26] Core found.
[11:11:26] Working on Unit 02 [November 20 11:11:26]
[11:11:26] + Working ...
[11:11:26] - Calling 'FahCore_7a.exe -dir work/ -suffix 02 -checkpoint 15 -force
asm -verbose -lifeline 2688 -version 504'

[11:11:26]
[11:11:26] *------------------------------*
[11:11:26] Folding@Home GB Gromacs Core
[11:11:26] Version 1.94 (March 9, 2007)
[11:11:26]
[11:11:26] Preparing to commence simulation
[11:11:26] - Assembly optimizations manually forced on.
[11:11:26] - Not checking prior termination.
[11:11:26] - Expanded 10908 -> 142699 (decompressed 1308.2 percent)
[11:11:26]
[11:11:26] Project: 2096 (Run 119, Clone 41, Gen 21)
[11:11:26]
[11:11:26] Assembly optimizations on if available.
[11:11:26] Entering M.D.
[11:11:32] Protein: p2096_A21_agbnp_amber99
[11:11:32]
[11:11:32] Writing local files
[11:11:32] Extra SSE boost OK.
and nothing else, it just hangs there, and i can't figure out why, so i stopped that one, deleted everything in the directory, the directory, recreated the directory fah2, re downloaded the client from the folding site, put it in, ran it from the command line and still nothing
C:\Program Files\fah\fah2>FAH504-Console.exe -forceasm -verbosity 9 -local

Note: Please read the license agreement (FAH504-Console.exe -license). Further
use of this software requires that you have read and accepted this agreement.

Using local directory for work files


--- Opening Log file [November 20 11:19:58]


# Windows Console Edition ################################################## ###
################################################## #############################

Folding@Home Client Version 5.04beta

http://folding.stanford.edu

################################################## #############################
################################################## #############################

Launch directory: C:\Program Files\fah\fah2
Executable: FAH504-Console.exe
Arguments: -forceasm -verbosity 9 -local

Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.

[11:19:59] Configuring Folding@Home...

User name [Anonymous]? ichelo351
Team Number [0]? 32
Do not launch this program automatically, remove the service (no/yes) [no]?
Ask before fetching/sending work (no/yes) [no]?
Use Internet Explorer Settings (no/yes) [no]?
Use proxy (yes/no) [no]?
Allow receipt of work assignments and return of work results greater than
5MB in size (such work units may have large memory demands) (no/yes) [no]?
Change advanced options (yes/no) [no]? yes
Core Priority (idle/low) [idle]?
CPU usage requested (5-100) [100]?
Disable highly optimized assembly code (no/yes) [no]?
Pause if battery power is being used (useful for laptops) (no/yes) [no]? yes
Interval, in minutes, between checkpoints (3-30) [15]?
Memory, in MB, to indicate (2038 available) [2038]?
Request work units without deadlines (no/yes) [no]?
Set -advmethods flag always, requesting new advanced
scientific cores and/or work units if available (no/yes) [no]? yes
Ignore any deadline information (mainly useful if
system clock frequently has errors) (no/yes) [no]?
Machine ID (1-8) [1]? 2

[11:20:32] - Ask before connecting: No
[11:20:32] - User name: ichelo351 (Team 32)
[11:20:32] - User ID: 25CC3C1C1C20BBB6
[11:20:32] - Machine ID: 2
[11:20:32]
[11:20:32] Work directory not found. Creating...
[11:20:32] Could not open work queue, generating new queue...
[11:20:32] + Benchmarking ...
[11:20:35] The benchmark result is 9984
[11:20:35] - Preparing to get new work unit...
[11:20:35] - Autosending finished units...
[11:20:35] + Attempting to get work packet
[11:20:35] Trying to send all finished work units
[11:20:35] + No unsent completed units remaining.
[11:20:35] - Will indicate memory of 2038 MB
[11:20:35] - Autosend completed
[11:20:35] - Connecting to assignment server
[11:20:35] Connecting to http://assign.stanford.edu:8080/
[11:20:36] - Couldn't send HTTP request to server
[11:20:36] + Could not connect to Assignment Server
[11:20:36] Connecting to http://assign2.stanford.edu:80/
[11:20:37] Posted data.
[11:20:37] Initial: 40AB; - Successful: assigned to (171.64.122.136).
[11:20:37] + News From Folding@Home: Welcome to Folding@Home
[11:20:37] Loaded queue successfully.
[11:20:37] Connecting to http://171.64.122.136:80/
[11:20:37] Posted data.
[11:20:37] Initial: 0000; - Receiving payload (expected size: 11420)
[11:20:38] - Downloaded at ~11 kB/s
[11:20:38] - Averaged speed for that direction ~11 kB/s
[11:20:38] + Received work.
[11:20:38] + Closed connections
[11:20:38]
[11:20:38] + Processing work unit
[11:20:38] Core required: FahCore_7a.exe
[11:20:38] Core not found.
[11:20:38] - Core is not present or corrupted.
[11:20:38] - Attempting to download new core...
[11:20:38] + Downloading new core: FahCore_7a.exe
[11:20:38] Downloading core (/~pande/Win32/x86//Core_7a.fah from www.stanford.ed
u)
[11:20:38] Initial: AFDE; + 10240 bytes downloaded
[...
[11:20:40] Initial: 7BCC; + 853452 bytes downloaded
[11:20:40] Verifying core Core_7a.fah...
[11:20:40] Signature is VALID
[11:20:40]
[11:20:40] Trying to unzip core FahCore_7a.exe
[11:20:40] Decompressed FahCore_7a.exe (2514944 bytes) successfully
[11:20:40] + Core successfully engaged
[11:20:45]
[11:20:45] + Processing work unit
[11:20:45] Core required: FahCore_7a.exe
[11:20:45] Core found.
[11:20:45] Working on Unit 01 [November 20 11:20:45]
[11:20:45] + Working ...
[11:20:45] - Calling 'FahCore_7a.exe -dir work/ -suffix 01 -checkpoint 15 -force
asm -verbose -lifeline 2604 -version 504'

[11:20:45]
[11:20:45] *------------------------------*
[11:20:45] Folding@Home GB Gromacs Core
[11:20:45] Version 1.94 (March 9, 2007)
[11:20:45]
[11:20:45] Preparing to commence simulation
[11:20:45] - Assembly optimizations manually forced on.
[11:20:45] - Not checking prior termination.
[11:20:45] - Expanded 10908 -> 142699 (decompressed 1308.2 percent)
[11:20:45] - Starting from initial work packet
[11:20:45]
[11:20:45] Project: 2096 (Run 119, Clone 41, Gen 21)
[11:20:45]
[11:20:45] Assembly optimizations on if available.
[11:20:45] Entering M.D.
[11:20:52] Protein: p2096_A21_agbnp_amber99
[11:20:52]
[11:20:52] Writing local files
[11:20:52] Extra SSE boost OK.

its very confusing because it has plenty of resources available, its the only instance running, its fresh put in, and it doesn't go past that point.
and task manager show the process and its using 50% cpu (the other 50% is currently idle)
i am really stumped on this one...

ChasR
11-20-07, 06:16 AM
It's possible that the issue is one of writing to the log as opposed to actually folding. I have this tingling sensation that tells me that this happens when FAH is installed in a subdirectory of another folding directory, but I thought the -local flag would take care of that. In any event, you eliminated that possibility earlier.

FC Forum is down so I can't search there for the answer.

Ichelo351
11-20-07, 07:42 AM
well i kinda fixed it. i deleted everything, re-ran it, but used a machine id of 3. and when i ran it, it picked up FahCore_82, and its working. so now both are folding. so my thought is was there something with FahCore_7a that didn't get along with my computer...
and as far as the directory of the clients i have both in separate directories in \fah i.e. \fah\fah1 and \fah\fah2

i guess what concerns me is if i get assigned another wu that uses the 7a core and it freezes like that again...

Adak
11-20-07, 08:52 AM
Well, that's good news! Now you'll be folding. :clap:

Here's what I did with my desktop C2D with the same memory as your laptop:

1) created a Program Filez directory (with a z)
2) created a fah sub-directory of the Progam Filez directory
3) created a fah1 and fah2 sub-directory of fah

4) downloaded the console 5.04 client, and put one copy in fah1 and in fah2
5) ran them, giving them the same settings you gave yours in your next to the last post. OK, not the same name, but I added the battery settings! :)

6) the WU's I received were different, the cores are 78 and 80.

Let them run in program mode for 30 minutes, then re-booted to test their service running ability.

Result: No problem. Both run fine.

My current theory is that you were given the wrong core to process that WU, but I see from your log that the core for that project, is also a new version. Very possible a new core version in need of some further de-bugging.

We can't discount the possibility that it might also just be a bad WU, but I doubt this, because a bad WU will "kick" out,
and the FAH client will give you an error message, in nearly all cases.

Fahmon is a sweet little monitoring program for the console client, especially when the console is running as a service, without a visible window. Highly recommended:
www.silent-blade.org

ChasR
11-20-07, 09:25 AM
You may be able to set advanced methods in config and avoid assignment of that work unit and core. Also, if you open your firewall on port 8080, your potential WU assignments will be broader. In running config you can influence WU assignment by adjusting the amount of ram reported to the Assignment Server. ATM, the minimum required to be reported is 64 MB. If you set reported ram for an instance above 600, no ram filtering of assignments will occur. Typically, you don't want to run 2 instances with each reporting all the ram though at the moment, with 2 GB of ram, it doesn't make a difference and probably never will on rigs not set to fold big WUs. I'm glad you got it fixed. When the FC Forum comes back up I'll look and see if this issue has been reported. Have you considered the SMP client?

Ichelo351
11-20-07, 09:39 AM
Thanks for your help!
as far as the smp client goes, i thought i read that it was for quad cores and not dual?
if you think i should run the smp i will deffiantly give it a try

ChasR
11-20-07, 10:40 AM
I have a T7200 running Vista making 1100 ppd on the SMP client. It runs fine on most dual core rigs, but 24/7 operation is almost a must to meet the tight deadlines, at least on the slower ones.

Ichelo351
11-20-07, 11:06 AM
thanks, i will have to try it out