
Stalled FAH


orion456

Member
Joined
May 31, 2004
Recently I have been finding my cores stalling out after they reach 100%. Sometimes going to Services and restarting them works, but about half the time only rebooting gets the calculations going again.

Also, when two FAH clients are running they get a higher priority than a background program with normal priority. It seems the priority of the FAH clients is set too high and background work is being ignored. Recently I had my background video processing running at 3% while the two FAH clients took the rest of the time slices. When I put the video in the foreground it went up to 50%, and if I kill one FAH client I also get 50%. How do you determine what priority a service has, and how do you make sure it's low enough not to block anything else?
 
FAH operates best set to idle priority. The choices are idle and low; both should be lower than any application you might be running. In client.cfg, if there is no priority statement or priority=0, FAH is set to idle priority. Nothing can be lower than that. If priority=96, FAH is set to low priority and you may see interference with other background tasks running at the same priority. You can change the priority of a Stanford install either by stopping the service and running FAH with the -config or -configonly option, or by editing client.cfg with Notepad, Metapad, or NoteTab. For a one-click install, edit client.cfg. The one-click install defaults to idle priority.
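If you want to check several installs without opening each file by hand, here is a minimal Python sketch of the rule described above: no priority statement or priority=0 means idle, priority=96 means low. It assumes client.cfg is a plain INI-style file with bracketed section headers. The function name `fah_priority` and the section names in the sample are only illustrative; since the thread does not say which section holds the key, the sketch scans all of them.

```python
import configparser

# Values as described in the post above: 0 (or no entry) is idle, 96 is low.
PRIORITY_NAMES = {"0": "idle", "96": "low"}


def fah_priority(cfg_text):
    """Return 'idle', 'low', or 'unknown (<value>)' for the text of a client.cfg."""
    parser = configparser.ConfigParser(strict=False, interpolation=None)
    parser.read_string(cfg_text)
    # The section that holds the key is an unknown here, so check every section.
    for section in parser.sections():
        if parser.has_option(section, "priority"):
            value = parser.get(section, "priority").strip()
            return PRIORITY_NAMES.get(value, f"unknown ({value})")
    # No priority statement at all: the client falls back to idle.
    return "idle"


# Hypothetical file contents; the section names are assumptions for illustration.
sample = "[settings]\nusername=orion456\nteam=32\n\n[core]\npriority=96\n"
print(fah_priority(sample))
```

To run it against a real install, read the file first, for example `fah_priority(open(path).read())`, where `path` points at the client.cfg in that client's folder.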

I don't think the service stalling is related to FAH but a symptom of some other OS related problem.
 
I think the reason the FAH clients are stalling is the following warning in the Event Viewer:

"TCP/IP has reached the security limit imposed on the number of concurrent TCP connect attempts."

Somehow FAH is generating concurrent connect attempts, and it is doing that on multiple computers. Any clues as to what is going on? Apparently this is something new added in SP2.
 
orion456 said:
I think the reason the FAH clients are stalling is the following warning in the Event Viewer:

"TCP/IP has reached the security limit imposed on the number of concurrent TCP connect attempts."

Somehow FAH is generating concurrent connect attempts, and it is doing that on multiple computers. Any clues as to what is going on? Apparently this is something new added in SP2.
Could it be a firewall issue?
That's just my gut feeling though....
 
--- Opening Log file [November 28 03:23:17]


# Windows Console Edition #####################################################
###############################################################################

Folding@Home Client Version 5.02

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: C:\PROGRA~1\fah2
Service: C:\PROGRA~1\fah2\fah502-console
Arguments: -svcstart

Launched as a service.
Entered C:\PROGRA~1\fah2 to do work.

[03:23:17] - Ask before connecting: No
[03:23:17] - Use IE connection settings: Yes
[03:23:17] - User name: orion456 (Team 32)
[03:23:17] - User ID: xxxxxxxxxxxxxxxxxxxxx
[03:23:17] - Machine ID: 2
[03:23:17]
[03:23:18] Loaded queue successfully.
[03:23:18] Deleting incompletely fetched item (4) from queue position #2
[03:23:18] + Benchmarking ...
[03:23:20] - Preparing to get new work unit...


[03:23:20] + Attempting to get work packet
[03:23:20] + Attempting to send results
[03:23:20] - Connecting to assignment server
[03:23:20] - Successful: assigned to ().
[03:23:20] + News From Folding@Home: Welcome to Folding@Home
[03:23:20] Loaded queue successfully.
[03:23:51] Couldn't send HTTP request to server (wininet)
[03:23:51] + Could not connect to Work Server (results)
[03:23:51] (171.65.103.160:8080)
[03:23:51] - Error: Could not transmit unit 03 (completed November 3) to work server.


[03:23:51] + Attempting to send results
[03:23:51] Error: Got status code 503 from server
[03:23:51] + Could not connect to Work Server (results)
[03:23:51] ()
[03:23:51] Could not transmit unit 03 to Collection server; keeping in queue.
[03:24:00] + Could not get Work unit data from Work Server
[03:24:00] - Error: Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[03:24:11] + Attempting to get work packet
[03:24:11] - Connecting to assignment server
[03:24:12] Couldn't send HTTP request to server (wininet)
[03:24:12] + Could not connect to Assignment Server
[03:24:12] - Successful: assigned to ().
[03:24:12] + News From Folding@Home: Welcome to Folding@Home
[03:24:12] Loaded queue successfully.
[03:24:49] + Could not get Work unit data from Work Server
[03:24:49] - Error: Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[03:24:59] + Attempting to get work packet
[03:24:59] - Connecting to assignment server
[03:25:00] Couldn't send HTTP request to server (wininet)
[03:25:00] + Could not connect to Assignment Server
[03:25:00] Couldn't send HTTP request to server (wininet)
[03:25:00] + Could not connect to Assignment Server 2
[03:25:00] + Couldn't get work instructions.
[03:25:00] - Error: Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
[03:25:25] + Attempting to get work packet
[03:25:25] - Connecting to assignment server
[03:25:26] Couldn't send HTTP request to server (wininet)
[03:25:26] + Could not connect to Assignment Server
[03:25:26] Couldn't send HTTP request to server (wininet)
[03:25:26] + Could not connect to Assignment Server 2
[03:25:26] + Couldn't get work instructions.
[03:25:26] - Error: Attempt #4 to get work failed, and no other work to do.
Waiting before retry.
[03:26:14] + Attempting to get work packet
[03:26:14] - Connecting to assignment server
[03:26:15] Couldn't send HTTP request to server (wininet)
[03:26:15] + Could not connect to Assignment Server
[03:26:15] Couldn't send HTTP request to server (wininet)
[03:26:15] + Could not connect to Assignment Server 2
[03:26:15] + Couldn't get work instructions.
[03:26:15] - Error: Attempt #5 to get work failed, and no other work to do.
Waiting before retry.
[03:27:38] + Attempting to get work packet
[03:27:38] - Connecting to assignment server
[03:27:38] Couldn't send HTTP request to server (wininet)
[03:27:38] + Could not connect to Assignment Server
[03:27:38] Couldn't send HTTP request to server (wininet)
[03:27:38] + Could not connect to Assignment Server 2
[03:27:38] + Couldn't get work instructions.
[03:27:38] - Error: Attempt #6 to get work failed, and no other work to do.
Waiting before retry.
[03:30:26] + Attempting to get work packet
[03:30:26] - Connecting to assignment server
[03:30:26] Couldn't send HTTP request to server (wininet)
[03:30:26] + Could not connect to Assignment Server
[03:30:26] Couldn't send HTTP request to server (wininet)
[03:30:26] + Could not connect to Assignment Server 2
[03:30:26] + Couldn't get work instructions.
[03:30:26] - Error: Attempt #7 to get work failed, and no other work to do.
Waiting before retry.
[03:35:50] + Attempting to get work packet
[03:35:50] - Connecting to assignment server
[03:35:50] Couldn't send HTTP request to server (wininet)
[03:35:50] + Could not connect to Assignment Server
[03:35:50] Couldn't send HTTP request to server (wininet)
[03:35:50] + Could not connect to Assignment Server 2
[03:35:50] + Couldn't get work instructions.
[03:35:50] - Error: Attempt #8 to get work failed, and no other work to do.
Waiting before retry.
[03:46:36] + Attempting to get work packet
[03:46:36] - Connecting to assignment server
[03:46:36] Couldn't send HTTP request to server (wininet)
[03:46:36] + Could not connect to Assignment Server
[03:46:37] Couldn't send HTTP request to server (wininet)
[03:46:37] + Could not connect to Assignment Server 2
[03:46:37] + Couldn't get work instructions.
[03:46:37] - Error: Attempt #9 to get work failed, and no other work to do.
Waiting before retry.
[04:08:06] + Attempting to get work packet
[04:08:06] - Connecting to assignment server
[04:08:06] Couldn't send HTTP request to server (wininet)
[04:08:06] + Could not connect to Assignment Server
[04:08:06] Couldn't send HTTP request to server (wininet)
[04:08:06] + Could not connect to Assignment Server 2
[04:08:06] + Couldn't get work instructions.
[04:08:06] - Error: Attempt #10 to get work failed, and no other work to do.
Waiting before retry.
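
The timestamps in the log above show the client backing off between attempts: the wait after each failure grows from about 10 seconds to more than 21 minutes by attempt #9. Here is a minimal Python sketch for pulling those waits out of a log in this format. The function name `retry_waits` is made up for illustration, and the two patterns are matched against the line wording seen in this particular log, so other client versions may need different patterns.

```python
import re
from datetime import datetime, timedelta

# Patterns taken from the line wording in the log above.
FAIL = re.compile(r"^\[(\d\d:\d\d:\d\d)\] - Error: Attempt #(\d+) to get work failed")
RETRY = re.compile(r"^\[(\d\d:\d\d:\d\d)\] \+ Attempting to get work packet")


def retry_waits(log_text):
    """Return [(attempt_number, seconds_waited_before_next_attempt), ...]."""
    waits = []
    pending = None  # (attempt number, failure time) still waiting for a retry
    for line in log_text.splitlines():
        m = FAIL.match(line)
        if m:
            pending = (int(m.group(2)), datetime.strptime(m.group(1), "%H:%M:%S"))
            continue
        m = RETRY.match(line)
        if m and pending:
            delta = datetime.strptime(m.group(1), "%H:%M:%S") - pending[1]
            if delta < timedelta(0):
                delta += timedelta(days=1)  # the log rolled past midnight
            waits.append((pending[0], int(delta.total_seconds())))
            pending = None
    return waits


# A short excerpt copied from the log above.
excerpt = """[03:24:00] - Error: Attempt #1 to get work failed, and no other work to do.
Waiting before retry.
[03:24:11] + Attempting to get work packet
[03:24:49] - Error: Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[03:24:59] + Attempting to get work packet
[03:25:00] - Error: Attempt #3 to get work failed, and no other work to do.
Waiting before retry.
[03:25:25] + Attempting to get work packet
"""
for attempt, seconds in retry_waits(excerpt):
    print(f"attempt #{attempt}: waited {seconds} s before retrying")
```

A long run of growing waits like this means the client is still alive and retrying, so the stall is on the network side rather than in the folding core itself.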
 
Now if I reboot, all will be well: the finished work unit gets sent and the client can get data again. This is happening on 4 machines now.
 
Hmm... I think it could be a number of things...
Check your firewall for one... allow that IP address (what firewall are you using?)
I don't use IE settings, so setting "[http]active=no host=local" in the client might help...

Did this just happen recently or was it happening as soon as you installed it?
 
pcmaker401 said:
Hmm... I think it could be a number of things...
Check your firewall for one... allow that IP address (what firewall are you using?)
I don't use IE settings, so setting "[http]active=no host=local" in the client might help...

Did this just happen recently or was it happening as soon as you installed it?

It works fine for days and then suddenly stops, so it can't be the firewall. Apparently SP2 introduced a limit of 10 concurrent incomplete outbound TCP connection attempts in order to slow the spread of worms. Before SP2 there was no such limit, but I'm not sure whether that is significant.

It has been happening on one machine for a few months and is now showing up on more, perhaps since the last MS update.
 
I just terminated the svchost process that was trying to access the IP, and the data has been sent.

There is a patch that raises the connection limit to 50. I will try it and see if that cures the problem.
 
If it were a FAH problem, you probably wouldn't be the only one on the forum experiencing it. I've got 20+ XP SP2 machines folding on the network at the office and have never seen this type of error. As noted by pcmaker401, if you are using IE settings and upgrade to IE7, you will have problems of the type you describe.
 
ChasR said:
If it were a FAH problem, you probably wouldn't be the only one on the forum experiencing it. I've got 20+ XP SP2 machines folding on the network at the office and have never seen this type of error. As noted by pcmaker401, if you are using IE settings and upgrade to IE7, you will have problems of the type you describe.

Just upgraded to IE7, so I guess that is the problem.
 