Slots failing in Linux folding boxes

KeeperOfTheButch

Member
Joined
May 8, 2024
Something is going on with the apes' Linux folding machines. Slots are failing after send errors, and cores are being returned and failed. It's happening across multiple GPUs, multiple PCs, and multiple IP addresses. It seems all slots are getting errors, but only some to the point of failure. The failing slots are random; it can happen to any of them.

Windows folding boxes seem to be doing just fine. I believe this to be a Linux-related issue.

Is anyone else experiencing issues? I'm hoping this is just temporary. Might have to switch everything over to Windows.

Our PPD has taken a significant hit. From 200m down to 130m.
 
What core is failing? From what I saw, depending on the distro, you could have every 22, 23, or 24 core fail because of the packaged libraries.

1736204010672.png
 
24 and 23 for sure

Errors seem to come around the same time for different computers. They also seem to include some "Could not get an assignment" problems.

Box 4

1736265797090.png

Box 1

1736265828621.png
 
So I've noticed some weird point/WU fluctuations on my 2 Windows machines, so I don't think this is limited to Linux boxes. Just checked the logs and both machines are returning errors about not being assigned WUs...

Code:
******************************* Date: 2025-01-07 *******************************
14:55:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
******************************* Date: 2025-01-07
 
My Windows box is now getting a bunch of stuff. Looks like we may have finished Folding@home.




1736278805599.png
 
My Windows box is now getting a bunch of stuff. Looks like we may have finished Folding@home.


*beeg uh-oh errors*

I was just about to post similar...I'm seeing 1-2/day on this machine alone. Really hope that's not the case, as I could have saved a TON of money by not being heavily influenced by PPD on my recent 4070 Super :ROFLMAO:

Code:
14:55:55:WU00:FS00:0x23:Completed 2475000 out of 2500000 steps (99%)
14:55:55:WU01:FS00:Connecting to assign1.foldingathome.org:80
14:55:56:WU01:FS00:Assigned to work server 158.130.118.26
14:55:56:WU01:FS00:Requesting new work unit for slot 00: gpu:1:0 TU104 [GeForce RTX 2080 SUPER] from 158.130.118.26
14:55:56:WU01:FS00:Connecting to 158.130.118.26:8080
14:55:56:ERROR:WU01:FS00:Exception: Server did not assign work unit
14:55:57:WU01:FS00:Connecting to assign1.foldingathome.org:80
14:55:57:WU01:FS00:Assigned to work server 128.174.73.74
14:55:57:WU01:FS00:Requesting new work unit for slot 00: gpu:1:0 TU104 [GeForce RTX 2080 SUPER] from 128.174.73.74
14:55:57:WU01:FS00:Connecting to 128.174.73.74:8080
14:55:59:WU01:FS00:Downloading 9.57MiB
14:56:00:WU01:FS00:Download complete
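
If anyone wants to tally these across boxes, here is a rough Python sketch that counts "Server did not assign work unit" errors per work server. The regexes are an assumption based only on the log lines quoted in this thread, and `count_assign_failures` is just a name I made up, so treat it as a starting point rather than anything official.

```python
import re
from collections import Counter

# Patterns inferred from the log lines quoted in this thread (assumption:
# other client versions may format these lines differently).
ASSIGNED = re.compile(
    r"^\d\d:\d\d:\d\d:(WU\d+:FS\d+):Assigned to work server (\S+)"
)
FAILED = re.compile(
    r"^\d\d:\d\d:\d\d:ERROR:(WU\d+:FS\d+):Exception: Server did not assign work unit"
)


def count_assign_failures(lines):
    """Count 'did not assign' errors, keyed by the last work server assigned.

    Hypothetical helper: each error is attributed to the most recent
    'Assigned to work server' line seen for the same WU/FS pair.
    """
    failures = Counter()
    last_server = {}  # "WUxx:FSxx" -> most recently assigned work server
    for line in lines:
        line = line.strip()
        match = ASSIGNED.match(line)
        if match:
            last_server[match.group(1)] = match.group(2)
            continue
        match = FAILED.match(line)
        if match:
            failures[last_server.get(match.group(1), "unknown")] += 1
    return failures


# Sample taken from the log excerpt above.
sample = [
    "14:55:56:WU01:FS00:Assigned to work server 158.130.118.26",
    "14:55:56:WU01:FS00:Connecting to 158.130.118.26:8080",
    "14:55:56:ERROR:WU01:FS00:Exception: Server did not assign work unit",
    "14:55:57:WU01:FS00:Assigned to work server 128.174.73.74",
    "14:55:59:WU01:FS00:Downloading 9.57MiB",
]
result = count_assign_failures(sample)
```

To run it on a real box, pass it the lines of whatever log file your client writes (any iterable of strings works, including an open file). If one work server dominates the counts on several machines, that would point at the server side rather than at Linux or Windows.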
 
I'm glad to see it's not just me. I was about to nuke my Linux machines and install Windows. Looks like everything is having problems. Good thing is we finished F@H now and not after the 50 series becomes available.

I wish they were more transparent about wtf is going on at F@H. Any kind of info would be nice.
 
LOL can you imagine all the people buying $1-2000 5080s and 5090s only to have nothing to Fold? The winners would be NVIDIA and the companies raking in restocking fees. XD

F@H has NEVER been great, let alone good, at relaying wth is going on with their servers, WUs, cores, etc. Can't complain too much I guess, since it's only a handful of folks doing the work for what I assume is free?
 
Can't be every Win Folder. My 24-hour average is 83 mil over 7 days, but in the last 24 I have done 91 mil. :unsure:

The 'Hawk is similar: almost 88 over 7 days, 90.5 in the last 24.

I am still on board for a 5000.
 
STOP TAKING ALL OUR WUs!!!!! :LOL:

I was looking at a lot of the top guys and everyone is down, some more than others. The monkeys are down around 20M PPD. Maybe it's getting better?

Looks like assign1 & assign2 are throwing errors and might have some down nodes?

I wonder if that winter storm had anything to do with it.
 
Don't forget that when COVID hit and F@H started doing WUs for COVID, the assignment servers were drained of WUs, and back then it was not uncommon to go a day or 2 without getting a WU. So it may be that F@H is a bit slow in getting new WUs out.
 
Are folks preferring one type of WU over the other? I've always just set it to Any Disease when configuring through the Windows web control.