OS reinstall = broken SMP


CJ145
02-14-08, 04:46 PM
For various reasons I reinstalled Gentoo on this comp.

Now when I run Folding@Home it just hangs on startup. ./fah6, mpiexec, and the 4 FahCore processes show up, but there is no CPU load.

cj@Speedy ~/folding $ ./fah6 -smp -forceasm -verbosity 9

Note: Please read the license agreement (fah6 -license). Further
use of this software requires that you have read and accepted this agreement.

2 cores detected


--- Opening Log file [February 14 22:41:28]


# SMP Client ##################################################################
###############################################################################

Folding@Home Client Version 6.01beta2

http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/cj/folding
Executable: ./fah6
Arguments: -smp -forceasm -verbosity 9

Warning:
By using the -forceasm flag, you are overriding
safeguards in the program. If you did not intend to
do this, please restart the program without -forceasm.
If work units are not completing fully (and particularly
if your machine is overclocked), then please discontinue
use of the flag.

[22:41:28] - Ask before connecting: No
[22:41:28] - User name: cjreis (Team 32)
[22:41:28] - User ID: 7DB7BBE266074DA2
[22:41:28] - Machine ID: 1
[22:41:28]
[22:41:29] Loaded queue successfully.
[22:41:29] - Autosending finished units...
[22:41:29] Trying to send all finished work units
[22:41:29] + No unsent completed units remaining.
[22:41:29] - Autosend completed
[22:41:29]
[22:41:29] + Processing work unit
[22:41:29] Core required: FahCore_a1.exe
[22:41:29] Core found.
[22:41:29] Working on Unit 01 [February 14 22:41:29]
[22:41:29] + Working ...
[22:41:29] - Calling './mpiexec -np 4 -host 127.0.0.1 ./FahCore_a1.exe -dir work/ -suffix 01 -checkpoint 15 -forceasm -verbose -lifeline 29825 -version 601'

[22:41:29]
[22:41:29] *------------------------------*
[22:41:29] Folding@Home Gromacs SMP Core
[22:41:29] Version 1.74 (November 27, 2006)
[22:41:29]
[22:41:29] Preparing to commence simulation
[22:41:29] - Ensuring status. Please wait.
[22:41:29] - Starting from initial work packet
[22:41:29]
[22:41:29] Project: 2605 (Run 11, Clone 130, Gen 28)
[22:41:29]
[22:41:29] Assembly optimizations on if available.
[22:41:29] Entering M.D.
[22:41:46] on if available.
[22:41:46] Entering M.D.
NNODES=4, MYRANK=0, HOSTNAME=Speedy
NNODES=4, MYRANK=1, HOSTNAME=Speedy
NNODES=4, MYRANK=2, HOSTNAME=Speedy
NNODES=4, MYRANK=3, HOSTNAME=Speedy


Anyone seen this before? The same thing happens as root, when I redownload the client, and when I delete work/queue etc.

WarriorII
02-14-08, 05:57 PM
Shelnutt2 is the Linux Guru as far as I know.

I run Ubuntu... cheesy easy install & run.

Shelnutt2
02-14-08, 06:08 PM
OK, a few things. Check your favorite process monitor (I'm not sure what you have installed) and see whether the four FahCore_a1.exe processes are running and, if so, whether they are using any CPU cycles.
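If no process monitor is handy, the same check can be done from a shell. This is only a rough sketch: `fah_filter` is a made-up helper name, and it assumes a procps-style `ps` that accepts `-eo pid,pcpu,rss,args`.

```shell
# Hypothetical helper: keep the header line plus any fah6, mpiexec, or
# FahCore lines from `ps` output read on stdin.
# The bracketed first letters stop the awk process from matching its own
# command line when it shows up in the ps listing.
fah_filter() {
    awk 'NR == 1 || /[f]ah6|[m]piexec|[F]ahCore/'
}
```

Run it as `ps -eo pid,pcpu,rss,args | fah_filter` while the client is started. A %CPU column stuck at 0.0 for every FahCore line would match the "shows up but no load" symptom.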

Also when you said reinstall the OS, what all did you do? Was it a simple kernel upgrade, or did you wipe the drive and start over? Did you leave your /home partition? Just trying to get a feel for what you did.

Also, did you install the emul-linux-x86-baselibs package via Portage?
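One way to confirm that without extra tools is to look in Portage's installed-package database. A minimal sketch, assuming the database lives in the usual /var/db/pkg and that the package sits in the app-emulation category; `pkg_installed` is a hypothetical helper name.

```shell
# Hypothetical helper: succeed if the installed-package database has a
# versioned directory for category/name.
# $1 = category/name, $2 = database root (defaults to /var/db/pkg).
pkg_installed() {
    db=${2:-/var/db/pkg}
    for d in "$db/$1"-[0-9]*; do
        # If the glob matched nothing, $d is the literal pattern and -d fails.
        [ -d "$d" ] && return 0
    done
    return 1
}
```

Usage: `pkg_installed app-emulation/emul-linux-x86-baselibs && echo installed`.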

Lastly, delete your entire folding directory and start again from the beginning (download the client, make the folder, etc.).

cw823
02-14-08, 06:24 PM
I had this happen any time I tried to change network settings after the Ubuntu install. It runs, but never actually folds. It happened twice, and both times I reinstalled.

CJ145
02-14-08, 06:25 PM
There are no runaway folding processes when FAH is off. When it's on I have fah6, mpiexec, and the multiple FahCore processes, but none of them are using any CPU and only about 20MB of RAM.

This was a brand-new, from-scratch install with no old /home, etc. I only restored my files from backups (no OS parts).

emul-linux-x86-baselibs is installed.

I have tried to delete the entire FAH directory and start again. I really don't want to **** off Stanford by deleting work units constantly. I will try one more time though.

I haven't touched my net settings since installing. eth0 and eth1 are DHCP by default with a blank /etc/conf.d/net.

Shelnutt2
02-14-08, 07:20 PM
Post your /etc/hosts file, just in case it has somehow gotten messed up.

Hack30
02-14-08, 08:07 PM
Did you forget the "ia32-libs"? I almost always forget those when I reinstall.

CJ145
02-14-08, 08:22 PM
There is no ia32-libs in Gentoo. You can build with a multilib profile (basically the same as Ubuntu's ia32-libs) or without (64-bit only).

Here is my /etc/hosts

cj@Speedy ~ $ cat /etc/hosts
# /etc/hosts: Local Host Database
#
# This file describes a number of aliases-to-address mappings for the for
# local hosts that share this file.
#
# In the presence of the domain name service or NIS, this file may not be
# consulted at all; see /etc/host.conf for the resolution order.
#

# IPv4 and IPv6 localhost aliases
127.0.0.1 localhost
::1 localhost

#
# Imaginary network.
#10.0.0.2 myname
#10.0.0.3 myfriend
#
# According to RFC 1918, you can use the following IP networks for private
# nets which will never be connected to the Internet:
#
# 10.0.0.0 - 10.255.255.255
# 172.16.0.0 - 172.31.255.255
# 192.168.0.0 - 192.168.255.255
#
# In case you want to be able to connect directly to the Internet (i.e. not
# behind a NAT, ADSL router, etc...), you need real official assigned
# numbers. Do not try to invent your own network numbers but instead get one
# from your network provider (if any) or from your regional registry (ARIN,
# APNIC, LACNIC, RIPE NCC, or AfriNIC.)
#

Shelnutt2
02-14-08, 08:48 PM
"Speedy", that's the name of your computer? Try adding "Speedy" to the hosts file, on the first line that isn't a comment, so it will look like this:
# /etc/hosts: Local Host Database
#
# This file describes a number of aliases-to-address mappings for the for
# local hosts that share this file.
#
# In the presence of the domain name service or NIS, this file may not be
# consulted at all; see /etc/host.conf for the resolution order.
#

# IPv4 and IPv6 localhost aliases
127.0.0.1 localhost Speedy
::1 localhost

#
# Imaginary network.
#10.0.0.2 myname
#10.0.0.3 myfriend
#
# According to RFC 1918, you can use the following IP networks for private
# nets which will never be connected to the Internet:
#
# 10.0.0.0 - 10.255.255.255
# 172.16.0.0 - 172.31.255.255
# 192.168.0.0 - 192.168.255.255
#
# In case you want to be able to connect directly to the Internet (i.e. not
# behind a NAT, ADSL router, etc...), you need real official assigned
# numbers. Do not try to invent your own network numbers but instead get one
# from your network provider (if any) or from your regional registry (ARIN,
# APNIC, LACNIC, RIPE NCC, or AfriNIC.)
#
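To check whether a hosts file already maps the machine's own name, something like the sketch below could be used. `hosts_has_name` is a hypothetical helper. Treating a missing hostname entry as the cause of the hang is an inference from the `HOSTNAME=Speedy` lines in the log, not something the client reports.

```shell
# Hypothetical helper: succeed if hosts file $2 (default /etc/hosts) lists
# the name $1 (default: this machine's hostname) after an address.
hosts_has_name() {
    name=${1:-$(hostname)}
    file=${2:-/etc/hosts}
    awk -v n="$name" '
        { sub(/#.*/, "") }    # strip comments so commented-out names are ignored
        { for (i = 2; i <= NF; i++) if (tolower($i) == tolower(n)) found = 1 }
        END { exit !found }   # exit status 0 only if the name was seen
    ' "$file"
}
```

Usage: `hosts_has_name || echo "hostname missing from /etc/hosts"`. The comparison ignores case, since host names are not case-sensitive.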

CJ145
02-14-08, 08:51 PM
I love you <3

(And I should keep a backup of /etc from now on.)
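A dated tarball is probably the simplest way to do that. A minimal sketch, where `backup_dir` is a made-up helper name and the archive naming is just one possible convention:

```shell
# Hypothetical helper: archive directory $1 (default /etc) into a dated
# .tar.gz under directory $2 (default: current directory) and print the
# archive's path on success.
backup_dir() {
    src=${1:-/etc}
    dest=${2:-.}
    out="$dest/$(basename "$src")-$(date +%Y%m%d).tar.gz"
    # -C keeps paths in the archive relative (etc/hosts, not /etc/hosts).
    tar -czf "$out" -C "$(dirname "$src")" "$(basename "$src")" && echo "$out"
}
```

Usage: `backup_dir /etc ~/backups`. Some files under /etc are only readable by root, so it likely needs to be run as root to capture everything.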

Shelnutt2
02-15-08, 04:50 AM
Glad it's working :clap:

cw823
02-15-08, 07:15 AM
Crapola, someone else calls their PCs Speedy? I have SpeedyQuad, SpeedyServer, SpeedyGame....and of course, good ole' Speedy

CJ145
02-15-08, 08:49 AM
Only one of them.

Speedy (main fastest comp), Clunker (AXP Server), Amplitude (HTPC), BLNT (laptop >.>), Frequency (PS3), Smoothy (router).