Confused: machine dropped from top to bottom of rankings?

Message boards : Number crunching : Confused: machine dropped from top to bottom of rankings?
Message board moderation

To post messages, you must log in.

AuthorMessage
TPR_Mojo
Volunteer tester

Send message
Joined: 18 Apr 00
Posts: 323
Credit: 7,001,052
RAC: 0
United Kingdom
Message 134021 - Posted: 7 Jul 2005, 23:42:28 UTC
Last modified: 7 Jul 2005, 23:46:39 UTC

This machine

http://setiathome.berkeley.edu/show_host_detail.php?hostid=1154305

was my top producer. Now its RAC is below anything else I own. Why? I can guarantee the machine is running as it runs my 13 diskless machines farm. But its RAC seems to have been at best zeroed?

I think the original host record disappeared, as I had to merge two "ghost" machines to make this one. Database error?
ID: 134021 · Report as offensive
Iztok s52d (and friends)

Send message
Joined: 12 Jan 01
Posts: 136
Credit: 393,469,375
RAC: 116
Slovenia
Message 134022 - Posted: 7 Jul 2005, 23:48:48 UTC - in response to Message 134021.  

This machine

http://setiathome.berkeley.edu/show_host_detail.php?hostid=1154305

was my top producer. Now its RAC is below anything else I own. Why? I can guarantee the machine is running as it runs my 13 diskless machines farm. But its RAC seems to have been at best zeroed?


Hello!

I think by merging you keep RAC only if new machine has not returned any WU.
This machine was created "7 Jul 2005 20:01:34 UTC", but you merged and so
it shows all WUs crunched before.

Wait two to three weeks.

BR
Iztok
ID: 134022 · Report as offensive
Urs Echternacht
Volunteer tester
Avatar

Send message
Joined: 15 May 99
Posts: 692
Credit: 135,197,781
RAC: 211
Germany
Message 134024 - Posted: 7 Jul 2005, 23:49:40 UTC
Last modified: 7 Jul 2005, 23:56:40 UTC

You had many Computation Errors lately on that one. Maybe there is a hw-problem with this mashine.

It gets:
"Maximum disk usage exceeded"-errors. Maybe you should alter your disk settings for this one.

_\|/_
U r s
ID: 134024 · Report as offensive
TPR_Mojo
Volunteer tester

Send message
Joined: 18 Apr 00
Posts: 323
Credit: 7,001,052
RAC: 0
United Kingdom
Message 134026 - Posted: 7 Jul 2005, 23:51:32 UTC

Thanks guys but look again. This machine has crunched 60000 cobblestones, it is not a new addition....
ID: 134026 · Report as offensive
TPR_Mojo
Volunteer tester

Send message
Joined: 18 Apr 00
Posts: 323
Credit: 7,001,052
RAC: 0
United Kingdom
Message 134036 - Posted: 8 Jul 2005, 0:09:09 UTC - in response to Message 134024.  



It gets:
"Maximum disk usage exceeded"-errors. Maybe you should alter your disk settings for this one.


Preferences:

Use no more than 100 GB disk space
Leave at least 1 GB disk space free
Use no more than 75% of total disk space


Ouput from "df -H"

/dev/hda2 79G 6.7G 68G 9%

In other words, I have 79Gb space, 6.7Gb used (which includes 13 other BOINC installations), 68GB free, 9% used.

So although BOINC errors with "Max disk space exceeded", how can that be? And that STILL doesn't explain why a machine running 365/7/24 has an RAC of 3 and a total of 63k+ cobblestones?
ID: 134036 · Report as offensive
Iztok s52d (and friends)

Send message
Joined: 12 Jan 01
Posts: 136
Credit: 393,469,375
RAC: 116
Slovenia
Message 134039 - Posted: 8 Jul 2005, 0:14:03 UTC - in response to Message 134026.  

Thanks guys but look again. This machine has crunched 60000 cobblestones, it is not a new addition....


Already said:
- created today
- rest is merged
- problems reported to Berkeley:
Can't acquire lockfile - exiting
or
process exited with code 251 (0xfb)


SETI@home error -5 Can't open file
(work_unit.sah) in read_wu_state() errno=2

Something wrong with disks, mounting? Maybe you try to run two copies?
Maybe you try to run is as different user (not owner of temporary files)?
File permissions?
Have you merged yourself or it was done automagically?

BR
Iztok


ID: 134039 · Report as offensive
Profile StokeyBob
Avatar

Send message
Joined: 31 Aug 03
Posts: 848
Credit: 2,218,691
RAC: 0
United States
Message 134044 - Posted: 8 Jul 2005, 0:22:26 UTC

The Wiki has some stuff on the lockfile problem.

BOINC Wiki
ID: 134044 · Report as offensive
Profile StokeyBob
Avatar

Send message
Joined: 31 Aug 03
Posts: 848
Credit: 2,218,691
RAC: 0
United States
Message 134051 - Posted: 8 Jul 2005, 0:31:46 UTC
Last modified: 8 Jul 2005, 0:33:07 UTC

It also has some stuff on "process_exited_with_code 251".



<a> BOINC Wiki[/url]
ID: 134051 · Report as offensive
TPR_Mojo
Volunteer tester

Send message
Joined: 18 Apr 00
Posts: 323
Credit: 7,001,052
RAC: 0
United Kingdom
Message 134053 - Posted: 8 Jul 2005, 0:32:34 UTC
Last modified: 8 Jul 2005, 0:45:45 UTC

Ok I'll make this real simple.

I know this machine has issues, I think it is memory but I am still tracking it down. That isn't my point, it has crunched 60k+ CS successfully so it isn't really that bad. Plus I have 13 machines which rely on this one to function and they are still running and producing results. So the machine runs.

This machine has a little brother, it shows an RAC of 206.75, hostID 898337
This machine's host record has disappeared completely, RAC should be 220+
I manually merged two ghost copies earlier today, all live WU's are there, but RAC is 3+ not 220+

Where has my machine's host record gone?


ID: 134053 · Report as offensive
John McLeod VII
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 15 Jul 99
Posts: 24806
Credit: 790,712
RAC: 0
United States
Message 134068 - Posted: 8 Jul 2005, 1:29:33 UTC - in response to Message 134053.  

Ok I'll make this real simple.

I know this machine has issues, I think it is memory but I am still tracking it down. That isn't my point, it has crunched 60k+ CS successfully so it isn't really that bad. Plus I have 13 machines which rely on this one to function and they are still running and producing results. So the machine runs.

This machine has a little brother, it shows an RAC of 206.75, hostID 898337
This machine's host record has disappeared completely, RAC should be 220+
I manually merged two ghost copies earlier today, all live WU's are there, but RAC is 3+ not 220+

Where has my machine's host record gone?


When a machine stops producing good work, the RAC will drop. This is to be expected.


BOINC WIKI
ID: 134068 · Report as offensive
Profile StokeyBob
Avatar

Send message
Joined: 31 Aug 03
Posts: 848
Credit: 2,218,691
RAC: 0
United States
Message 134115 - Posted: 8 Jul 2005, 2:56:02 UTC

If I click on the link you posted;

http://setiathome.berkeley.edu/show_host_detail.php?hostid=1154305

Then click on the number next to the results at the bottom.

Then if I click on the numbers all the way to the left I get linked to a page that shows a result.

http://setiathome.berkeley.edu/result.php?resultid=81953270

Maybe if you look through them you can figure out when and why it started coming up with invalid work units being reported.
ID: 134115 · Report as offensive

Message boards : Number crunching : Confused: machine dropped from top to bottom of rankings?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.