Can different machines be this different?

Message boards : Number crunching : Can different machines be this different?
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Jean-David Beyer

Send message
Joined: 10 Jun 99
Posts: 60
Credit: 1,301,105
RAC: 1
United States
Message 1998428 - Posted: 16 Jun 2019, 17:04:24 UTC

Workunit 3519822979
name blc33_2bit_guppi_58406_31708_HIP20440_0117.5281.818.22.45.29.vlar
application SETI@home v8
created 15 Jun 2019, 11:29:26 UTC
minimum quorum 2
initial replication 3
max # of error/total/success tasks 5, 10, 5
Task
click for details Computer Sent Time reported
or deadline
explain Status Run time
(sec) CPU time
(sec) Credit Application
7772265929 8565391 15 Jun 2019, 16:47:29 UTC 15 Jun 2019, 16:52:39 UTC Completed, validation inconclusive 29.28 15.05 pending SETI@home v8 v8.00 (cuda32)
windows_intelx86
7772265930 6846706 15 Jun 2019, 16:47:32 UTC 16 Jun 2019, 11:44:51 UTC Completed, validation inconclusive 7,928.05 7,560.70 pending SETI@home v8 v8.00
x86_64-pc-linux-gnu


My machine (6846706) ran this task and seemed to work correctly. The other machine (8565391) also ran it, and its stderr file is so completely different that I cannot imagine we ever ran the same task. Yet neither of them seem to have failed. Do results differ this much often?
ID: 1998428 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22200
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1998431 - Posted: 16 Jun 2019, 17:38:59 UTC

As this work unit has, so far, not been validated (status reports as "Completed, validation inconclusive") I'm not surprised that the stderr are so different.
There are a number of things to consider when looking at stderr, not the least is what applications were being used, the detail of the stderr is very much at the behest of the developers. The other computer has a very high "inconclusive" and "invalid" rates which indicate that computer has an issue, possibly hardware or driver related.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1998431 · Report as offensive
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5124
Credit: 276,046,078
RAC: 462
Message 1998432 - Posted: 16 Jun 2019, 17:42:08 UTC - in response to Message 1998428.  
Last modified: 16 Jun 2019, 17:43:20 UTC


7772265930 6846706 15 Jun 2019, 16:47:32 UTC 16 Jun 2019, 11:44:51 UTC Completed, validation inconclusive 7,928.05 7,560.70 pending SETI@home v8 v8.00
x86_64-pc-linux-gnu

My machine (6846706) ran this task and seemed to work correctly. The other machine (8565391) also ran it, and its stderr file is so completely different that I cannot imagine we ever ran the same task. Yet neither of them seem to have failed. Do results differ this much often?


validation inconclusive 

Is a pretty common issue although it is not a "high volume" issue.

My understanding is some machines/apps have a lower chance of getting the same calculated results as other machines. That cause's the "validation inconclusive" for both machines.
This causes the Seti scheduler to send out the task again for a "tie breaker".
And eventually everyone gets credit for the crunching (I think).

Tom
A proud member of the OFA (Old Farts Association).
ID: 1998432 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1998434 - Posted: 16 Jun 2019, 17:47:11 UTC - in response to Message 1998428.  

Sadly, your wingmate's computer (8565391) is consistently producing rubbish - for the time being, at least.

All tasks for computer 8565391

Look at the number of tasks found to be, first, inconclusive, and ultimately invalid.

That's why every task is sent to two, independent, computers. When rubbish results are encountered, the work is checked a third, fourth, fifth time - as long as it takes (within reason) to be sure that the work is done properly. Your contribution will, most likely, be eventually confirmed by another volunteer and accepted as valid.
ID: 1998434 · Report as offensive
Profile Joseph Stateson Project Donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 309
Credit: 70,759,933
RAC: 3
United States
Message 1998436 - Posted: 16 Jun 2019, 17:56:25 UTC - in response to Message 1998428.  

I looked up that work unit and its"name"
blc33_2bit_guppi_58406_31708_HIP20440_0117.5281.818.22.45.29.vlar
at the location
https://setiathome.berkeley.edu/workunit.php?wuid=3519822979

All results completed are inconclusive and it is pretty obvious there are huge differences in the results.
The GPU task had info about OpenCL and your CPU one was simpler.

Be interesting to see that the ATI system returns and other systems will probably receive the task if the ATI one is also much different.
ID: 1998436 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1998439 - Posted: 16 Jun 2019, 18:23:31 UTC

If I see a CUDA32 app, I normally immediately discount the machine.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1998439 · Report as offensive
Profile Jean-David Beyer

Send message
Joined: 10 Jun 99
Posts: 60
Credit: 1,301,105
RAC: 1
United States
Message 1998467 - Posted: 16 Jun 2019, 21:58:15 UTC - in response to Message 1998434.  

Look at the number of tasks found to be, first, inconclusive, and ultimately invalid.


Very interesting. He certainly has a very fast machine though. Too bad the results seem so often wrong.

State: All (3520) · In progress (25) · Validation pending (1333) · Validation inconclusive (1337) · Valid (33) · Invalid (792) · Error (0)

ID: 1998467 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1998469 - Posted: 16 Jun 2019, 22:12:33 UTC

It's not fast, it's just overflowing perfectly good work. ;-)

Cheers.
ID: 1998469 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1998475 - Posted: 16 Jun 2019, 22:46:52 UTC - in response to Message 1998469.  

It's not fast, it's just overflowing perfectly good work. ;-)

Cheers.

Yes, because it is using the CUDA32 application. Unless one knows exactly the specs of their gpu and the status of which deprecated legacy graphics driver to properly run the CUDA32 application, the normal thing to happen is to trash perfectly good work and keep multiple wingmen busy.

In my opinion the CUDA23, CUD32, CUDA42 and CUDA50 apps should all be deprecated since the scheduler can't make a good decision about which compatible app to send to clients anymore. The SoG app should be the one and only gpu application for Multiband.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1998475 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1998476 - Posted: 16 Jun 2019, 23:05:00 UTC - in response to Message 1998475.  

It's not fast, it's just overflowing perfectly good work. ;-)

Cheers.
Yes, because it is using the CUDA32 application. Unless one knows exactly the specs of their gpu and the status of which deprecated legacy graphics driver to properly run the CUDA32 application, the normal thing to happen is to trash perfectly good work and keep multiple wingmen busy.

In my opinion the CUDA23, CUD32, CUDA42 and CUDA50 apps should all be deprecated since the scheduler can't make a good decision about which compatible app to send to clients anymore. The SoG app should be the one and only gpu application for Multiband.
And goodness knows when that 10yrs old pre-Fermi card was last cleaned.

Also pre-Fermi GPU's and Win10 are a bad combination considering its poor driver function/support.

Cheers.
ID: 1998476 · Report as offensive
Stephen "Heretic" Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1998485 - Posted: 17 Jun 2019, 0:52:53 UTC - in response to Message 1998467.  

Look at the number of tasks found to be, first, inconclusive, and ultimately invalid.

Very interesting. He certainly has a very fast machine though. Too bad the results seem so often wrong.
State: All (3520) · In progress (25) · Validation pending (1333) · Validation inconclusive (1337) · Valid (33) · Invalid (792) · Error (0)

. . The apparent speed of this machine is a deception to a degree because of the mis-operation and rubbish results. If that were resolved its run times would be longer but the results valid.

Stephen

:(
ID: 1998485 · Report as offensive

Message boards : Number crunching : Can different machines be this different?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.