Work fetch bug in BOINC v5.10.13

Message boards : Number crunching : Work fetch bug in BOINC v5.10.13

Richard Haselgrove
Volunteer tester

Joined: 4 Jul 99
Posts: 14690
Credit: 200,643,578
RAC: 874
United Kingdom
Message 616288 - Posted: 8 Aug 2007, 11:21:21 UTC
Last modified: 8 Aug 2007, 11:52:26 UTC

During the outage recovery, I noticed a problem related to 'Redundant result - Cancelled by server'. This is a heads-up in case anyone - particularly those with fast processors and large caches - hits the same thing.

When the servers came back up, my fast box was allocated a lot of WUs quite quickly, but struggled to download them through the congestion. However, other people obviously did download them, because some became redundant before I'd even managed to complete the download.

Once I'd reported the (aborted) result, my trusty BoincView was showing green across the board, but no new work was being fetched. I discovered that the machine was still re-trying five downloads, but getting 'file not found' (obviously, because the WU had been assimilated and the datapak deleted from the server). I just aborted the meaningless downloads, and work fetch started back up as normal.

BOINC needs to check for, and cancel, any associated transfers when doing a server-mandated abort. I'll log a Trac report. Edit: now logged as Trac 366.
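
To make the suggestion concrete, here is a minimal Python sketch of the idea. It is not BOINC's actual code (the client is written in C++), and every name in it (`Client`, `Result`, `Transfer`, `server_abort`, `work_fetch_allowed`) is hypothetical. It also assumes that a download stuck in retry holds back work fetch, which is what the behaviour described above suggests but which I have not confirmed in the source.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Transfer:
    file_name: str
    state: str = "retrying"  # hypothetical states: "retrying", "done", "cancelled"


@dataclass
class Result:
    name: str
    input_files: List[str]
    state: str = "downloading"  # hypothetical states: "downloading", "aborted"


@dataclass
class Client:
    results: Dict[str, Result] = field(default_factory=dict)
    transfers: Dict[str, Transfer] = field(default_factory=dict)

    def work_fetch_allowed(self) -> bool:
        # Assumption: any download still retrying blocks new work requests.
        return not any(t.state == "retrying" for t in self.transfers.values())

    def server_abort(self, result_name: str) -> None:
        """Abort a result on the server's instruction and cancel its transfers."""
        result = self.results[result_name]
        result.state = "aborted"

        # Files that other, still-live results need must keep downloading.
        still_needed = {
            f
            for r in self.results.values()
            if r.state != "aborted"
            for f in r.input_files
        }

        # Cancel only the pending transfers that no live result depends on.
        for f in result.input_files:
            t = self.transfers.get(f)
            if t is not None and t.state == "retrying" and f not in still_needed:
                t.state = "cancelled"


# Usage: one redundant result whose download can never succeed.
client = Client()
client.results["wu_1"] = Result("wu_1", ["wu_1.dat"])
client.transfers["wu_1.dat"] = Transfer("wu_1.dat")
client.server_abort("wu_1")
print(client.work_fetch_allowed())
```

The check against `still_needed` is there so that a file shared between two results is not cancelled while one of them is still alive. Without the cancellation step, the `wu_1.dat` transfer would stay in retry forever, which is the stall I had to clear by hand.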
ID: 616288
SATAN

Joined: 27 Aug 06
Posts: 835
Credit: 2,129,006
RAC: 0
United Kingdom
Message 616294 - Posted: 8 Aug 2007, 12:22:30 UTC

Richard, it has been around since I joined. It is not just a problem with 5.10.13. It just appears that more people are experiencing it because of the reduced quorum of results needed to validate a unit.
ID: 616294
gomeyer
Volunteer tester

Joined: 21 May 99
Posts: 488
Credit: 50,370,425
RAC: 0
United States
Message 616335 - Posted: 8 Aug 2007, 14:33:46 UTC - in response to Message 616288.  

. . . I discovered that the machine was still re-trying five downloads, but getting 'file not found' (obviously, because the WU had been assimilated and the datapak deleted from the server). I just aborted the meaningless downloads, and work fetch started back up as normal. . . .

I noticed the same thing on 3 machines. One actually ran out of work in one of two threads.
Don't know if this has happened before, but it is the first time I've seen that on any of my computers.
I am running BOINC v5.10.13.
ID: 616335
kittyman
Volunteer tester

Joined: 9 Jul 00
Posts: 51561
Credit: 1,018,363,574
RAC: 1,004
United States
Message 616873 - Posted: 9 Aug 2007, 8:03:16 UTC

Nice catch. I hope they find the little bug and stomp it. Ah yes, programs are never finished, always a work in progress, eh?
"Time is simply the mechanism that keeps everything from happening all at once."

ID: 616873

©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.