Another 'Sorry' thread (should we sticky one?)

Message boards : Number crunching : Another 'Sorry' thread (should we sticky one?)
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile Woodgie
Avatar

Send message
Joined: 6 Dec 99
Posts: 134
Credit: 89,630,417
RAC: 55
United Kingdom
Message 1622803 - Posted: 3 Jan 2015, 15:53:28 UTC
Last modified: 3 Jan 2015, 16:01:28 UTC

An apology to anyone relying on Hermes (6376517) to validate workunits but last night I did a dumb thing and updated BOINC.

Hermes is a an old Mac OS X 10.6.8 Mac Mini (CoreDuo) and really didn't like BOINC 7.4.36 (couldn't start the daemon etc.) So I RTFM and rolled back to 7.2.42. The problem is, in rolling back I went and deleted the workunits it was crunching.

Live and learn, huh? (also, 'stop and think' would have helped...) Sorry about that.
~W

ID: 1622803 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1622837 - Posted: 3 Jan 2015, 18:08:48 UTC

What is there to be sorry about? The previous work was flagged as Abandoned.
Looks like all of that work has already gone out to a 3rd host to be processed with a few already returning their results.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1622837 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 1623156 - Posted: 4 Jan 2015, 11:31:34 UTC - in response to Message 1622837.  

What is there to be sorry about?

That's what I also ask myself every time I see such a "sorry thread". Shit happens.
ID: 1623156 · Report as offensive
Profile JanniCash
Avatar

Send message
Joined: 17 Nov 03
Posts: 57
Credit: 1,276,920
RAC: 0
United States
Message 1623270 - Posted: 4 Jan 2015, 19:57:43 UTC

I think there is a slight difference between aborting (manually or automatic) tasks and leaving them hanging "in progress". In the former case they are reassigned to another system in a timely fashion. In the latter case they need to time out, which takes considerable time.

I created one of those "in progress" hanging tasks myself by destroying a test VM before cleaning out the task list. Is there any way to abort that or is it better to just wait?

http://setiathome.berkeley.edu/workunit.php?wuid=1657323561
ID: 1623270 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1623285 - Posted: 4 Jan 2015, 20:28:29 UTC - in response to Message 1623270.  

I think there is a slight difference between aborting (manually or automatic) tasks and leaving them hanging "in progress". In the former case they are reassigned to another system in a timely fashion. In the latter case they need to time out, which takes considerable time.

I created one of those "in progress" hanging tasks myself by destroying a test VM before cleaning out the task list. Is there any way to abort that or is it better to just wait?

http://setiathome.berkeley.edu/workunit.php?wuid=1657323561

You could create a new VM with the same configuration and machine name. Then merge the two hosts. Once merged the new host would get the old task as a resend. Which could then be aborted.
Or you can just let it time out. Which is normally what I do. As the merging of hosts is never guaranteed.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1623285 · Report as offensive
Profile JanniCash
Avatar

Send message
Joined: 17 Nov 03
Posts: 57
Credit: 1,276,920
RAC: 0
United States
Message 1623324 - Posted: 4 Jan 2015, 21:34:15 UTC - in response to Message 1623285.  

You could create a new VM with the same configuration and machine name. Then merge the two hosts.


I'll give that a try. Since the original one that create the problem was a test deployment of an OVF template, this may actually work. Unless the MAC address of the machine is somehow part of the host identity.

Thanks!
ID: 1623324 · Report as offensive
Profile JanniCash
Avatar

Send message
Joined: 17 Nov 03
Posts: 57
Credit: 1,276,920
RAC: 0
United States
Message 1623349 - Posted: 4 Jan 2015, 23:01:41 UTC - in response to Message 1623324.  

You could create a new VM with the same configuration and machine name. Then merge the two hosts.


I'll give that a try.


It automagically merged the host and abandoned the taks. All good now.


Thanks again.
ID: 1623349 · Report as offensive
Profile Woodgie
Avatar

Send message
Joined: 6 Dec 99
Posts: 134
Credit: 89,630,417
RAC: 55
United Kingdom
Message 1623481 - Posted: 5 Jan 2015, 6:24:40 UTC - in response to Message 1623349.  

It automagically merged the host and abandoned the taks. All good now.


Thanks again.


This is what appears to have happened with Hermes. I was a bit confused when HAL9000 pointed out they'd been abandoned as I hadn't abandoned them, just deleted them from the machine.

So they'll be re-assigned in a timely fashion, good. All's well that ends well then?
~W

ID: 1623481 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1623664 - Posted: 5 Jan 2015, 13:18:45 UTC - in response to Message 1623481.  
Last modified: 5 Jan 2015, 13:19:24 UTC

It automagically merged the host and abandoned the taks. All good now.


Thanks again.


This is what appears to have happened with Hermes. I was a bit confused when HAL9000 pointed out they'd been abandoned as I hadn't abandoned them, just deleted them from the machine.

So they'll be re-assigned in a timely fashion, good. All's well that ends well then?

I think the task status of "Abandoned" is a server side abort vs when you select Abort from the client the task status will be "Aborted by user".
however knowing BOINC there could be 37 different triggers for those task status.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1623664 · Report as offensive

Message boards : Number crunching : Another 'Sorry' thread (should we sticky one?)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.