Panic Mode On (101) Server Problems?

Message boards : Number crunching : Panic Mode On (101) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 10 · 11 · 12 · 13 · 14 · 15 · 16 . . . 27 · Next

AuthorMessage
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1740994 - Posted: 10 Nov 2015, 1:11:26 UTC

Also to note.. not that it really matters at all, but if you take a look at your application details page for any machine... the old, discontinued apps have been purged (APv505/6, MB6).
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1740994 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1740997 - Posted: 10 Nov 2015, 1:19:34 UTC - in response to Message 1740993.  

TL

Abort any MB tasks that were split on Nov 4 with a tag of _4 or over. They will be invalid anyways, might as well take and error for it now than waste the time on it.

The faster we can get these out of the system, the better.
ID: 1740997 · Report as offensive
Profile TimeLord04
Volunteer tester
Avatar

Send message
Joined: 9 Mar 06
Posts: 21140
Credit: 33,933,039
RAC: 23
United States
Message 1741000 - Posted: 10 Nov 2015, 1:33:49 UTC - in response to Message 1740997.  

TL

Abort any MB tasks that were split on Nov 4 with a tag of _4 or over. They will be invalid anyways, might as well take and error for it now than waste the time on it.

The faster we can get these out of the system, the better.

Well, it seems that both my systems are clear of any Nov 4 WUs... So, I hope that means that after the 23 Inconclusives are marked Invalid that I will be clear of this mess.


TL
TimeLord04
Have TARDIS, will travel...
Come along K-9!
Join Calm Chaos
ID: 1741000 · Report as offensive
Profile Bill G Special Project $75 donor
Avatar

Send message
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1741002 - Posted: 10 Nov 2015, 1:41:36 UTC - in response to Message 1741000.  

TL

Abort any MB tasks that were split on Nov 4 with a tag of _4 or over. They will be invalid anyways, might as well take and error for it now than waste the time on it.

The faster we can get these out of the system, the better.

Well, it seems that both my systems are clear of any Nov 4 WUs... So, I hope that means that after the 23 Inconclusives are marked Invalid that I will be clear of this mess.


TL


Actually I will continue to work them, they all have to be worked until there have been enough Invalids to make them go away. Until then they will continue to circle around.

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1741002 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1741005 - Posted: 10 Nov 2015, 1:48:24 UTC - in response to Message 1741000.  

I see two _7 and one _4 on your AMD.

And one _9 on your Intel.

They are all Nov 4 original splits.
ID: 1741005 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1741026 - Posted: 10 Nov 2015, 3:22:11 UTC - in response to Message 1740997.  

Abort any MB tasks that were split on Nov 4 with a tag of _4 or over. They will be invalid anyways, might as well take and error for it now than waste the time on it.

The faster we can get these out of the system, the better.

Thanks. Should have thought of this myself, sure crunched a ton of VLARs I didn't need to ...
ID: 1741026 · Report as offensive
Profile TimeLord04
Volunteer tester
Avatar

Send message
Joined: 9 Mar 06
Posts: 21140
Credit: 33,933,039
RAC: 23
United States
Message 1741032 - Posted: 10 Nov 2015, 3:42:21 UTC - in response to Message 1741005.  

I see two _7 and one _4 on your AMD.

And one _9 on your Intel.

They are all Nov 4 original splits.

Unfortunately; you are mistaken. I just rechecked my Tasks in queue on both machines; and NO Nov 4 units are present. You must be seeing completed units that are in the Inconclusive category, and waiting for validation to be marked Invalid... As stated, I have 23 of those waiting for validation.

I have plenty of 08ap11... 11no11... 08ja11...

The ONLY 04... Unit I have in queue is on Exeter, (GTX-760), AMD machine, and is 04ap11...

Unless, they are GHOST units; all I need on top of faulty Invalids, so, don't know what to say - but NO Nov 4 Units on my two systems.


TL
TimeLord04
Have TARDIS, will travel...
Come along K-9!
Join Calm Chaos
ID: 1741032 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1741041 - Posted: 10 Nov 2015, 4:29:23 UTC
Last modified: 10 Nov 2015, 4:30:52 UTC

Side note, which I noticed the last time this broken-WU debacle happened.. if you happen to be lucky enough to get the _9 task... that one will not reset your consecutive valid count. My single-core machine (side-note: since it was made in 2007, it finally just reached 1M credits) got a _9 the last time around and I contemplated aborting it, but I had looked at some other WUs that were affected and noticed that the machine that got _9's consecutive, didn't have their consecutive valid counter reset since the _9 task doesn't get marked as an error/invalid.

Of course, it doesn't really matter if your consecutive streak has already been obliterated. I was fortunate enough in that previous event to have only gotten one single bad WU out of that, and even luckier to get a _9.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1741041 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1741043 - Posted: 10 Nov 2015, 4:31:53 UTC - in response to Message 1741032.  

you have to look on the web page and see when the WU was initially processed, not when it was resent to you ( for the 7th time).
ID: 1741043 · Report as offensive
Profile TimeLord04
Volunteer tester
Avatar

Send message
Joined: 9 Mar 06
Posts: 21140
Credit: 33,933,039
RAC: 23
United States
Message 1741047 - Posted: 10 Nov 2015, 4:48:48 UTC - in response to Message 1741043.  
Last modified: 10 Nov 2015, 5:03:55 UTC

you have to look on the web page and see when the WU was initially processed, not when it was resent to you ( for the 7th time).

Looking at the web page; NO UNITS of Nov 4th or 5th can be aborted; as requested by your original post to me... ALL of these Units have been completed and marked Invalid, now up to 60; or are in the Inconclusive pile; now down to 22.

So, AS I STATED, my queue is CLEAR of these units... I have NO MORE to be processed.


TL

[EDIT:]

Also, it seems that the units marked as Invalid or Inconclusive start from Nov 4 and go through Nov 8 issuance; AGAIN, ALL completed, and marked, or waiting to be marked; so UNABLE to be aborted.
TimeLord04
Have TARDIS, will travel...
Come along K-9!
Join Calm Chaos
ID: 1741047 · Report as offensive
ChrisD
Volunteer tester

Send message
Joined: 25 Sep 99
Posts: 158
Credit: 2,496,342
RAC: 0
Denmark
Message 1741088 - Posted: 10 Nov 2015, 8:54:17 UTC
Last modified: 10 Nov 2015, 9:03:57 UTC

WU's from 23oc11ah.17390 keeps arriving in my cache, and every one is reported as invalid. Theses are split on 8th and 9th of Nov.

Just pruned my caches of 11 WU's receoved today with _5, _7 and _9.

ChrisD
ID: 1741088 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1741089 - Posted: 10 Nov 2015, 9:45:22 UTC - in response to Message 1741088.  

WU's from 23oc11ah.17390 keeps arriving in my cache, and every one is reported as invalid. Theses are split on 8th and 9th of Nov.

I have plenty of 08ap11... 11no11... 08ja11...

The ONLY 04... Unit I have in queue is... is 04ap11...

We still seem to have some severe temporal dislocation in this thread.

The problem is neither when the original data was recorded at Arecibo, nor when the individual task was replicated to send out to the latest user:



but only when the original workunit was carved out of the recording tape:

ID: 1741089 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1741097 - Posted: 10 Nov 2015, 10:33:29 UTC - in response to Message 1740994.  

Also to note.. not that it really matters at all, but if you take a look at your application details page for any machine... the old, discontinued apps have been purged (APv505/6, MB6).

I asked for an option to hide them (#1433), David chose to remove them. Is probably easier.
ID: 1741097 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1741101 - Posted: 10 Nov 2015, 10:45:39 UTC - in response to Message 1741097.  

Also to note.. not that it really matters at all, but if you take a look at your application details page for any machine... the old, discontinued apps have been purged (APv505/6, MB6).

I asked for an option to hide them (#1433), David chose to remove them. Is probably easier.

But, sadly, it removes the opportunity to verify that a new application or application version is performing as intended by comparison with previous versions. We can, for example, no longer look at the relative APR values for setiathome_enhanced and SETI@home v7 (the previous and current MB apps). Those should indicate whether <rsc_fpops_est> has been calibrated properly, and hence (IMH - but lone - O) whether CreditNew is likely to score jobs realistically.
ID: 1741101 · Report as offensive
Profile TimeLord04
Volunteer tester
Avatar

Send message
Joined: 9 Mar 06
Posts: 21140
Credit: 33,933,039
RAC: 23
United States
Message 1741108 - Posted: 10 Nov 2015, 12:46:42 UTC - in response to Message 1741089.  

WU's from 23oc11ah.17390 keeps arriving in my cache, and every one is reported as invalid. Theses are split on 8th and 9th of Nov.

I have plenty of 08ap11... 11no11... 08ja11...

The ONLY 04... Unit I have in queue is... is 04ap11...

We still seem to have some severe temporal dislocation in this thread.

The problem is neither when the original data was recorded at Arecibo, nor when the individual task was replicated to send out to the latest user:



but only when the original workunit was carved out of the recording tape:


That being the case, Richard, (now I give up, because I don't understand how, or where to find that date), I have looked in my BOINC queues on both machines for ANY units with "_7", "_4", or "_9" in each and EVERY WU in queue... NOTHING exists in either machine.

Therefore, I conclude, once again, that I am clear of these WUs... They have run their course on my machines...

Thank you - sincerely,


TL
TimeLord04
Have TARDIS, will travel...
Come along K-9!
Join Calm Chaos
ID: 1741108 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1741137 - Posted: 10 Nov 2015, 15:01:37 UTC - in response to Message 1741108.  

WU's from 23oc11ah.17390 keeps arriving in my cache, and every one is reported as invalid. Theses are split on 8th and 9th of Nov.

I have plenty of 08ap11... 11no11... 08ja11...

The ONLY 04... Unit I have in queue is... is 04ap11...

We still seem to have some severe temporal dislocation in this thread.

The problem is neither when the original data was recorded at Arecibo, nor when the individual task was replicated to send out to the latest user:



but only when the original workunit was carved out of the recording tape:


That being the case, Richard, (now I give up, because I don't understand how, or where to find that date), I have looked in my BOINC queues on both machines for ANY units with "_7", "_4", or "_9" in each and EVERY WU in queue... NOTHING exists in either machine.

Therefore, I conclude, once again, that I am clear of these WUs... They have run their course on my machines...

Thank you - sincerely,


TL

Some people seem to confuse task & workunit or use the terms interchangeably. You need to use the site to see the workunit information.
On your task list click on the ID number under the Work unit click for details column instead of the ID under the Task
click for details
column.

I like to think of it as:
A task contains the record of the host & it's result data.
A workunit contains the records of all the tasks for that workunit.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1741137 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1741150 - Posted: 10 Nov 2015, 15:44:54 UTC - in response to Message 1740467.  

Yes, finally some Mac nVidia GPUs. Thank you very much.
Now to see how long it lasts.
;-)

Appears to have been broken again for about a day or so...
NVIDIA
Mac
No GPU tasks reported
ID: 1741150 · Report as offensive
Profile Jimbocous Project Donor
Volunteer tester
Avatar

Send message
Joined: 1 Apr 13
Posts: 1853
Credit: 268,616,081
RAC: 1,349
United States
Message 1741180 - Posted: 10 Nov 2015, 23:59:48 UTC

So we're back from maintenance, but unless I'm mistaken the splitters have not yet been turned up??
ID: 1741180 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1741184 - Posted: 11 Nov 2015, 0:11:38 UTC - in response to Message 1741180.  
Last modified: 11 Nov 2015, 0:56:59 UTC

So we're back from maintenance, but unless I'm mistaken the splitters have not yet been turned up??

Read Technical News.

Splits (Nov 10 2015)

BUT ALSO we needed to update some fields in the current science database schema to also make the database itself telescope agnostic. Just a few "alter table" commands to lengthen the tape name fields beyond 20 characters. We thought these alters would take a few hours (and completed before the end of today's Tuesday outage). Now it looks like it might take a day. We can't split/assimilate any new work until the alters are finished. Oh well. We're going to run out of work tonight, but should have fresh work sometime tomorrow morning. It is a holiday tomorrow, so cut us some slack, if it's later than tomorrow morning :).


Claggy
ID: 1741184 · Report as offensive
Profile JaundicedEye
Avatar

Send message
Joined: 14 Mar 12
Posts: 5375
Credit: 30,870,693
RAC: 1
United States
Message 1741204 - Posted: 11 Nov 2015, 1:06:21 UTC

it is a holiday tomorrow, so cut us some slack, if it's later than tomorrow morning :)

Veteran's day is not a Federal Holiday........although it should be!

"Sour Grapes make a bitter Whine." <(0)>
ID: 1741204 · Report as offensive
Previous · 1 . . . 10 · 11 · 12 · 13 · 14 · 15 · 16 . . . 27 · Next

Message boards : Number crunching : Panic Mode On (101) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.