Panic Mode On (99) Server Problems?

Message boards : Number crunching : Panic Mode On (99) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 13 · 14 · 15 · 16 · 17 · 18 · 19 . . . 26 · Next

AuthorMessage
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34257
Credit: 79,922,639
RAC: 80
Germany
Message 1710313 - Posted: 9 Aug 2015, 8:52:08 UTC

My cache is still full but it takes up to 7 requests here.
Got a batch of 33 GPU tasks one hour ago.


With each crime and every kindness we birth our future.
ID: 1710313 · Report as offensive
WezH
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 576
Credit: 67,033,957
RAC: 95
Finland
Message 1710319 - Posted: 9 Aug 2015, 9:19:14 UTC

It has been over 4 hours ago when I got two Nvidia GPU wu's... and they were done under 20 minutes... After that, 25 requests and no tasks...
ID: 1710319 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1710334 - Posted: 9 Aug 2015, 10:48:45 UTC

yup no gpu work since hours
I came down with a bad case of i don't give a crap
ID: 1710334 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1710350 - Posted: 9 Aug 2015, 11:55:06 UTC
Last modified: 9 Aug 2015, 11:57:12 UTC

Upped my cache level on my T5500 to pull in some VLARs, i think all but 5 tasks are VLARs, will do the non-VLARs first, then resends,
with 100 tasks on board this is about two weeks work, have set NNT for now.

Edit: Did the same on my T8100, all it's picked up so far are VLARs.

With the MB v7 tasks ready send close to 400k, the splitters are eithier still going or there are a lot of Wu's timing out, or becoming inconclusive.

Claggy
ID: 1710350 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22191
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1710359 - Posted: 9 Aug 2015, 12:26:57 UTC

Roll on the release of the OpenCL application for VLARs on GPUs.

Thinking aloud - would it be possible to make that application a special application only for use against VLARs? This might overcome the performance hit that was being suffered in Beta when using the application for normals and shorties.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1710359 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1710360 - Posted: 9 Aug 2015, 12:35:53 UTC - in response to Message 1710350.  

Upped my cache level on my T5500 to pull in some VLARs...

Did the same on an i5, and topped up to 100 tasks exclusively with first-run VLARs.

We're currently working through the (many) tapes recorded on 16 May: there was some interesting astronomy going on that day, according to the Arecibo Observatory Telescope Schedule.

http://www.naic.edu/vscience/schedule/tpfiles/MinchintagA2048tp.pdf
http://www.naic.edu/vscience/schedule/tpfiles/LorimertagA2854tp.pdf
http://www.naic.edu/vscience/schedule/tpfiles/KaspitagP2030tp.pdf
http://www.naic.edu/vscience/schedule/tpfiles/DenevatagP2859tp.pdf

(though possibly of more interest to Einstein than to SETI)
ID: 1710360 · Report as offensive
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5517
Credit: 528,817,460
RAC: 242
United States
Message 1710371 - Posted: 9 Aug 2015, 13:28:30 UTC - in response to Message 1710359.  

Roll on the release of the OpenCL application for VLARs on GPUs.

Thinking aloud - would it be possible to make that application a special application only for use against VLARs? This might overcome the performance hit that was being suffered in Beta when using the application for normals and shorties.



I know they asked that question about a week ago. I think there still wasn't enough ATI GPU users testing it to give them a yes or no as to if it would work. Anyone have any further information?
ID: 1710371 · Report as offensive
WezH
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 576
Credit: 67,033,957
RAC: 95
Finland
Message 1710385 - Posted: 9 Aug 2015, 14:07:24 UTC - in response to Message 1710319.  

It has been over 4 hours ago when I got two Nvidia GPU wu's... and they were done under 20 minutes... After that, 25 requests and no tasks...


Suddenly I did get 50 tasks for Nvidia, after that, zero for several times
ID: 1710385 · Report as offensive
Profile Zalster Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 27 May 99
Posts: 5517
Credit: 528,817,460
RAC: 242
United States
Message 1710399 - Posted: 9 Aug 2015, 14:48:28 UTC - in response to Message 1710385.  

Yeah, seems like we caught a break. A bunch just downloaded onto 1 machine. The other got a dozen or some more.
ID: 1710399 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1710406 - Posted: 9 Aug 2015, 15:15:56 UTC
Last modified: 9 Aug 2015, 15:50:12 UTC

I've just been idly wondering why I have so many validated tasks still showing on my list. And found the cut-off point:

Task 4294795247 --> WU 1858492176 (just a slow wingmate, all good)
Task 4294972138 --> WU 1858575073 (Unable to handle request, can't find workunit)

Anyone recognise the value 4,294,967,296 - between those two task IDs? Yup, 2^32. Oops.

Edit - Task 4294967296 itself has been validated, but is in the "can't find workunit" state.
ID: 1710406 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1710423 - Posted: 9 Aug 2015, 15:48:21 UTC - in response to Message 1710406.  

Task 4294972138 --> WU 1858575073 (Unable to handle request, can't find workunit)

JBird reported 8 of those in Q&A.
ID: 1710423 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1710425 - Posted: 9 Aug 2015, 15:49:20 UTC - in response to Message 1710406.  

I've just been idly wondering why I have so many validated tasks still showing on my list. And found the cut-off point:

Task 4294795247 --> WU 1858492176 (just a slow wingmate, all good)
Task 4294972138 --> WU 1858575073 (Unable to handle request, can't find workunit)

Anyone recognise the value 4,294,967,296 - between those two task IDs? Yup, 2^32. Oops.

Yes, the data base purge seems to have stopped right after they changed the size of the WU names. Maybe they forgot to change the purge program to match?
Donald
Infernal Optimist / Submariner, retired
ID: 1710425 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1710431 - Posted: 9 Aug 2015, 15:57:04 UTC - in response to Message 1710425.  

I'd reckon the file deletion program is returning an error for result files associated with a ResultID >= 2^32, accounting for the line

Result files waiting for deletion 7,847,326

on the server status page. And if the result file can't be deleted, the result status in the database can't transition to "ready to purge". So both the file storage area on the server disks is filling up (slowly, these are small files), and the table rowcount in the database is growing inexorably. Better put that on their ToDo list for Tuesday.
ID: 1710431 · Report as offensive
Profile Zombu2
Volunteer tester

Send message
Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1710466 - Posted: 9 Aug 2015, 17:06:41 UTC

The dB is so gonna explode if it keeps growing like that
I came down with a bad case of i don't give a crap
ID: 1710466 · Report as offensive
WezH
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 576
Credit: 67,033,957
RAC: 95
Finland
Message 1710470 - Posted: 9 Aug 2015, 17:17:24 UTC - in response to Message 1710399.  
Last modified: 9 Aug 2015, 17:20:41 UTC

Yeah, seems like we caught a break. A bunch just downloaded onto 1 machine. The other got a dozen or some more.


That was short one, again only VLAR's. My 560Ti got two tasks, and they are completed in 10 minutes...

I hope that those 21(2/3)my tapes will have more non-VLAR's... if not, then is time to panic :)

EDIT: As I was typing, I got 14 GPU tasks from 16my15am -tapes
ID: 1710470 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1710479 - Posted: 9 Aug 2015, 17:32:51 UTC - in response to Message 1710466.  

The dB is so gonna explode if it keeps growing like that

Both David and Eric have acknowledged my report - they're on the case.

I reckon the dB can hold out until Tuesday - we've been higher than this, with the WU table exploding too, and lived to tell the tale ...
ID: 1710479 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1710485 - Posted: 9 Aug 2015, 17:45:46 UTC - in response to Message 1710479.  

I'd expect they have some script running that warns them when magic number N about the amount of results/tasks waiting for 'X' is passed and that it then plays a trumpet at them. Perhaps that the sound's been turned off.
ID: 1710485 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1710489 - Posted: 9 Aug 2015, 17:50:26 UTC - in response to Message 1710485.  

I'd expect they have some script running that warns them when magic number N about the amount of results/tasks waiting for 'X' is passed and that it then plays a trumpet at them. Perhaps that the sound's been turned off.

I think we are the default trumpeters ;)
ID: 1710489 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1710492 - Posted: 9 Aug 2015, 17:52:25 UTC - in response to Message 1710489.  

I just sent in a report that the security of the web sites (BOINC and Seti) is a bit lacking, in that Poodle can attack, the certificate is going to be blocked next year and such fun things. I think that with stuff like that they turn me up. ;-)
ID: 1710492 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1710525 - Posted: 9 Aug 2015, 18:48:18 UTC - in response to Message 1710359.  
Last modified: 9 Aug 2015, 18:51:21 UTC

Roll on the release of the OpenCL application for VLARs on GPUs.

Thinking aloud - would it be possible to make that application a special application only for use against VLARs? This might overcome the performance hit that was being suffered in Beta when using the application for normals and shorties.

There is a CPU App sitting at Beta right now that would basically Double the number of VLARs processed per hour in OSX. Right now the FLOPS reading is;
Mac OS X/Intel 7.00 29 May 2013, 21:14:00 UTC 30,342 GigaFLOPS
That number would Double as soon as the CPU App is released, that's a Lot of VLARs. Yet, the App just sits at Beta even though not a Single Machine has had a problem with it in the Months it's been there.

Strange isn't it...
ID: 1710525 · Report as offensive
Previous · 1 . . . 13 · 14 · 15 · 16 · 17 · 18 · 19 . . . 26 · Next

Message boards : Number crunching : Panic Mode On (99) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.