Panic Mode On (83) Server Problems?

Message boards : Number crunching : Panic Mode On (83) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 12 · 13 · 14 · 15 · 16 · 17 · 18 . . . 21 · Next

AuthorMessage
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1364621 - Posted: 4 May 2013, 23:17:00 UTC - in response to Message 1364615.  


Network traffic went from 186Mb/s up to over 300Mb/s.
Grant
Darwin NT
ID: 1364621 · Report as offensive
Lionel

Send message
Joined: 25 Mar 00
Posts: 680
Credit: 563,640,304
RAC: 597
Australia
Message 1364630 - Posted: 4 May 2013, 23:36:52 UTC - in response to Message 1364533.  

Is anyone getting any work? Or am I alone in this respect?


no you're not ... getting zip here myself mate ...



ID: 1364630 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1364647 - Posted: 5 May 2013, 0:43:49 UTC - in response to Message 1364630.  

Is anyone getting any work? Or am I alone in this respect?


no you're not ... getting zip here myself mate ...

As mentioned before- i'm getting work, but it's only on every 3rd to 10th request that i do get it.

Grant
Darwin NT
ID: 1364647 · Report as offensive
Starman
Avatar

Send message
Joined: 15 May 99
Posts: 204
Credit: 81,351,915
RAC: 25
Canada
Message 1364656 - Posted: 5 May 2013, 2:40:29 UTC

Not getting too many here, especially AP v6 Open CL, Those are the main diet for the main cruncher and that cupboard has been bare since Thursday and only received 3 since Apr. 30. One of my other crunchers has managed to get 11 in the past 2 days and has 38 of them. I guess it's all in the timing of the request.
ID: 1364656 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1364658 - Posted: 5 May 2013, 3:05:23 UTC - in response to Message 1364656.  

I guess it's all in the timing of the request.

Yep.
Grant
Darwin NT
ID: 1364658 · Report as offensive
ExchangeMan
Volunteer tester

Send message
Joined: 9 Jan 00
Posts: 115
Credit: 157,719,104
RAC: 0
United States
Message 1364659 - Posted: 5 May 2013, 3:06:53 UTC

In the last half hour my wish was finally granted. Got a nice load of CPU, GPU and AP! Now that AP splitters are running, maybe things will get back to normal.
ID: 1364659 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1364660 - Posted: 5 May 2013, 3:08:43 UTC - in response to Message 1364621.  

Network traffic went from 186Mb/s up to over 300Mb/s.

Yeah, probably due to APs being split again, and sent out as soon as they hit the feeder.....
Donald
Infernal Optimist / Submariner, retired
ID: 1364660 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1364661 - Posted: 5 May 2013, 3:17:42 UTC - in response to Message 1364586.  

Philip J. Fry wrote:
But it looks strange.
Everytime I look to the server status page, the AP splitter are 'Not Running'.
My BOINC don't get new AP WUs since 3 days. Normally every day ~ 5 AP WUs.

Grant (SSSF) wrote:
You will only get work if work is being produced.
For work to be produced the splitters have to be running.
They can't run if there isn't any data for them to process.
There isn't any data for them to process, so they're not running.

And why there isn't any data for them to process? ;-)

The tapes don't have useful AP data in it since 3 days?

Philip, for all the time that you have been here I thought that you would be able to read the Server Status page and there for understand the reason without having to ask.

AP's are split off the same files that MB's are but the AP's are split off that much faster than the MB's are being that there are a lot less AP's coming from 1 file then there are MB's.

If you had a look at the files to be split the last few days you would of noticed a lot of files there that had already finished splitting AP's but were still waiting to complete MB's splitting.

Now that those files are almost finished now I expect that new files will start to be loaded again and AP's will be split very quickly again and will have to wait for the MB's to catch up again.

Cheers.

And, just for the record, those data files are shown on the right-hand side of the Server Status Page, and shows for each file whether it is being split for MB, AP, waiting, or Done.
Donald
Infernal Optimist / Submariner, retired
ID: 1364661 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1364687 - Posted: 5 May 2013, 6:58:35 UTC

I've gotten 19 APs today with probably about 100 requests. Was able to fill up my 10-day cache. Then I was updating my spreadsheet and noticed that I got assigned my 2500th AP for r557 on this CPU.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1364687 · Report as offensive
Mark Lybeck

Send message
Joined: 9 Aug 99
Posts: 245
Credit: 216,677,290
RAC: 173
Finland
Message 1364690 - Posted: 5 May 2013, 7:09:31 UTC

I do not seem to be getting any work for one of my computers.
Is BOINC able to detect and send work only to the fastest hosts if you have multiple hosts?

No GPU work. Recently my other computer got 97 new tasks...

05/05/2013 09:50:43 | SETI@home | [sched_op] Starting scheduler request
05/05/2013 09:50:43 | SETI@home | Sending scheduler request: To fetch work.
05/05/2013 09:50:43 | SETI@home | Reporting 1 completed tasks
05/05/2013 09:50:43 | SETI@home | Requesting new tasks for CPU
05/05/2013 09:50:43 | SETI@home | [sched_op] CPU work request: 84087.83 seconds; 0.00 devices
05/05/2013 09:50:43 | SETI@home | [sched_op] NVIDIA work request: 0.00 seconds; 0.00 devices
05/05/2013 09:50:46 | SETI@home | Scheduler request completed: got 4 new tasks
05/05/2013 09:50:46 | SETI@home | [sched_op] Server version 701
05/05/2013 09:50:46 | SETI@home | Project requested delay of 303 seconds
05/05/2013 09:50:46 | SETI@home | [sched_op] estimated total CPU task duration: 33068 seconds
05/05/2013 09:50:46 | SETI@home | [sched_op] estimated total NVIDIA task duration: 0 seconds
05/05/2013 09:50:46 | SETI@home | [sched_op] handle_scheduler_reply(): got ack for task 03mr13ad.25526.11928.11.11.138_0
05/05/2013 09:50:46 | SETI@home | [sched_op] Deferring communication for 5 min 3 sec
05/05/2013 09:50:46 | SETI@home | [sched_op] Reason: requested by project
05/05/2013 09:50:48 | SETI@home | Started download of 02my11ae.26493.3909.10.11.84.vlar
05/05/2013 09:50:48 | SETI@home | Started download of 02my11ae.26493.3909.10.11.214.vlar
05/05/2013 09:50:48 | SETI@home | Started download of 02my11ae.26493.3909.10.11.120.vlar
05/05/2013 09:50:48 | SETI@home | Started download of 02my11ae.26493.3909.10.11.220.vlar
05/05/2013 09:50:52 | SETI@home | Finished download of 02my11ae.26493.3909.10.11.84.vlar
05/05/2013 09:50:52 | SETI@home | Finished download of 02my11ae.26493.3909.10.11.214.vlar
05/05/2013 09:50:52 | SETI@home | Finished download of 02my11ae.26493.3909.10.11.120.vlar
05/05/2013 09:50:52 | SETI@home | Finished download of 02my11ae.26493.3909.10.11.220.vlar
05/05/2013 09:55:51 | SETI@home | [sched_op] Starting scheduler request
05/05/2013 09:55:51 | SETI@home | Sending scheduler request: To fetch work.
05/05/2013 09:55:51 | SETI@home | Requesting new tasks for CPU
05/05/2013 09:55:51 | SETI@home | [sched_op] CPU work request: 52269.95 seconds; 0.00 devices
05/05/2013 09:55:51 | SETI@home | [sched_op] NVIDIA work request: 0.00 seconds; 0.00 devices
05/05/2013 09:55:54 | SETI@home | Scheduler request completed: got 0 new tasks
05/05/2013 09:55:54 | SETI@home | [sched_op] Server version 701
05/05/2013 09:55:54 | SETI@home | No tasks sent
05/05/2013 09:55:54 | SETI@home | No tasks are available for SETI@home Enhanced
05/05/2013 09:55:54 | SETI@home | No tasks are available for Astropulse v505
05/05/2013 09:55:54 | SETI@home | No tasks are available for SETI@home v7
05/05/2013 09:55:54 | SETI@home | No tasks are available for AstroPulse v6
05/05/2013 09:55:54 | SETI@home | Tasks for AMD/ATI GPU are available, but your preferences are set to not accept them
05/05/2013 09:55:54 | SETI@home | This computer has reached a limit on tasks in progress
05/05/2013 09:55:54 | SETI@home | Project has no tasks available
05/05/2013 09:55:54 | SETI@home | Project requested delay of 303 seconds
05/05/2013 09:55:54 | SETI@home | [sched_op] Deferring communication for 5 min 3 sec
05/05/2013 09:55:54 | SETI@home | [sched_op] Reason: requested by project
05/05/2013 10:02:42 | SETI@home | update requested by user
05/05/2013 10:02:46 | SETI@home | [sched_op] Starting scheduler request
05/05/2013 10:02:46 | SETI@home | Sending scheduler request: Requested by user.
05/05/2013 10:02:46 | SETI@home | Requesting new tasks for CPU and NVIDIA
05/05/2013 10:02:46 | SETI@home | [sched_op] CPU work request: 53948.89 seconds; 0.00 devices
05/05/2013 10:02:46 | SETI@home | [sched_op] NVIDIA work request: 233280.00 seconds; 2.00 devices
05/05/2013 10:02:49 | SETI@home | Scheduler request completed: got 0 new tasks
05/05/2013 10:02:49 | SETI@home | [sched_op] Server version 701
05/05/2013 10:02:49 | SETI@home | Project has no tasks available
05/05/2013 10:02:49 | SETI@home | Project requested delay of 303 seconds
05/05/2013 10:02:49 | SETI@home | [sched_op] Deferring communication for 5 min 3 sec
05/05/2013 10:02:49 | SETI@home | [sched_op] Reason: requested by project

ID: 1364690 · Report as offensive
Mark Lybeck

Send message
Joined: 9 Aug 99
Posts: 245
Credit: 216,677,290
RAC: 173
Finland
Message 1364693 - Posted: 5 May 2013, 7:14:53 UTC

Hoorray got just 5 GPU WU. Too bad that will only last for about 3 minutes.
ID: 1364693 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1365001 - Posted: 6 May 2013, 1:03:37 UTC - in response to Message 1364726.  


Hmm, there was a spike in downloads, then it dropped down consideraly. About the same time i got a whole bunch of donwloads that wouldn't.
They've only just downloaded now.
Grant
Darwin NT
ID: 1365001 · Report as offensive
Profile HAL9000
Volunteer tester
Avatar

Send message
Joined: 11 Sep 99
Posts: 6534
Credit: 196,805,888
RAC: 57
United States
Message 1365119 - Posted: 6 May 2013, 13:23:55 UTC

Sorry guys. I'm getting all the WU's right now.
5/6/2013 9:00:12 AM SETI@home Scheduler request completed: got 79 new tasks

That was one of my machines first request after getting "Not requesting tasks: project is not highest priority" all weekend and running down its queue. I have to saw BOINV 7 can really be pants about work fetch. Giving it didn't even check the 2nd project since the 30th of last month.
SETI@home classic workunits: 93,865 CPU time: 863,447 hours
Join the [url=http://tinyurl.com/8y46zvu]BP6/VP6 User Group[
ID: 1365119 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1365127 - Posted: 6 May 2013, 14:38:28 UTC

Good gravy, there's a lot of b*tching in here about not getting enough work.

I don't pay close attention to what I have on hand, usually just checking once a day on weekdays. Occasionally, my i7 has a couple fewer in progress than the limit, but it has not come close to running out since the server move outage (if then). Actually, it has let itself run dry of Einstein for now. (Or is Einstein also having feed problems and I haven't noticed?)

I forget, are the colo guys supposed to be loading splitter tapes (drives), or do the project guys still have to do that? If it's only the project guys, well, hey, it was the weekend. They only go in on their own time to fix major problems, and major problems are what the colo guys ARE there to take care of.

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1365127 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1365143 - Posted: 6 May 2013, 15:31:41 UTC - in response to Message 1365127.  

Good gravy, there's a lot of b*tching in here about not getting enough work.

I don't pay close attention to what I have on hand, usually just checking once a day on weekdays. Occasionally, my i7 has a couple fewer in progress than the limit, but it has not come close to running out since the server move outage (if then). Actually, it has let itself run dry of Einstein for now. (Or is Einstein also having feed problems and I haven't noticed?)

I forget, are the colo guys supposed to be loading splitter tapes (drives), or do the project guys still have to do that? If it's only the project guys, well, hey, it was the weekend. They only go in on their own time to fix major problems, and major problems are what the colo guys ARE there to take care of.

As far as most of us are guessing, and I don't think the actual process has been confirmed by Matt or Eric, the hard drives from Arecibo still go to the Seti lab.
And the data is then transferred over the network link to the splitters at the colo. This we suspect is what causes the huge spikes in the upload link on the cricket graphs (the blue line, which is inbound to the servers).
How much of this process is automated and how much requires manual intervention by the boyz in da lab is up to guesses. Perhaps Matt can clarify some day.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1365143 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1365193 - Posted: 6 May 2013, 17:28:50 UTC - in response to Message 1365143.  

I forget, are the colo guys supposed to be loading splitter tapes (drives), or do the project guys still have to do that? If it's only the project guys, well, hey, it was the weekend. They only go in on their own time to fix major problems, and major problems are what the colo guys ARE there to take care of.

As far as most of us are guessing, and I don't think the actual process has been confirmed by Matt or Eric, the hard drives from Arecibo still go to the Seti lab.
And the data is then transferred over the network link to the splitters at the colo. This we suspect is what causes the huge spikes in the upload link on the cricket graphs (the blue line, which is inbound to the servers).
How much of this process is automated and how much requires manual intervention by the boyz in da lab is up to guesses. Perhaps Matt can clarify some day.

As I recall from Matt's posts, the data from Arecibo is downloaded from the transfer drives at the lab, then sent to archive storage. As the supply of ready-to-split data files gets low, new data files are called up from storage, processed for RFI blanking, then transferred down the hill to the splitter storage portion of the JBOD (Or the RFI blanking may be done by a background server that was also moved to the CoLo). These transfers are done manually by Matt/Jeff/etc.

Once the ready-to-split files are transferred to the JBOD, when the splitters get to a certain low point, more files are automatically mounted for splitting.
Donald
Infernal Optimist / Submariner, retired
ID: 1365193 · Report as offensive
musicplayer

Send message
Joined: 17 May 10
Posts: 2430
Credit: 926,046
RAC: 0
Message 1365249 - Posted: 6 May 2013, 19:32:34 UTC
Last modified: 6 May 2013, 19:35:54 UTC

Blanked? Do you perhaps mean RFI?

Anyway, this perhaps should be put up somewhere else here, but the four types of numbers the Seti@home client is supposed to catch may not necessarily be what may be in an alien signal.

Therefore it is likely to go undetected.

Regardless of a task being a shorty or not (carrying out the gaussian search in the second instance), the numbers returned for spike, pulse and possible triplet should be the same otherwise.

Still, many people are convinced that narrowband signals do not consist of gaussians at all, whether or not they are computed by means of the spikes and pulses which may be present in a given task.

In other words, there should exists an algorithm which filters out all the numbers which may be the result of natural causes in order to be able to catch the signal (or signal curve) which may be present in an extraterrestrial signal.

However it may not be that interesting to process data sets which may be mostly empty when it comes to relevant contents.
ID: 1365249 · Report as offensive
Profile Donald L. Johnson
Avatar

Send message
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1365345 - Posted: 7 May 2013, 5:32:50 UTC - in response to Message 1365249.  

Blanked? Do you perhaps mean RFI?

Anyway, this perhaps should be put up somewhere else here

Indeed, probably the Seti Science forum.....
Donald
Infernal Optimist / Submariner, retired
ID: 1365345 · Report as offensive
David S
Volunteer tester
Avatar

Send message
Joined: 4 Oct 99
Posts: 18352
Credit: 27,761,924
RAC: 12
United States
Message 1365423 - Posted: 7 May 2013, 13:43:35 UTC

Okay, panickers, what say you now?

MB ready to send is now upward of 245,000, still with only 6 splitters running. (FWIW, AP ready to send is about 2,000 with all its splitters running.)

Can we still say the MB/PFB splitters can't keep up with demand? Or is something else going on that's been causing the ready number to be so low recently?

David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.

ID: 1365423 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1365429 - Posted: 7 May 2013, 14:01:06 UTC
Last modified: 7 May 2013, 14:01:49 UTC

Okay, panickers, what say you now?

MB ready to send is now upward of 245,000, still with only 6 splitters running. (FWIW, AP ready to send is about 2,000 with all its splitters running.)

Can we still say the MB/PFB splitters can't keep up with demand? Or is something else going on that's been causing the ready number to be so low recently?

Not convinced. They had real problems when (a) no AP was available, and (b) the MB work was mostly shorties. They are keeping up now with a better ratio of long/short MB tasks and the AP tasks. For example, I've received 364 tasks in the last 24 hours per BOINCTasks (staying at limits for the period). That number was ranging from 700 to 800 when we got almost all shorties, no AP, and the ready to send was depleted. Maybe there was a fix, but I'll wait and see.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1365429 · Report as offensive
Previous · 1 . . . 12 · 13 · 14 · 15 · 16 · 17 · 18 . . . 21 · Next

Message boards : Number crunching : Panic Mode On (83) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.