Panic Mode On (81) Server Problems?


log in

Advanced search

Message boards : Number crunching : Panic Mode On (81) Server Problems?

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 21 · Next
Author Message
juan BFBProject donor
Volunteer tester
Avatar
Send message
Joined: 16 Mar 07
Posts: 5486
Credit: 316,050,285
RAC: 147,375
Brazil
Message 1333722 - Posted: 2 Feb 2013, 0:27:48 UTC - in response to Message 1333720.
Last modified: 2 Feb 2013, 0:33:37 UTC

That what i fear. All my hosts are allready running in dryland. And no new work is DL neither from SETI or E@H

Try Primegrid: http://www.primegrid.com/

To much stuck UL boinc does not DL any aditional work from any project.

"Too many uploads" operates separately, project by project.

There's no reason within BOINC why a lot of stalled SETI uploads should prevent new work being fetched and downloaded from other projects like Einstein or Primegrid. I fetched all the work I needed from Einstein, on a host with too many SETI uploads, within the last 40 minutes.

Nice to see you arround Richard,

I imagine that but why E@H gives me the same msg as SETI? I´m talking about hosts who have few hundreds of stalled UL of each project.

01/02/2013 22:24:54 | Einstein@Home | Not requesting tasks: too many uploads in progress

In another hand i have very bad times running SETI @ Primegrid at the same time in the past, probabily they fix the problem but i not try again. In my case reseting the host is not an option and thats the only way to recover from the problem (a compleate driver crash) at that time.
____________

Richard HaselgroveProject donor
Volunteer tester
Send message
Joined: 4 Jul 99
Posts: 8801
Credit: 53,388,228
RAC: 42,692
United Kingdom
Message 1333725 - Posted: 2 Feb 2013, 0:40:41 UTC - in response to Message 1333722.

01/02/2013 22:24:54 | Einstein@Home | Not requesting tasks: too many uploads in progress

No problem here for Einstein:

02/02/2013 00:30:47 | Einstein@Home | Started upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_0
02/02/2013 00:30:47 | Einstein@Home | Started upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_1
02/02/2013 00:30:50 | Einstein@Home | Finished upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_0
02/02/2013 00:30:50 | Einstein@Home | Finished upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_1
02/02/2013 00:30:50 | Einstein@Home | Started upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_2
02/02/2013 00:30:50 | Einstein@Home | Started upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_3
02/02/2013 00:30:52 | Einstein@Home | Finished upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_2
02/02/2013 00:30:52 | Einstein@Home | Finished upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_3
02/02/2013 00:30:52 | Einstein@Home | Started upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_4
02/02/2013 00:30:52 | Einstein@Home | Started upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_5
02/02/2013 00:30:54 | Einstein@Home | Finished upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_4
02/02/2013 00:30:54 | Einstein@Home | Finished upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_5
02/02/2013 00:30:54 | Einstein@Home | Started upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_6
02/02/2013 00:30:54 | Einstein@Home | Started upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_7
02/02/2013 00:30:56 | Einstein@Home | Finished upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_6
02/02/2013 00:30:56 | Einstein@Home | Finished upload of p2030.20120201.G178.56-02.52.N.b3s0g0.00000_1576_2_7
02/02/2013 00:32:06 | Einstein@Home | [sched_op] Starting scheduler request
02/02/2013 00:32:06 | Einstein@Home | Sending scheduler request: To fetch work.
02/02/2013 00:32:06 | Einstein@Home | Reporting 3 completed tasks, requesting new tasks for NVIDIA GPU
02/02/2013 00:32:06 | Einstein@Home | [sched_op] CPU work request: 0.00 seconds; 0.00 CPUs
02/02/2013 00:32:06 | Einstein@Home | [sched_op] NVIDIA GPU work request: 53.02 seconds; 0.00 GPUs
02/02/2013 00:32:10 | Einstein@Home | Scheduler request completed: got 1 new tasks
02/02/2013 00:32:12 | Einstein@Home | Started download of p2030.20121014.G49.84-01.06.C.b3s0g0.00000_1416.bin4
02/02/2013 00:32:20 | Einstein@Home | Finished download of p2030.20121014.G49.84-01.06.C.b3s0g0.00000_1416.bin4
etc.

You may want to give those Einstein uploads a retry.

juan BFBProject donor
Volunteer tester
Avatar
Send message
Joined: 16 Mar 07
Posts: 5486
Credit: 316,050,285
RAC: 147,375
Brazil
Message 1333727 - Posted: 2 Feb 2013, 0:46:59 UTC - in response to Message 1333725.
Last modified: 2 Feb 2013, 1:20:43 UTC

01/02/2013 22:24:54 | Einstein@Home | Not requesting tasks: too many uploads in progress

No problem here for Einstein:

You may want to give those Einstein uploads a retry.


I allready made that a lot of times and allways receive the same answer, and BTW i have the same problem in at least 3 hosts (don´t check the rest)

Look this example (After few retry´s).

01/02/2013 22:48:12 | Einstein@Home | update requested by user
01/02/2013 22:48:15 | SETI@home | update requested by user
01/02/2013 22:48:16 | Einstein@Home | Sending scheduler request: Requested by user.
01/02/2013 22:48:16 | Einstein@Home | Not requesting tasks: too many uploads in progress
01/02/2013 22:48:18 | Einstein@Home | Scheduler request completed
01/02/2013 22:48:24 | SETI@home | Sending scheduler request: Requested by user.
01/02/2013 22:48:24 | SETI@home | Not requesting tasks: too many uploads in progress
01/02/2013 22:48:29 | SETI@home | Scheduler request completed

(edit) i just check some other hosts and in host with fewer UL stalled the ask for new WU is working fine.
____________

Gone
Send message
Joined: 31 May 99
Posts: 150
Credit: 125,774,760
RAC: 0
United Kingdom
Message 1333731 - Posted: 2 Feb 2013, 1:14:01 UTC

If you want one that works, I made a new GPUUG at Distrigen

http://boinc.freerainbowtables.com/distrrtgen/team_display.php?teamid=2443

Pays well and works !

Reg
____________

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5942
Credit: 62,299,584
RAC: 36,518
Australia
Message 1333733 - Posted: 2 Feb 2013, 1:20:20 UTC - in response to Message 1333727.

I allready made that a lot of times and allways receive the same answer, and BTW i have the same problem in at least 3 hosts (don´t check the rest)

Are you still using proxies? Are those hosts all using the same proxy?
____________
Grant
Darwin NT.

juan BFBProject donor
Volunteer tester
Avatar
Send message
Joined: 16 Mar 07
Posts: 5486
Credit: 316,050,285
RAC: 147,375
Brazil
Message 1333737 - Posted: 2 Feb 2013, 1:28:33 UTC - in response to Message 1333733.
Last modified: 2 Feb 2013, 1:29:40 UTC

I allready made that a lot of times and allways receive the same answer, and BTW i have the same problem in at least 3 hosts (don´t check the rest)

Are you still using proxies? Are those hosts all using the same proxy?

No proxies, and 3 diferent ISP, DL are ok, UL start to get problem in SETI few hours ago, then after ending to crunch the GPU cache the host automaticaly switch to crunching E@H (0 resource share). But as they are a 2x690 hosts they crunches a lot of WU in few hours, then after some time the E@H UL stop and the ask for new work was stoped. Still have some SETI cpu work on them.

But that not happened in the slower 2x670 hosts, the SETI UL is stalled but the E@H ask for new work is working fine (UL at E@H is not working too) but as they are slower i belive the problem will apear in few hours.

I was thinking to activate a 3 project to see what happens but is to late and i have drink to many beers to try. Maybe tomorrow.

@Mark do you have any news about the Tbret kitty?
____________

Ron
Volunteer tester
Send message
Joined: 24 Aug 99
Posts: 42
Credit: 34,542,964
RAC: 0
United States
Message 1333773 - Posted: 2 Feb 2013, 2:43:25 UTC

Thank you Eric!

N9JFE David SProject donor
Volunteer tester
Avatar
Send message
Joined: 4 Oct 99
Posts: 12619
Credit: 14,977,138
RAC: 9,082
United States
Message 1333774 - Posted: 2 Feb 2013, 2:44:20 UTC - in response to Message 1333773.
Last modified: 2 Feb 2013, 2:57:35 UTC

Thank you Eric!

+1

[edit] Yay! All my uploads just went through with just a few clicks of the retry button.
____________
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.


Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5942
Credit: 62,299,584
RAC: 36,518
Australia
Message 1333781 - Posted: 2 Feb 2013, 3:16:21 UTC - in response to Message 1333774.

Thank you Eric!

+1

And another one from me.

Hit retry & they've started to go through, about as fast as downloads (ie almost stationary), but they are going through.
____________
Grant
Darwin NT.

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5942
Credit: 62,299,584
RAC: 36,518
Australia
Message 1333783 - Posted: 2 Feb 2013, 3:24:01 UTC - in response to Message 1333781.


Damn.
Now getting Scheduler timeouts & expired VLARs that got resent to the GPU.
:-(
____________
Grant
Darwin NT.

fscheel
Send message
Joined: 13 Apr 12
Posts: 73
Credit: 11,135,641
RAC: 0
United States
Message 1333784 - Posted: 2 Feb 2013, 3:24:55 UTC

Thank You Eric!!!!!!!!!!!!!!!!!

N9JFE David SProject donor
Volunteer tester
Avatar
Send message
Joined: 4 Oct 99
Posts: 12619
Credit: 14,977,138
RAC: 9,082
United States
Message 1333805 - Posted: 2 Feb 2013, 4:21:26 UTC

It appears that when Eric fixed the upload server, he broke the SSP. It hasn't updated in almost 2 hours and shows the upload server disabled.

I would rather not revert to the previous situation, though.

____________
David
Sitting on my butt while others boldly go,
Waiting for a message from a small furry creature from Alpha Centauri.


Profile James Sotherden
Avatar
Send message
Joined: 16 May 99
Posts: 9117
Credit: 37,508,870
RAC: 32,570
United States
Message 1333821 - Posted: 2 Feb 2013, 4:56:51 UTC - in response to Message 1333809.

It appears that when Eric fixed the upload server, he broke the SSP. It hasn't updated in almost 2 hours and shows the upload server disabled.

I would rather not revert to the previous situation, though.

Well, he did what he could do by remote. He is aware of the situation, and if more severe remedial means are necessary, he might undertake. Or, he might take the well deserved weekend off, as most of us expect after a week of work, and just take the weekend off with his lady and her raccoons.

And I agree 100%. Those guys bust butt all week. I say if they cant fix it by remote, O well Monday will come soon enough.
Ive had to be on call on the weekends for where I work. Even getting two hours pay for every call in, no matter if it took 10 minutes to fix the problem was worth ruining my week end off. And the Lab guys dont even get paid if they go in on a day off. Though they have done it!




____________

Old James

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5942
Credit: 62,299,584
RAC: 36,518
Australia
Message 1333823 - Posted: 2 Feb 2013, 4:57:29 UTC - in response to Message 1333805.

It appears that when Eric fixed the upload server, he broke the SSP. It hasn't updated in almost 2 hours and shows the upload server disabled.

I noticed the data was stale, but at least everything else is working.

____________
Grant
Darwin NT.

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 21 · Next

Message boards : Number crunching : Panic Mode On (81) Server Problems?

Copyright © 2014 University of California