Panic Mode On (62) Server problems?


log in

Advanced search

Message boards : Number crunching : Panic Mode On (62) Server problems?

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 11 · Next
Author Message
Kevin Olley
Send message
Joined: 3 Aug 99
Posts: 368
Credit: 35,328,637
RAC: 670
United Kingdom
Message 1175621 - Posted: 4 Dec 2011, 14:50:32 UTC - in response to Message 1175614.

I did just get one work request successfully through for my top rig by spending some quality time with my mouse and the retry button.

Meow meow meow!


I tried that but mine was too far gone, I only had shorties left and I could not press the button fast enough to keep up with them.



____________
Kevin


Kevin Olley
Send message
Joined: 3 Aug 99
Posts: 368
Credit: 35,328,637
RAC: 670
United Kingdom
Message 1175623 - Posted: 4 Dec 2011, 14:52:32 UTC - in response to Message 1175610.


41 VLAR's in the pot, lets see how x41g can handle them, I am going easy only 1 per card.



I am looking at about an hour per VLAR on GPU, so x41g looks as good as or slightly better than previous releases on 470's.



____________
Kevin


Miklos M.
Send message
Joined: 5 May 99
Posts: 760
Credit: 16,675,464
RAC: 10,598
United States
Message 1175643 - Posted: 4 Dec 2011, 16:05:19 UTC

No uploads and no downloads of any kind now.
____________

Profile Wiggo
Avatar
Send message
Joined: 24 Jan 00
Posts: 7942
Credit: 98,330,223
RAC: 26,986
Australia
Message 1175702 - Posted: 4 Dec 2011, 20:35:32 UTC - in response to Message 1175643.

Well things here are business as usual with my 3 rigs bouncing on/off the limits still.

Cheers.
____________

musicplayer
Send message
Joined: 17 May 10
Posts: 1475
Credit: 745,525
RAC: 711
Message 1175703 - Posted: 4 Dec 2011, 20:36:45 UTC

Umm, it is up and running again. Great!

Profile ivan
Volunteer tester
Avatar
Send message
Joined: 5 Mar 01
Posts: 639
Credit: 146,916,944
RAC: 63,322
United Kingdom
Message 1175706 - Posted: 4 Dec 2011, 20:55:43 UTC - in response to Message 1175621.

I did just get one work request successfully through for my top rig by spending some quality time with my mouse and the retry button.

Meow meow meow!


I tried that but mine was too far gone, I only had shorties left and I could not press the button fast enough to keep up with them.




To save having to monitor the Retry button I made up a little cron job and a wee awk script:

crontab entry:
* * * * * source /home/Compaq_Owner/retryfiles

retryfiles:

cd c:
cd 'Program Files/BOINC'
./boinccmd.exe --get_file_transfers | gawk -f retry.awk

Program Files\BOINC\retry.awk:

/name/ { n = $2;}
/ xfer active: no/ { system("./boinccmd --file_transfer http://setiathome.berkeley.edu/ " n " retry");}

In other words, every minute cron runs retryfiles; retryfiles lists the files awaiting transfer and hands the results to retry.awk. The awk script stores the name of each file as the data passes through it, then if it sees that that file has an inactive transfer it spawns a system command to tell boinccmd to retry the transfer... The nice thing about doing it this way is that I only bother the Berkeley servers if I find a transfer that's in a wait-for-retry state.

This is working well on my two main Windows/NVIDIA boxes; their tasks in progress are slowly rising whereas this morning they both had empty caches. The commands as given should work out-of-the-box with most Linux installations; for Windows you need to install cygwin and its cron service -- or other equivalent software.

To run the script less often than every minute use */<n> as the first entry in the crontab line, where <n> is however many minutes you want to delay between instances, e.g. */5 for every fifth minute.

Sten-Arne
Volunteer tester
Send message
Joined: 1 Nov 08
Posts: 3684
Credit: 21,181,274
RAC: 5,724
Sweden
Message 1175708 - Posted: 4 Dec 2011, 20:58:44 UTC - in response to Message 1175702.

Well things here are business as usual with my 3 rigs bouncing on/off the limits still.

Cheers.


Same here, the two "faster" ones, are at their limits, and the ATOM doesn't want any more, it's so slow that it stays well below the server limits.

I have a combined WU count of 1679 WU's in progress. So no problems here in uploading or downloading.
____________

Kevin Olley
Send message
Joined: 3 Aug 99
Posts: 368
Credit: 35,328,637
RAC: 670
United Kingdom
Message 1175710 - Posted: 4 Dec 2011, 21:09:02 UTC - in response to Message 1175706.


I tried that but mine was too far gone, I only had shorties left and I could not press the button fast enough to keep up with them.




To save having to monitor the Retry button I made up a little cron job and a wee awk script:



Snip.

This may be useful to some, a little basic to others, but its way above my head, mouse clicking is about my limit:-)


____________
Kevin


Profile ivan
Volunteer tester
Avatar
Send message
Joined: 5 Mar 01
Posts: 639
Credit: 146,916,944
RAC: 63,322
United Kingdom
Message 1175720 - Posted: 4 Dec 2011, 22:19:30 UTC - in response to Message 1175710.


To save having to monitor the Retry button I made up a little cron job and a wee awk script:



Snip.

This may be useful to some, a little basic to others, but its way above my head, mouse clicking is about my limit:-)



Each to his own, Kev. You know I need to understand this sort of thing for my job; the beauty of computers is that they can relieve us of button-clicking duty, they never get tired. So ultimately a little bit of time spent learning something like awk (or python, or perl if you want even more capability) _can_ pay you back in added flexibility. But it's not compulsory...

Cheers, mate!

Kevin Olley
Send message
Joined: 3 Aug 99
Posts: 368
Credit: 35,328,637
RAC: 670
United Kingdom
Message 1175734 - Posted: 4 Dec 2011, 22:44:00 UTC - in response to Message 1175720.


Each to his own, Kev. You know I need to understand this sort of thing for my job; the beauty of computers is that they can relieve us of button-clicking duty, they never get tired. So ultimately a little bit of time spent learning something like awk (or python, or perl if you want even more capability) _can_ pay you back in added flexibility. But it's not compulsory...

Cheers, mate!


I am just a lorry driver by trade, yes they are sneeking into the cabs, there is even a bunch of electronics between my right foot and the engine now but the thing I have to worry about is whats on the dash (ie: call out tow truck).

I only dabble with computers for pleasure and due to other commitments time is in very short supply, If or when I can find the time increasing my capabilities with a computer is on my to do list.



____________
Kevin


Cosmic_Ocean
Avatar
Send message
Joined: 23 Dec 00
Posts: 2327
Credit: 8,868,786
RAC: 781
United States
Message 1175767 - Posted: 5 Dec 2011, 2:37:03 UTC

My APs upload first try every time one finishes. Scheduler requests go through every time, but about 99.5% of the time respond with "no tasks available" or "your app_info.xml file doesn't have a usable version of Seti@Home Enhanced." On the rare occasion that I do get issued an AP, it instant-fails 1-10 times and then finally goes through. I haven't hit any buttons in at least a month.
____________

Linux laptop uptime: 1484d 22h 42m
Ended due to UPS failure, found 14 hours after the fact

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5918
Credit: 61,707,527
RAC: 20,147
Australia
Message 1175798 - Posted: 5 Dec 2011, 7:33:32 UTC - in response to Message 1175192.


Take a look at Scarecrow's graphs.
A surge as the backlog of uploads goes through is to be expected- but for it to be sustained at over 110,000 for several hours. Talk about a hammering.
____________
Grant
Darwin NT.

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5918
Credit: 61,707,527
RAC: 20,147
Australia
Message 1175805 - Posted: 5 Dec 2011, 8:53:30 UTC - in response to Message 1175798.
Last modified: 5 Dec 2011, 9:12:56 UTC

And it would appear it was too much for too long- the uploads are backing up yet again.


EDIT- they have finally cleaered, but the inbound traffic is looking jagged again. Not a good sign.
____________
Grant
Darwin NT.

Profile Zapped SparkyProject donor
Volunteer moderator
Volunteer tester
Avatar
Send message
Joined: 30 Aug 08
Posts: 8926
Credit: 1,321,346
RAC: 699
United Kingdom
Message 1175917 - Posted: 5 Dec 2011, 19:07:02 UTC

One, and a third of an astropulse left to go. I'm starting to panic a bit.
____________
In an alternate universe, it was a ZX81 that asked for clothes, boots and motorcycle.

Client error 418: I'm a teapot

Tropical Goldfish Fish 15: Squeaky bras 'R us

Illusions of normality sufferer

Terror Australis
Volunteer tester
Send message
Joined: 14 Feb 04
Posts: 1759
Credit: 206,463,674
RAC: 11,581
Australia
Message 1175976 - Posted: 6 Dec 2011, 1:19:53 UTC
Last modified: 6 Dec 2011, 1:49:32 UTC

Oops, Looks like uploads have gone MGD again.

All my rigs are now in "project backoff" for uploads and all have been getting "No Tasks Available" when asking for work for some hours.

And it's now just after knock off time in Berkeley so there will be no-one there to apply the rubber hammer. :P

EDIT: Looks like I was wrong about the rubber hammer, about 15 minutes after I posted, all uploads cleared at good speed.

T.A.

Amauri
Volunteer tester
Send message
Joined: 18 May 08
Posts: 26
Credit: 480,373
RAC: 147
Brazil
Message 1176024 - Posted: 6 Dec 2011, 4:14:00 UTC - in response to Message 1175706.

To save having to monitor the Retry button I made up a little cron job and a wee awk script:

crontab entry:
* * * * * source /home/Compaq_Owner/retryfiles

retryfiles:

cd c:
cd 'Program Files/BOINC'
./boinccmd.exe --get_file_transfers | gawk -f retry.awk

Program Files\BOINC\retry.awk:

/name/ { n = $2;}
/ xfer active: no/ { system("./boinccmd --file_transfer http://setiathome.berkeley.edu/ " n " retry");}



Great job, Ivan, thank you!

Grant (SSSF)
Send message
Joined: 19 Aug 99
Posts: 5918
Credit: 61,707,527
RAC: 20,147
Australia
Message 1176044 - Posted: 6 Dec 2011, 8:27:24 UTC - in response to Message 1176024.


Once again, uploads accumulate.
____________
Grant
Darwin NT.

MikeN
Send message
Joined: 24 Jan 11
Posts: 302
Credit: 32,822,708
RAC: 5,535
United Kingdom
Message 1176067 - Posted: 6 Dec 2011, 13:14:28 UTC

Cricket graph has just base lined for uploads and downloads. Thats probably that until after the weekly outage.
____________

Profile SciManStevProject donor
Volunteer tester
Avatar
Send message
Joined: 20 Jun 99
Posts: 4897
Credit: 83,865,763
RAC: 13,434
United States
Message 1176070 - Posted: 6 Dec 2011, 13:34:55 UTC

I'm still dreaming of the day when my GPU's don't run dry 2-3 times a week.

Steve
____________
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 11 · Next

Message boards : Number crunching : Panic Mode On (62) Server problems?

Copyright © 2014 University of California