Panic Mode On (110) Server Problems?

Message boards : Number crunching : Panic Mode On (110) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 38 · Next

AuthorMessage
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4398
Credit: 263,182,060
RAC: 597,779
United States
Message 1917822 - Posted: 9 Feb 2018, 5:18:49 UTC - in response to Message 1917818.  

If the "active" users were heavy-duty crunchers, maybe no difference in project throughput.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1917822 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9823
Credit: 126,741,161
RAC: 83,934
Australia
Message 1917839 - Posted: 9 Feb 2018, 6:35:08 UTC - in response to Message 1917822.  

If the "active" users were heavy-duty crunchers, maybe no difference in project throughput.

Yep.
If would depend on how much the top 2% of active users contribute to the total Returned-per-hour number.
Grant
Darwin NT
ID: 1917839 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9823
Credit: 126,741,161
RAC: 83,934
Australia
Message 1917843 - Posted: 9 Feb 2018, 7:42:51 UTC

Hmmm.
The daily glitch appears to have moved about 45min to an hour later.
Grant
Darwin NT
ID: 1917843 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4398
Credit: 263,182,060
RAC: 597,779
United States
Message 1917847 - Posted: 9 Feb 2018, 7:47:39 UTC - in response to Message 1917843.  

Seems like it hits here at the same time. Just posted in the lounge the site went dark and no connections were going through. Notice at Haveland they just dumped a boatload of wu deletions.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1917847 · Report as offensive
Stephen "Heretic" Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 3310
Credit: 67,453,296
RAC: 101,287
Australia
Message 1917859 - Posted: 9 Feb 2018, 13:28:03 UTC - in response to Message 1917847.  

Seems like it hits here at the same time. Just posted in the lounge the site went dark and no connections were going through. Notice at Haveland they just dumped a boatload of wu deletions.


. . Well looking at the tapes splitting it looks like being a dry old weekend ...

Stephen

:(
ID: 1917859 · Report as offensive
kittyman Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 49796
Credit: 910,585,081
RAC: 180,258
United States
Message 1917864 - Posted: 9 Feb 2018, 13:59:20 UTC

I've been seeing the servers get tied in knots lately every night when I get home at midnight CST.
Hard to contact the servers or the forum pages at all.
What meowing lurks in the hearts of man? The kittyman knows....MEOWhahahahahahha!

Have made friends here.
Most were cats.
ID: 1917864 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4398
Credit: 263,182,060
RAC: 597,779
United States
Message 1917884 - Posted: 9 Feb 2018, 16:02:46 UTC

I found it strange when Grant mentioned he believe the project communications strangeness happened 45 minutes earlier than normal and for me it was exactly the same.

For me the project becomes unreachable or slow at 11:30 - 12:00 Midnight PST every night. That points to something locally happening at each of our locations since we are in very different time zones.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1917884 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4398
Credit: 263,182,060
RAC: 597,779
United States
Message 1917885 - Posted: 9 Feb 2018, 16:10:50 UTC - in response to Message 1917859.  


. . Well looking at the tapes splitting it looks like being a dry old weekend ...

Stephen

:(

Looks like they added a bit more data overnight. When I went to bed the splitters only had about 1250 channels left to do. Now the unsplit channels are over 2000.

We are going to need more than that though to make it through to Monday.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1917885 · Report as offensive
Profile Ghan-buri-Ghan Mike Project Donor

Send message
Joined: 27 Dec 15
Posts: 81
Credit: 27,097,356
RAC: 79,232
United States
Message 1917916 - Posted: 9 Feb 2018, 20:49:33 UTC

BOINC permits users via their cc_config files to report all results immediately. That means many more, albeit much shorter, queries to the servers (as opposed to say an hourly update). Would restricting (or asking users not to use ) this setting help or hurt the server load issue? Could hourly updates be staggered by sorting users (maybe alphabetically by user name or some other criteria) to even our the server load? Just a thought.....
ID: 1917916 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9823
Credit: 126,741,161
RAC: 83,934
Australia
Message 1917925 - Posted: 9 Feb 2018, 21:40:03 UTC - in response to Message 1917884.  
Last modified: 9 Feb 2018, 21:40:33 UTC

I found it strange when Grant mentioned he believe the project communications strangeness happened 45 minutes earlier than normal and for me it was exactly the same.

Later than usual.

On the Haveland graphs it used to be a bit after 06:00. The last couple of days it's been around 07:00.
Grant
Darwin NT
ID: 1917925 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9823
Credit: 126,741,161
RAC: 83,934
Australia
Message 1917927 - Posted: 9 Feb 2018, 21:45:17 UTC - in response to Message 1917916.  

BOINC permits users via their cc_config files to report all results immediately. That means many more, albeit much shorter, queries to the servers (as opposed to say an hourly update). Would restricting (or asking users not to use ) this setting help or hurt the server load issue? Could hourly updates be staggered by sorting users (maybe alphabetically by user name or some other criteria) to even our the server load? Just a thought.....

Better to have things spread out over a 24 hour period than lots of short, sharp, heavy demands on the system.
Getting work after an outage can be difficult- having groups of people all trying to report & get work at the same time will be lots of such events throughout the day. Add to the fact that I have to limit the number of results I report at time, otherwise i get nothing but Scheduler errors after an outage means such a system would just lead to other problems. Lots of smaller Scheduler requests through out the day are better than lots of large ones at one time.
Grant
Darwin NT
ID: 1917927 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 11973
Credit: 117,684,494
RAC: 51,906
United Kingdom
Message 1917932 - Posted: 9 Feb 2018, 22:21:49 UTC - in response to Message 1917927.  

I have to limit the number of results I report at time, otherwise i get nothing but Scheduler errors ...
Having a less verbose std_err report per task in the sched_request file might help with that one.
ID: 1917932 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9823
Credit: 126,741,161
RAC: 83,934
Australia
Message 1917938 - Posted: 9 Feb 2018, 22:34:07 UTC - in response to Message 1917932.  

I have to limit the number of results I report at time, otherwise i get nothing but Scheduler errors ...
Having a less verbose std_err report per task in the sched_request file might help with that one.

How do I configure that?
Grant
Darwin NT
ID: 1917938 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 11973
Credit: 117,684,494
RAC: 51,906
United Kingdom
Message 1917940 - Posted: 9 Feb 2018, 22:50:45 UTC - in response to Message 1917938.  

I have to limit the number of results I report at time, otherwise i get nothing but Scheduler errors ...
Having a less verbose std_err report per task in the sched_request file might help with that one.
How do I configure that?
Speak to the developer of whichever app you're using - it might even be in documentation.

An extra command line parameter of -v0 (or it might be -v 0) might help with that - it might be wise to experiment offline in a test bench environment first.
ID: 1917940 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9823
Credit: 126,741,161
RAC: 83,934
Australia
Message 1917942 - Posted: 9 Feb 2018, 22:59:48 UTC - in response to Message 1917940.  

An extra command line parameter of -v0 (or it might be -v 0) might help with that - it might be wise to experiment offline in a test bench environment first.

Ah, I've vague memories of v settings when looking at optimisation in the readmes.
Thanks.
Grant
Darwin NT
ID: 1917942 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4398
Credit: 263,182,060
RAC: 597,779
United States
Message 1917960 - Posted: 10 Feb 2018, 0:24:38 UTC - in response to Message 1917942.  

Just looked at the readme's for both the MB cpu and SoG apps. -v 1 is the default reporting and -v 0 disables almost all outputs.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1917960 · Report as offensive
Stephen "Heretic" Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 20 Sep 12
Posts: 3310
Credit: 67,453,296
RAC: 101,287
Australia
Message 1917965 - Posted: 10 Feb 2018, 0:55:08 UTC - in response to Message 1917885.  


. . Well looking at the tapes splitting it looks like being a dry old weekend ...

Stephen

:(

Looks like they added a bit more data overnight. When I went to bed the splitters only had about 1250 channels left to do. Now the unsplit channels are over 2000.

We are going to need more than that though to make it through to Monday.


. . Absotively ...

Stephen

:)
ID: 1917965 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4398
Credit: 263,182,060
RAC: 597,779
United States
Message 1917992 - Posted: 10 Feb 2018, 3:43:56 UTC - in response to Message 1917965.  

Looks like we got more data for the weekend. Still haven't seen the old Arecibo data Eric said Jeff was trying to massage into shape for splitting.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1917992 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 9823
Credit: 126,741,161
RAC: 83,934
Australia
Message 1917993 - Posted: 10 Feb 2018, 3:45:26 UTC

Well that sucks- an errored WU.

<message>
finish file present too long
</message>
Grant
Darwin NT
ID: 1917993 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 4398
Credit: 263,182,060
RAC: 597,779
United States
Message 1917994 - Posted: 10 Feb 2018, 3:49:43 UTC - in response to Message 1917993.  

Had a few of those myself last week. Sucks.
Seti@Home classic workunits:20,676 CPU time:74,226 hours
ID: 1917994 · Report as offensive
Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 38 · Next

Message boards : Number crunching : Panic Mode On (110) Server Problems?


 
©2018 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.