| Author |
Message |
Matt Lebofsky Volunteer moderator Project administrator Project developer Project scientist
Joined: 1 Mar 99 Posts: 1379 Credit: 74,079 RAC: 0

|
|
It's fairly clear that the recent updates we made to the general mysql/state counts/splitter fold have vastly eased our recent weekend woes. There were still a couple dips here and there, but no wild swings like before.
Except this morning one particular query - from the scheduler - was clogging the works. We figured we'd just let it push through, i.e. let nature take its course. We assumed it was an expensive lookup, but after a couple hours of waiting I ran the same query on the replica and found there was only one (!) row in question. So what the heck is mysql doing? We killed the query and eventually the logjam cleared.
I'm finally scraping up enough space to pull a lot more work up from our archives, so Astropulse will be kicking in again, at least at some low level. This should also help reduce the demand on our limited resources since those workunits take longer to process, which means a lighter load on our database/download/upload/scheduling servers.
- Matt
____________
-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude |
|
|
|
|
|
I can certainly say that the title holds true for right now.
But, I'm running out of work again... check the queries if there's anything stuck again...
Edit: Never mind, the work just took freakishly long to get via scheduler requests...
____________
What if Fiction was Fact and Fact was Fiction and vice versa? |
|
|
|
|
|
Ah, CUDA maxed the pipe out for MB.
Thanks for the updates.
____________
|
|
|
|
|
... so Astropulse will be kicking in again, at least at some low level. ...
Thanks for the info.
Could you clarify, please, whether the data fetched back from storage (like 05mr09ad currently splitting) will be split and scheduled for the general _v5 (503) application we've been using for a while, or the new _v505 application installed for Windows only on 10 June? It would help the optimisers plan their testing and releases. |
|
|
|
|
|
Excellent news, Matt. I know the community in general is probably glad to hear that AP work is coming, and I'm sure your pipeline/databases are going to like you more as well. |
|
|
|
|
|
. . . Thanks for the Updates Matt
< To All @ Berkeley - the Best to Each of You Today - May All go well . . .
____________
BOINC Wiki . . .
Science Status Page . . .
|
|
|
|
|
|
FYI, the thread of 23 June is unreadable:
Database Error
Unable to handle request
No thread with id 54313. Please check the link and try again.
____________
The SETI@Home Gauntlet 2012 april 16 - 30| info / chat | STATS |
|
|
|
|
|
Not only this thread. :-(
I have seen this error in the NC forum also.
And I have noticed that some threads say the last post was made a few hours ago, but in the thread itself the last post was made yesterday or even months ago.
See, for example, these threads:
http://setiathome.berkeley.edu/forum_thread.php?id=54092
http://setiathome.berkeley.edu/forum_thread.php?id=53702
____________
|
|
|
|
|
|
BTW, whoever kicked the server this morning, thanks a lot :)
when I looked up server status after not being able to see my tasks (results) I saw red
muchly appreciated, muchísimas gracias, obrigado, vielen herzlichen Dank, merci beaucoup, domo arigato gosaimashita, efcharistó, mille grazie, hvala lijepo, spasibo bolshoe
____________
Robi |
|
|
C
Joined: 3 Apr 99 Posts: 238 Credit: 5,913,213 RAC: 2,664

|
|
The server may be mostly up, but there's almost no I/O going on. See http://fragment1.berkeley.edu:80/newcricket/grapher.cgi?target=/router-interfaces/inr-250/gigabitethernet2_3&ranges=d%3Aw&view=Octets
C
____________
Join Team MacNN |
|
|
|
|
|
well
something else
currently I can't download WUs (from the server to my PC);
they are all pending, and every copy fails with a wrong size, as below:
24/06/2009 17:46:58 SETI@home Started download of 06ap09ac.31280.4162.13.8.0
24/06/2009 17:46:59 Internet access OK - project servers may be temporarily down.
24/06/2009 17:47:20 Project communication failed: attempting access to reference site
24/06/2009 17:47:20 SETI@home Temporarily failed download of 06ap09ac.31280.4162.13.8.0: connect() failed
24/06/2009 17:47:20 SETI@home Backing off 15 min 54 sec on download of 06ap09ac.31280.4162.13.8.0
24/06/2009 17:47:20 SETI@home [error] File 06ap09ac.31280.3753.13.8.31 has wrong size: expected 375333, got 0
24/06/2009 17:47:20 SETI@home Started download of 06ap09ac.31280.3753.13.8.31
24/06/2009 17:47:21 Internet access OK - project servers may be temporarily down.
24/06/2009 17:47:38 Project communication failed: attempting access to reference site
24/06/2009 17:47:38 SETI@home Temporarily failed download of 01mr09af.22260.17659.13.8.97: HTTP error
24/06/2009 17:47:38 SETI@home Backing off 2 min 3 sec on download of 01mr09af.22260.17659.13.8.97
24/06/2009 17:47:38 SETI@home [error] File 06ap09ac.31280.3753.13.8.144 has wrong size: expected 375331, got 0
24/06/2009 17:47:38 SETI@home Started download of 06ap09ac.31280.3753.13.8.144
24/06/2009 17:47:39 Internet access OK - project servers may be temporarily down.
...
cheers
jeanpierre@jpp
____________
|
|
|
|
|
|
I'm getting much the same as Jeanpierre:
Wed Jun 24 12:04:32 2009|SETI@home|Started download of 06ap09ab.914.4571.15.8.189
Wed Jun 24 12:06:42 2009|SETI@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
Wed Jun 24 12:06:48 2009|SETI@home|Scheduler request succeeded: got 0 new tasks
Wed Jun 24 12:07:00 2009|SETI@home|Started download of 01mr09af.22260.22976.13.8.195
Wed Jun 24 12:07:44 2009||Project communication failed: attempting access to reference site
Wed Jun 24 12:07:44 2009|SETI@home|Temporarily failed download of 06ap09ab.914.4571.15.8.189: HTTP error
Wed Jun 24 12:07:44 2009|SETI@home|Backing off 1 hr 9 min 40 sec on download of 06ap09ab.914.4571.15.8.189
Wed Jun 24 12:07:44 2009|SETI@home|Started download of 06ap09aa.31651.10706.15.8.205
Wed Jun 24 12:07:45 2009||Internet access OK - project servers may be temporarily down.
Wed Jun 24 12:08:15 2009||Project communication failed: attempting access to reference site
Wed Jun 24 12:08:15 2009|SETI@home|Temporarily failed download of 01mr09af.22260.22976.13.8.195: connect() failed
Wed Jun 24 12:08:15 2009|SETI@home|Backing off 1 hr 29 min 42 sec on download of 01mr09af.22260.22976.13.8.195
Wed Jun 24 12:08:15 2009|SETI@home|Started download of 06ap09ab.20174.23794.14.8.5
Wed Jun 24 12:08:16 2009||Internet access OK - project servers may be temporarily down.
Wed Jun 24 12:09:30 2009||Project communication failed: attempting access to reference site
Wed Jun 24 12:09:30 2009|SETI@home|Temporarily failed download of 06ap09ab.20174.23794.14.8.5: connect() failed
Wed Jun 24 12:09:30 2009|SETI@home|Backing off 1 hr 6 min 4 sec on download of 06ap09ab.20174.23794.14.8.5
Wed Jun 24 12:09:30 2009|SETI@home|Started download of 01mr09ad.18953.68833.16.8.126
Wed Jun 24 12:09:31 2009||Internet access OK - project servers may be temporarily down.
|
|
|
|
|
|
Oh, and all my tasks are "downloading": seems like one big mess.
____________
|
|
|
|
|
|
Ditto in Chicago. Downloads start but receive zero bytes for each file, in two different locations, on Mac and Win.
Darek
____________
|
|
|
|
|
|
BTW, the top sticky thread (Long outage or something like that) seems to be broken as well. I get an error when I click it.
____________
|
|
|
|
|
BTW, the top sticky thread (Long outage or something like that) seems to be broken as well. I get an error when I click it.
Yep, same thing happens when I try it. Also having problems downloading work so I suspect it's a database error. I'm sure they'll fix it as soon as they can.
____________
|
|
|
|
|
|
Has anyone noticed that the work units they managed to send back today still appear as in progress?
____________
|
|
|
|
|
|
Dan Sosebee,
From yours:
Yep, same thing happens when I try it. Also having problems downloading work so I suspect it's a database error. I'm sure they'll fix it as soon as they can.
I'm sure they will, I just wish they'd post some information about what is going on.
Rad |
|
|
|
|
I just wish they'd post some information about what is going on.
Do you want them to stop working on the problem(s) and post, or do you want them to put all of their efforts into fixing it and post later?
____________
|
|
|
Volunteer tester
Joined: 9 Apr 02 Posts: 12016 Credit: 19,621,345 RAC: 41,718

|
BTW, whoever kicked the server this morning, thanks a lot :)
when I looked up server status after not being able to see my tasks (results) I saw red
muchly appreciated, muchísimas gracias, obrigado, vielen herzlichen Dank, merci beaucoup, domo arigato gosaimashita, efcharistó, mille grazie, hvala lijepo, spasibo bolshoe
I've never quite understood why people get so emotional (angry, seeing red) over anything at this project when 99% of the time things can be explained in a rational way when given a chance and a bit of patience.
____________
|
|
|