Not Perfect but Better (Jun 22 2009)

Message boards : Technical News : Not Perfect but Better (Jun 22 2009)
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Profile Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 910238 - Posted: 22 Jun 2009, 20:53:56 UTC

It's fairly clear that the recent updates we made to the general mysql/state counts/splitter fold has vastly improved our recent weekend woes. There were still a couple dips here and there, but no wild swings like before.

Except this morning one particular query - from the scheduler - was clogging the works. We figured we'll just let it push through, i.e. let nature take its course. We assumed it was an expensive lookup, but after a couple hours of waiting I ran the same query on the replica and found there was only one (!) row in question. So what the heck is mysql doing? We killed the query and eventually the logjam cleared.

I'm finally scraping up enough space to pull a lot more work up from our archives, so Astropulse will be kicking in again, at least at some low level. This should also help reduce the deman on our limited resources since those workunits take longer to process, which means a lighter load on our database/download/upload/scheduling servers.

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 910238 · Report as offensive
Profile Jack Zhang
Volunteer tester
Avatar

Send message
Joined: 2 Jul 06
Posts: 206
Credit: 6,142,449
RAC: 0
Canada
Message 910261 - Posted: 22 Jun 2009, 21:56:54 UTC - in response to Message 910238.  
Last modified: 22 Jun 2009, 22:08:52 UTC

I can certain say that the title holds true for right now.

But, I'm running out of work again... check the queries if there's anything stuck again...

Edit: Never mind, the work just took freakishly really long to get via scheduler requests...
What if Fiction was Fact and Fact was Fiction and vice versa?
ID: 910261 · Report as offensive
Profile Gary Charpentier Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 25 Dec 00
Posts: 31015
Credit: 53,134,872
RAC: 32
United States
Message 910265 - Posted: 22 Jun 2009, 22:34:47 UTC - in response to Message 910238.  

Ah, CUDA maxed the pipe out for MB.

Thanks for the updates.

ID: 910265 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14679
Credit: 200,643,578
RAC: 874
United Kingdom
Message 910266 - Posted: 22 Jun 2009, 22:42:29 UTC - in response to Message 910238.  

... so Astropulse will be kicking in again, at least at some low level. ...

Thanks for the info.

Could you clarify, please, whether the data fetched back from storage (like 05mr09ad currently splitting) will be split and scheduled for the general _v5 (503) application we've been using for a while, or the new _v505 application installed for Windows only on 10 June? It would help the optimisers plan their testing and releases.
ID: 910266 · Report as offensive
Profile Rick

Send message
Joined: 27 Mar 01
Posts: 8
Credit: 25,223,770
RAC: 0
United States
Message 910419 - Posted: 23 Jun 2009, 12:44:55 UTC

Excellent news Matt. I know the community in general is probably glad to hear that AP work is coming, and Im sure your pipeline/databases are going to like you more as well.
ID: 910419 · Report as offensive
Profile Dr. C.E.T.I.
Avatar

Send message
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 910468 - Posted: 23 Jun 2009, 16:19:08 UTC



. . . Thanks for the Updates Matt

< To All @ Berkeley - the Best to Each of You Today - May All go well . . .




BOINC Wiki . . .

Science Status Page . . .
ID: 910468 · Report as offensive
Profile S@NL - Eesger - www.knoop.nl
Avatar

Send message
Joined: 7 Oct 01
Posts: 385
Credit: 50,200,038
RAC: 0
Netherlands
Message 910688 - Posted: 24 Jun 2009, 11:42:22 UTC

FYI, the thread of 23 June is unreadable:

Database Error
Unable to handle request

No thread with id 54313. Please check the link and try again.

The SETI@Home Gauntlet 2012 april 16 - 30| info / chat | STATS
ID: 910688 · Report as offensive
RoosStar

Send message
Joined: 16 Oct 99
Posts: 51
Credit: 12,900,339
RAC: 20
Netherlands
Message 910703 - Posted: 24 Jun 2009, 12:38:25 UTC

Not only this thread. :-(
I have seen this error in the NC forum also.
And I have noticed that some threads says that the last post was made a few hours ago but in the thread itself the last post was made yesterday or even months ago.
See as exsample these threads:

http://setiathome.berkeley.edu/forum_thread.php?id=54092
http://setiathome.berkeley.edu/forum_thread.php?id=53702
ID: 910703 · Report as offensive
Profile Robi

Send message
Joined: 24 Oct 00
Posts: 33
Credit: 886,890
RAC: 1
United States
Message 910705 - Posted: 24 Jun 2009, 12:40:29 UTC

BTW, whoever kicked the server this morning, thanks a lot :)
when I looked up server status after not being able to see my tasks (results) I saw red
muchly appreciated, muchísimas gracias, obrigado, vielen herzlichen Dank, merci beaucoup, domo arigato gosaimashita, efcharistó, mille grazie, hvala lijepo, spasibo bolshoe
Robi
ID: 910705 · Report as offensive
C

Send message
Joined: 3 Apr 99
Posts: 240
Credit: 7,716,977
RAC: 0
United States
Message 910710 - Posted: 24 Jun 2009, 12:51:17 UTC

ID: 910710 · Report as offensive
JPP

Send message
Joined: 31 May 99
Posts: 18
Credit: 59,436,360
RAC: 47
France
Message 910746 - Posted: 24 Jun 2009, 15:50:22 UTC

well
something else
currently i can t download wu; (from server to my pc)
they all are pending and all copy fail with wrong size; as below
24/06/2009 17:46:58 SETI@home Started download of 06ap09ac.31280.4162.13.8.0
24/06/2009 17:46:59 Internet access OK - project servers may be temporarily down.
24/06/2009 17:47:20 Project communication failed: attempting access to reference site
24/06/2009 17:47:20 SETI@homeTemporarily failed download of 06ap09ac.31280.4162.13.8.0: connect() failed
24/06/2009 17:47:20 SETI@home Backing off 15 min 54 sec on download of 06ap09ac.31280.4162.13.8.0
24/06/2009 17:47:20 SETI@home [error] File 06ap09ac.31280.3753.13.8.31 has wrong size: expected 375333, got 0
24/06/2009 17:47:20 SETI@home Started download of 06ap09ac.31280.3753.13.8.31
24/06/2009 17:47:21 Internet access OK - project servers may be temporarily down.
24/06/2009 17:47:38 Project communication failed: attempting access to reference site
24/06/2009 17:47:38 SETI@home Temporarily failed download of 01mr09af.22260.17659.13.8.97: HTTP error
24/06/2009 17:47:38 SETI@home Backing off 2 min 3 sec on download of 01mr09af.22260.17659.13.8.97
24/06/2009 17:47:38 SETI@home [error] File 06ap09ac.31280.3753.13.8.144 has wrong size: expected 375331, got 0
24/06/2009 17:47:38 SETI@home Started download of 06ap09ac.31280.3753.13.8.144
24/06/2009 17:47:39 Internet access OK - project servers may be temporarily down.

...
cheers
jeanpierre@jpp

ID: 910746 · Report as offensive
Radford Bunker

Send message
Joined: 12 Mar 09
Posts: 8
Credit: 6,073,787
RAC: 0
United States
Message 910754 - Posted: 24 Jun 2009, 16:12:17 UTC

I'm getting much the same as Jeanpierre:

Wed Jun 24 12:04:32 2009|SETI@home|Started download of 06ap09ab.914.4571.15.8.189
Wed Jun 24 12:06:42 2009|SETI@home|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks
Wed Jun 24 12:06:48 2009|SETI@home|Scheduler request succeeded: got 0 new tasks
Wed Jun 24 12:07:00 2009|SETI@home|Started download of 01mr09af.22260.22976.13.8.195
Wed Jun 24 12:07:44 2009||Project communication failed: attempting access to reference site
Wed Jun 24 12:07:44 2009|SETI@home|Temporarily failed download of 06ap09ab.914.4571.15.8.189: HTTP error
Wed Jun 24 12:07:44 2009|SETI@home|Backing off 1 hr 9 min 40 sec on download of 06ap09ab.914.4571.15.8.189
Wed Jun 24 12:07:44 2009|SETI@home|Started download of 06ap09aa.31651.10706.15.8.205
Wed Jun 24 12:07:45 2009||Internet access OK - project servers may be temporarily down.
Wed Jun 24 12:08:15 2009||Project communication failed: attempting access to reference site
Wed Jun 24 12:08:15 2009|SETI@home|Temporarily failed download of 01mr09af.22260.22976.13.8.195: connect() failed
Wed Jun 24 12:08:15 2009|SETI@home|Backing off 1 hr 29 min 42 sec on download of 01mr09af.22260.22976.13.8.195
Wed Jun 24 12:08:15 2009|SETI@home|Started download of 06ap09ab.20174.23794.14.8.5
Wed Jun 24 12:08:16 2009||Internet access OK - project servers may be temporarily down.
Wed Jun 24 12:09:30 2009||Project communication failed: attempting access to reference site
Wed Jun 24 12:09:30 2009|SETI@home|Temporarily failed download of 06ap09ab.20174.23794.14.8.5: connect() failed
Wed Jun 24 12:09:30 2009|SETI@home|Backing off 1 hr 6 min 4 sec on download of 06ap09ab.20174.23794.14.8.5
Wed Jun 24 12:09:30 2009|SETI@home|Started download of 01mr09ad.18953.68833.16.8.126
Wed Jun 24 12:09:31 2009||Internet access OK - project servers may be temporarily down.
ID: 910754 · Report as offensive
Radford Bunker

Send message
Joined: 12 Mar 09
Posts: 8
Credit: 6,073,787
RAC: 0
United States
Message 910759 - Posted: 24 Jun 2009, 16:24:26 UTC

Oh, and all my tasks are "downloading": seems like one big mess.




ID: 910759 · Report as offensive
dydek

Send message
Joined: 3 Jul 01
Posts: 3
Credit: 1,421,791
RAC: 0
United States
Message 910794 - Posted: 24 Jun 2009, 18:00:07 UTC

Ditto in Chicago. Download is starting but receiving zero bytes for each file, in two different location on Mac and Win.

Darek
ID: 910794 · Report as offensive
dydek

Send message
Joined: 3 Jul 01
Posts: 3
Credit: 1,421,791
RAC: 0
United States
Message 910795 - Posted: 24 Jun 2009, 18:01:11 UTC

BTW, the top sticky thread (Long outgae or something like that) seems to be broken as well. I get an error when clicked.
ID: 910795 · Report as offensive
Danny Sosebee

Send message
Joined: 10 Jun 02
Posts: 53
Credit: 1,022,592
RAC: 0
United States
Message 910800 - Posted: 24 Jun 2009, 18:19:20 UTC - in response to Message 910795.  

BTW, the top sticky thread (Long outgae or something like that) seems to be broken as well. I get an error when clicked.


Yep, same thing happens when I try it. Also having problems downloading work so I suspect it's a database error. I'm sure they'll fix it as soon as they can.
ID: 910800 · Report as offensive
Profile Space Cowboy
Volunteer tester
Avatar

Send message
Joined: 24 Apr 00
Posts: 43
Credit: 1,730,621
RAC: 0
United Kingdom
Message 910802 - Posted: 24 Jun 2009, 18:30:25 UTC

Anyone noticed that the works units they managed to send back today still appear as in progress?
ID: 910802 · Report as offensive
Radford Bunker

Send message
Joined: 12 Mar 09
Posts: 8
Credit: 6,073,787
RAC: 0
United States
Message 910819 - Posted: 24 Jun 2009, 19:22:07 UTC - in response to Message 910800.  


Dan Sosebee,

From yours:

Yep, same thing happens when I try it. Also having problems downloading work so I suspect it's a database error. I'm sure they'll fix it as soon as they can.


I'm sure they will, I just wish they'd post some information about what is going on.

Rad
ID: 910819 · Report as offensive
1mp0£173
Volunteer tester

Send message
Joined: 3 Apr 99
Posts: 8423
Credit: 356,897
RAC: 0
United States
Message 910874 - Posted: 24 Jun 2009, 21:24:05 UTC - in response to Message 910819.  

I just wish they'd post some information about what is going on.

Do you want them to stop working on the problem(s) and post, or do you want them to put all of their efforts into fixing it and post later?

ID: 910874 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 910886 - Posted: 24 Jun 2009, 22:17:38 UTC - in response to Message 910705.  

BTW, whoever kicked the server this morning, thanks a lot :)
when I looked up server status after not being able to see my tasks (results) I saw red
muchly appreciated, muchísimas gracias, obrigado, vielen herzlichen Dank, merci beaucoup, domo arigato gosaimashita, efcharistó, mille grazie, hvala lijepo, spasibo bolshoe


I've never quite understood why people get so emotional (angry, seeing red) over anything at this project when 99% of the time things can be explained in a rational way when given a chance and a bit of patience.
ID: 910886 · Report as offensive
1 · 2 · Next

Message boards : Technical News : Not Perfect but Better (Jun 22 2009)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.