Panic Mode On (96) Server Problems?

Message boards : Number crunching : Panic Mode On (96) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · Next

AuthorMessage
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13720
Credit: 208,696,464
RAC: 304
Australia
Message 1662963 - Posted: 9 Apr 2015, 9:32:16 UTC - in response to Message 1662956.  
Last modified: 9 Apr 2015, 9:32:48 UTC

15mins more and my 2 rigs will be totally onto backup work. :-(

Cheers.

I'm using it as a chance to reduce my power bill. Dropped from 420W or so down to 185W, the old Core 2 Duo still chugging along.
UPS run time has gone from less than 10min to over 30min.
Grant
Darwin NT
ID: 1662963 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1662970 - Posted: 9 Apr 2015, 9:44:09 UTC - in response to Message 1662963.  

15mins more and my 2 rigs will be totally onto backup work. :-(

Cheers.

I'm using it as a chance to reduce my power bill. Dropped from 420W or so down to 185W, the old Core 2 Duo still chugging along.
UPS run time has gone from less than 10min to over 30min.

With the temps dropping up here in the highlands Grant I need all the heat that the rigs can produce at this time of year. ;-)

Cheers.
ID: 1662970 · Report as offensive
Rasputin42
Volunteer tester

Send message
Joined: 25 Jul 08
Posts: 412
Credit: 5,834,661
RAC: 0
United States
Message 1662971 - Posted: 9 Apr 2015, 9:47:02 UTC

Maybe Seti is too big. They should split it into a few INDEPENDENT Projects.
That way,there should allways be some work to crunch.
Seti had reliabilaty issues, ever since i knew about it, years ago.
ID: 1662971 · Report as offensive
Profile Brent Norman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 1 Dec 99
Posts: 2786
Credit: 685,657,289
RAC: 835
Canada
Message 1662974 - Posted: 9 Apr 2015, 10:00:09 UTC - in response to Message 1662971.  

I think one big problem there is with SETI is the size of the MB task files. They are so small and designed for way back when everyone had modems.

Now they are just swamped sending out all those little files and processing them. I seen the 'big boys' bragging that they do a task every 9s, wow talk about server load. Split,send,receive,verify,assimilate ... hell that takes them more time than the big crunchers do!

Sure I could just say "make the task files bigger" but then begins the monumental task of combining new and old data.
ID: 1662974 · Report as offensive
Profile Raistmer
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 16 Jun 01
Posts: 6325
Credit: 106,370,077
RAC: 121
Russia
Message 1662985 - Posted: 9 Apr 2015, 10:53:54 UTC - in response to Message 1662974.  


Sure I could just say "make the task files bigger" but then begins the monumental task of combining new and old data.

It was in plans more than year already. Things going slow these days.
AFAIK that task "biggering" now replaced with new project of SETI 8. Not only bigger data chunk size but maybe some new ways to process that data. Too little details known still.
ID: 1662985 · Report as offensive
The Jedi Alliance - Ranger
Avatar

Send message
Joined: 27 Dec 00
Posts: 72
Credit: 60,982,863
RAC: 0
United States
Message 1663036 - Posted: 9 Apr 2015, 13:40:02 UTC

It's 6:40 AM Thursday Berkeley time. GPU's have been out of work since Tuesday afternoon. CPU's will be out of work in 4-5 hours. It's cold outside and my computers aren't generating enough heat to keep my office warm.

I actually got 1 Cuda WU last night. 15 minutes later it was gone...
ID: 1663036 · Report as offensive
FeK9

Send message
Joined: 20 May 99
Posts: 40
Credit: 61,229,677
RAC: 26
South Africa
Message 1663043 - Posted: 9 Apr 2015, 13:51:25 UTC

Ping :)
Noli tangere circulos meos...
ID: 1663043 · Report as offensive
Victor Wedge
Avatar

Send message
Joined: 3 Apr 04
Posts: 28
Credit: 12,569,503
RAC: 0
Message 1663049 - Posted: 9 Apr 2015, 14:01:47 UTC
Last modified: 9 Apr 2015, 14:07:01 UTC

Alright!!!! New GPU WUs just arrived! :-)



EDIT: Aw shucks! I only got 4 Cuda WUs. That will last through my 2nd cup of coffee.
ID: 1663049 · Report as offensive
Profile Cactus Bob
Avatar

Send message
Joined: 19 May 99
Posts: 209
Credit: 10,924,287
RAC: 29
Canada
Message 1663051 - Posted: 9 Apr 2015, 14:11:04 UTC

Hopefully you got more than I got this morning when I turned my computer on. They lasted 4 minutes each and then I got 0 tasks. I am assuming those are resends of tasks that have failed or went invalid on other computers. A simple resend/

Just a guess I don't think it means the DB is all fixed and happy again.

Bob

--------------
Working on the perfect sig file
Sometimes I wonder, what happened to all the people I gave directions to?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
SETI@home classic workunits 4,321
SETI@home classic CPU time 22,169 hours
ID: 1663051 · Report as offensive
Victor Wedge
Avatar

Send message
Joined: 3 Apr 04
Posts: 28
Credit: 12,569,503
RAC: 0
Message 1663059 - Posted: 9 Apr 2015, 14:52:23 UTC

The four WUs I received an hour ago completed quickly. But, GREAT NEWS ... I just checked again and I've now got a full load. :-)
ID: 1663059 · Report as offensive
WezH
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 576
Credit: 67,033,957
RAC: 95
Finland
Message 1663062 - Posted: 9 Apr 2015, 14:59:07 UTC - in response to Message 1663059.  
Last modified: 9 Apr 2015, 15:05:07 UTC

The four WUs I received an hour ago completed quickly. But, GREAT NEWS ... I just checked again and I've now got a full load. :-)


You lucky one. One user did abort almost 200 tasks and You got them...

And something weird... Your host 7334605 has 340 tasks in progress when it should have only 200 (100 for CPU and 100 for GPU).... Or is that You have lost tasks in progress? Did You made Reset Project or something similar?
"Please keep Your signature under four lines so Internet traffic doesn't go up too much"

- In 1992 when I had my first e-mail address -
ID: 1663062 · Report as offensive
Victor Wedge
Avatar

Send message
Joined: 3 Apr 04
Posts: 28
Credit: 12,569,503
RAC: 0
Message 1663073 - Posted: 9 Apr 2015, 15:31:45 UTC - in response to Message 1663062.  

And something weird... Your host 7334605 has 340 tasks in progress when it should have only 200 (100 for CPU and 100 for GPU).... Or is that You have lost tasks in progress? Did You made Reset Project or something similar?

I don't know the answer to your question. I haven't reset. I'm seeing 187 WUs in queue, and 12 crunching = 199.

I also noticed on checking that I've got a really high number of errors (46). I believe the most I've seen previously was 3 or 4.
ID: 1663073 · Report as offensive
WezH
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 576
Credit: 67,033,957
RAC: 95
Finland
Message 1663081 - Posted: 9 Apr 2015, 15:40:15 UTC - in response to Message 1663073.  

I also noticed on checking that I've got a really high number of errors (46). I believe the most I've seen previously was 3 or 4.


After browsing Your "In Progress" I did notice that Your host has 10 'Anonymous platform' tasks, all sent 19 Mar 2015. Did You install optimized (Lunatics) application and then uninstalled it?

If so, that could cause so many In Progress -tasks and errors. Servers currently don't send "lost" tasks. So if You do reset or install/uninstall optimized applications, those tasks which were in progress at that time will be lost, server just sends new tasks, not those which were in progress.

Eventually those tasks will time out.
"Please keep Your signature under four lines so Internet traffic doesn't go up too much"

- In 1992 when I had my first e-mail address -
ID: 1663081 · Report as offensive
Victor Wedge
Avatar

Send message
Joined: 3 Apr 04
Posts: 28
Credit: 12,569,503
RAC: 0
Message 1663082 - Posted: 9 Apr 2015, 15:43:07 UTC - in response to Message 1663081.  

I also noticed on checking that I've got a really high number of errors (46). I believe the most I've seen previously was 3 or 4.


After browsing Your "In Progress" I did notice that Your host has 10 'Anonymous platform' tasks, all sent 19 Mar 2015. Did You install optimized (Lunatics) application and then uninstalled it?

<snip>

Eventually those tasks will time out.

I guess that explains it, to you at least (heh). Yes, I did install/uninstall Lunatics.
ID: 1663082 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1663088 - Posted: 9 Apr 2015, 15:47:35 UTC - in response to Message 1663073.  
Last modified: 9 Apr 2015, 15:55:09 UTC

And something weird... Your host 7334605 has 340 tasks in progress when it should have only 200 (100 for CPU and 100 for GPU).... Or is that You have lost tasks in progress? Did You made Reset Project or something similar?

I don't know the answer to your question. I haven't reset. I'm seeing 187 WUs in queue, and 12 crunching = 199.

I also noticed on checking that I've got a really high number of errors (46). I believe the most I've seen previously was 3 or 4.

Your host has these numbers;
In progress (347) · Validation pending (122) · Validation inconclusive (5) · Valid (234) · Invalid (0) · Error (46)
That means you have at Least 147 lost tasks attributed to your host. Looking at the task list, every task from this point onward is most likely a lost task that will eventually time out, http://setiathome.berkeley.edu/results.php?hostid=7334605&offset=560&show_names=0&state=0&appid=
Due to tasks having different expiration dates, some will expire sooner than others.

As others have said, turning off "resend lost tasks" has contributed to problems with the database before. There isn't any reason to think the current database problems haven't been assisted by someone turning off "resend lost tasks".
ID: 1663088 · Report as offensive
Victor Wedge
Avatar

Send message
Joined: 3 Apr 04
Posts: 28
Credit: 12,569,503
RAC: 0
Message 1663102 - Posted: 9 Apr 2015, 16:05:02 UTC - in response to Message 1663088.  

As others have said, turning off "resend lost tasks" has contributed to problems with the database before. There isn't any reason to think the current database problems haven't been assisted by someone turning off "resend lost tasks".

You're speaking slightly over my head. Please clear this up for me. I don't know what you mean by 'turning off "resend lost tasks"'. It sounds as if you're suggesting I did something wrong which could cause DB problems. Or failed to do something I should have done.

I haven't done anything of this nature that I'm aware of. The last thing I want to do is cause problems to the system.
ID: 1663102 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1663105 - Posted: 9 Apr 2015, 16:12:43 UTC - in response to Message 1663102.  

As others have said, turning off "resend lost tasks" has contributed to problems with the database before. There isn't any reason to think the current database problems haven't been assisted by someone turning off "resend lost tasks".

You're speaking slightly over my head. Please clear this up for me. I don't know what you mean by 'turning off "resend lost tasks"'. It sounds as if you're suggesting I did something wrong which could cause DB problems. Or failed to do something I should have done.

I haven't done anything of this nature that I'm aware of. The last thing I want to do is cause problems to the system.

No worries...you did not do anything wrong.
'Resend lost tasks' is a feature of Boinc that is turned on and off on the servers, not by the users. Right now it has been disabled due to the extra load it puts on the servers and database, and the database has been a bit troublesome lately.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1663105 · Report as offensive
TBar
Volunteer tester

Send message
Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1663109 - Posted: 9 Apr 2015, 16:13:53 UTC - in response to Message 1663102.  

Unless you have direct control over the servers I doubt you have done anything "wrong".
The people controlling the servers turned off 'resend lost tasks' at the server level suspecting that feature of somehow interfering with their webpage server. Some people here told them at the time that turning off that feature has resulted in database problems in the past. We now have database problems currently....I'm sure it's just a coincidence.
;-)
ID: 1663109 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1663111 - Posted: 9 Apr 2015, 16:18:11 UTC

Meanwhile, back on the ranch. Err....the kitty crunching farm....

The kitties are wandering about aimlessly meowing out loud and sniffing about looking for Seti WUs......
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1663111 · Report as offensive
Victor Wedge
Avatar

Send message
Joined: 3 Apr 04
Posts: 28
Credit: 12,569,503
RAC: 0
Message 1663113 - Posted: 9 Apr 2015, 16:19:05 UTC

Well, that's certainly a relief. Thanks to both of you for the explanation. ;-)
ID: 1663113 · Report as offensive
Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · Next

Message boards : Number crunching : Panic Mode On (96) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.