Panic Mode On (85) Server Problems?

Message boards : Number crunching : Panic Mode On (85) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · Next

AuthorMessage
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1453545 - Posted: 13 Dec 2013, 14:36:39 UTC - in response to Message 1453540.  

One of my units failed with this error.
Long time no see.

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>22jl08aa.7244.207012.438086664195.12.53_4_0</file_name>
<error_code>-131</error_code>
</file_xfer_error>


I've got 3 of those too. Every host on every one of them reports the same error. One has already reached "too many errors" status and the others will soon.

If I had time, I'd remote into my machine and abort any more of that series that it may have.

Specifically, abort any WUs that have a 411KB datapak size, instead of the normal 367KB (as discussed earlier in this thread).

That's worth doing, because the error happens at the end of processing, so a lot of time and power is wasted.
ID: 1453545 · Report as offensive
BetelgeuseFive Project Donor
Volunteer tester

Send message
Joined: 6 Jul 99
Posts: 158
Credit: 17,117,787
RAC: 19
Netherlands
Message 1453575 - Posted: 13 Dec 2013, 16:30:20 UTC
Last modified: 13 Dec 2013, 16:32:28 UTC

I'm having download issues:

13-12-2013 17:25:30 | SETI@home | [error] MD5 check failed for ap_02no13aa_B2_P0_00300_20131212_14449.wu
13-12-2013 17:25:30 | SETI@home | [error] expected 7534b6a53c1d19a55c5f7bc3e428a54c, got 50a4ec6e1c6df9bb1590e3e2c446d805
13-12-2013 17:25:30 | SETI@home | [error] Checksum or signature error for ap_02no13aa_B2_P0_00300_20131212_14449.wu

Never seen that before. Related to the upload issues ?

EDIT: looks like I'm not the only one having issues with this file:

http://setiathome.berkeley.edu/workunit.php?wuid=1377309002

Tom
ID: 1453575 · Report as offensive
kittyman Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Jul 00
Posts: 51468
Credit: 1,018,363,574
RAC: 1,004
United States
Message 1453576 - Posted: 13 Dec 2013, 16:35:06 UTC - in response to Message 1453575.  
Last modified: 13 Dec 2013, 16:50:53 UTC

I'm having download issues:

13-12-2013 17:25:30 | SETI@home | [error] MD5 check failed for ap_02no13aa_B2_P0_00300_20131212_14449.wu
13-12-2013 17:25:30 | SETI@home | [error] expected 7534b6a53c1d19a55c5f7bc3e428a54c, got 50a4ec6e1c6df9bb1590e3e2c446d805
13-12-2013 17:25:30 | SETI@home | [error] Checksum or signature error for ap_02no13aa_B2_P0_00300_20131212_14449.wu

Never seen that before. Related to the upload issues ?

EDIT: looks like I'm not the only one having issues with this file:

http://setiathome.berkeley.edu/workunit.php?wuid=1377309002

Tom

I had 3 failed downloads about 2 hours ago, but things seem to have been OK since then.

EDIT...And one more just 10 minutes ago.
"Freedom is just Chaos, with better lighting." Alan Dean Foster

ID: 1453576 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1453911 - Posted: 14 Dec 2013, 15:09:12 UTC - in response to Message 1453902.  

Looks like the problem with georgem, still shows up from time to time. I just got this one:

Samsung-Laptop

SETI@home 2013-12-14 15:39 Sending scheduler request: To fetch work.
SETI@home 2013-12-14 15:39 Requesting new tasks for CPU
SETI@home 2013-12-14 15:39 Scheduler request completed: got 1 new tasks
SETI@home 2013-12-14 15:39 Started download of ap_02dc13ac_B5_P1_00219_20131212_17429.wu
SETI@home 2013-12-14 15:39 Giving up on download of ap_02dc13ac_B5_P1_00219_20131212_17429.wu: file not found

Won't be around much longer - there were a couple after you, but no more ;)

http://setiathome.berkeley.edu/workunit.php?wuid=1377309023

ID: 1453911 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1453937 - Posted: 14 Dec 2013, 17:10:23 UTC - in response to Message 1453545.  

One of my units failed with this error.
Long time no see.

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>22jl08aa.7244.207012.438086664195.12.53_4_0</file_name>
<error_code>-131</error_code>
</file_xfer_error>


I've got 3 of those too. Every host on every one of them reports the same error. One has already reached "too many errors" status and the others will soon.

If I had time, I'd remote into my machine and abort any more of that series that it may have.

Specifically, abort any WUs that have a 411KB datapak size, instead of the normal 367KB (as discussed earlier in this thread).

That's worth doing, because the error happens at the end of processing, so a lot of time and power is wasted.

It appears I caught one of these overnight, WU #1375646975. It came and went while I was sleeping, so no chance to do anything about it. It also looks like it'll be sent out to a couple more unsuspecting hosts before it finally dies.
ID: 1453937 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1453945 - Posted: 14 Dec 2013, 17:45:38 UTC - in response to Message 1453937.  

One of my units failed with this error.
Long time no see.

</stderr_txt>
<message>
upload failure: <file_xfer_error>
<file_name>22jl08aa.7244.207012.438086664195.12.53_4_0</file_name>
<error_code>-131</error_code>
</file_xfer_error>


I've got 3 of those too. Every host on every one of them reports the same error. One has already reached "too many errors" status and the others will soon.

If I had time, I'd remote into my machine and abort any more of that series that it may have.

Specifically, abort any WUs that have a 411KB datapak size, instead of the normal 367KB (as discussed earlier in this thread).

That's worth doing, because the error happens at the end of processing, so a lot of time and power is wasted.

It appears I caught one of these overnight, WU #1375646975. It came and went while I was sleeping, so no chance to do anything about it. It also looks like it'll be sent out to a couple more unsuspecting hosts before it finally dies.

Just found another one lurking on a different machine, WU #1375888293. It only arrived about an hour ago, so I aborted it and it looks like it won't be resent, although it's still "In progress" on one unlucky host.
ID: 1453945 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1453983 - Posted: 14 Dec 2013, 20:41:58 UTC - in response to Message 1453945.  
Last modified: 14 Dec 2013, 20:42:39 UTC

More problems brewing?
Aother stuck tape? Result creation rate has dropped off, ready-to-send buffer is OK, but showing signs of being affected.
Also the AP Awaiting Validation & assimilator queues are backing up.
Also noticed a lot more database queries per seconbd than usual over the last few feeks. Usually it's less than 1,000 with a couple of spkies over 1,000 each day. Last week it was around 1,500 as a minimum for most of the week. Even now it's running around 1,000 with spikes up to 3,000.
Grant
Darwin NT
ID: 1453983 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1453987 - Posted: 14 Dec 2013, 20:55:06 UTC
Last modified: 14 Dec 2013, 20:57:57 UTC

What is: <error_code>-131</error_code>

I get few WU crunched and at the end exit with this error.
ID: 1453987 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1453989 - Posted: 14 Dec 2013, 21:17:08 UTC - in response to Message 1453987.  

What is: <error_code>-131</error_code>

With the crash of the sever there were a batch of WUs that are around 416kB in size instead of the usual 367kB (or so) in size. That's the error that results when they are processed & you try to report them.

Just had a look in my data directory & noticed a bunch that are 22kB in size.

Grant
Darwin NT
ID: 1453989 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1453990 - Posted: 14 Dec 2013, 21:20:05 UTC - in response to Message 1453989.  
Last modified: 14 Dec 2013, 21:20:21 UTC

What is: <error_code>-131</error_code>

With the crash of the sever there were a batch of WUs that are around 416kB in size instead of the usual 367kB (or so) in size. That's the error that results when they are processed & you try to report them.

Just had a look in my data directory & noticed a bunch that are 22kB in size.

Thanks for the info.

Anything to worry about beside the loose of processing time? If they are with the wrong size and will end on a error i imagine is easy to clear them at the server side or not?
ID: 1453990 · Report as offensive
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14650
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1453992 - Posted: 14 Dec 2013, 21:23:40 UTC - in response to Message 1453989.  

What is: <error_code>-131</error_code>

With the crash of the sever there were a batch of WUs that are around 416kB in size instead of the usual 367kB (or so) in size. That's the error that results when they are processed & you try to report them.

Just had a look in my data directory & noticed a bunch that are 22kB in size.

The small ones are normally (but check) ones which end with a double underscore: _n_0

Those are the results you're crunching ready to send back to the server. Don't delete them, whatever you do!
ID: 1453992 · Report as offensive
Profile Bill G Special Project $75 donor
Avatar

Send message
Joined: 1 Jun 01
Posts: 1282
Credit: 187,688,550
RAC: 182
United States
Message 1453993 - Posted: 14 Dec 2013, 21:28:56 UTC - in response to Message 1453989.  
Last modified: 14 Dec 2013, 21:30:34 UTC


Just had a look in my data directory & noticed a bunch that are 22kB in size.

Take a look at those small files again Grant, the appear to be additional files attached to a vlar with a vlar_1_0 suffix on them. They have the same name as the vlar.
I have no idea what that means however?

edit::I see you answered my question Richard. Thanks for that info.

SETI@home classic workunits 4,019
SETI@home classic CPU time 34,348 hours
ID: 1453993 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1454032 - Posted: 14 Dec 2013, 22:58:07 UTC - in response to Message 1453983.  

Aother stuck tape?

Actually, I think 20jn12ac might finally be unstuck, after all these months. I just noticed a task from that file just arrived on one of my machines.
ID: 1454032 · Report as offensive
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13736
Credit: 208,696,464
RAC: 304
Australia
Message 1454053 - Posted: 15 Dec 2013, 0:45:05 UTC - in response to Message 1453993.  

edit::I see you answered my question Richard. Thanks for that info.

Yep, thanks Richard.
Grant
Darwin NT
ID: 1454053 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34754
Credit: 261,360,520
RAC: 489
Australia
Message 1454054 - Posted: 15 Dec 2013, 0:59:59 UTC - in response to Message 1454032.  

Aother stuck tape?

Actually, I think 20jn12ac might finally be unstuck, after all these months. I just noticed a task from that file just arrived on one of my machines.

It'll be interesting to see if 20jn12ac actually finishes this time without causing the bad effect like it has many times before.

Cheers.
ID: 1454054 · Report as offensive
Cosmic_Ocean
Avatar

Send message
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1454055 - Posted: 15 Dec 2013, 1:04:54 UTC

Pretty impressive shape of the throughput on the cricket graph. 560mbit for..approaching 30 continuous hours now.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1454055 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1454085 - Posted: 15 Dec 2013, 5:05:01 UTC - in response to Message 1454054.  

Aother stuck tape?

Actually, I think 20jn12ac might finally be unstuck, after all these months. I just noticed a task from that file just arrived on one of my machines.

It'll be interesting to see if 20jn12ac actually finishes this time without causing the bad effect like it has many times before.

Cheers.

It certainly does look like its stuck again, on the last channel. Perhaps they need to shoot it and put it out of it's misery. However, the tasks I did receive from that file seem to process okay. I just finished 3 on one machine, WU #s 1378744682, 1378744651, and 1378744673, without any problems, so I guess the data's okay, if the splitter can just dig it out. I've also got 2 in the queue on another machine, but they probably won't run until tomorrow.
ID: 1454085 · Report as offensive
Profile Jaudy
Avatar

Send message
Joined: 13 Jan 10
Posts: 128
Credit: 1,651,057
RAC: 0
Canada
Message 1454086 - Posted: 15 Dec 2013, 5:13:26 UTC
Last modified: 15 Dec 2013, 5:16:28 UTC

Hi !

Four days ago. my modem stopped to function. I called the internet server (Bell Canada) and they sent me a new one. Since I restart the connexion with Seti, it seems that I can not send you the results of the works. Plus, Seti give me a new user number. I spoke with Bell Canada (Server service) and my internet connexion is OK. But having completed 10 works for you and nothing has been transferred for validation. The forums access are functioning properly.

Please, I need suggestions...

Thank you.
ID: 1454086 · Report as offensive
Profile Tim
Volunteer tester
Avatar

Send message
Joined: 19 May 99
Posts: 211
Credit: 278,575,259
RAC: 0
Greece
Message 1454091 - Posted: 15 Dec 2013, 6:29:28 UTC - in response to Message 1454086.  

Hi !

Four days ago. my modem stopped to function. I called the internet server (Bell Canada) and they sent me a new one. Since I restart the connexion with Seti, it seems that I can not send you the results of the works. Plus, Seti give me a new user number. I spoke with Bell Canada (Server service) and my internet connexion is OK. But having completed 10 works for you and nothing has been transferred for validation. The forums access are functioning properly.

Please, I need suggestions...

Thank you.


Disable the internal firewall of the modem to see what happen.

Tim
ID: 1454091 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1454262 - Posted: 15 Dec 2013, 20:04:10 UTC - in response to Message 1454086.  
Last modified: 15 Dec 2013, 20:06:59 UTC

If you have a new modem this will not influence BOINC.

At least a reboot of the PC and the new modem should work with the PC (maybe - after successful detection/installation of driver).

If you open the BOINC Manager/tab 'Transfers', there are deferred results ready for upload?
Select the results and press 'retry' or something similar (I use an other Manager).

If this won't resolve your problem, please write a message again.

* Best regards! :-) * Philip J. Fry, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. *
ID: 1454262 · Report as offensive
Previous · 1 . . . 18 · 19 · 20 · 21 · 22 · 23 · Next

Message boards : Number crunching : Panic Mode On (85) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.