Stoppage of crunching with XP and 4.05

Pascal, K G
Volunteer tester
Joined: 3 Apr 99
Posts: 2343
Credit: 150,491
RAC: 0
United States
Message 21460 - Posted: 2 Sep 2004, 15:25:50 UTC
Last modified: 2 Sep 2004, 15:30:16 UTC

Just had BOINC stop crunching when it preempted CPDN to start on a SETI WU. The SETI WU showed as running under the status tab, but the time to completion never moved until I shut BOINC down and restarted. All is well now; this is the first time I have had any problems.

P4 3.0
1gb ram
DSL
XP sp1
Ver 4.05

On both my boxes I only get 1 or 2 WUs at a time and never have more than 3 on hand, even though I have the cache set to 7 days. Must be the way it works now...




M7 Seti@h Berkeley's Staff Friends Club ©
ID: 21460
Tony Martin
Joined: 5 Dec 99
Posts: 91
Credit: 69,723
RAC: 0
United States
Message 21479 - Posted: 2 Sep 2004, 16:17:45 UTC

Read the posts in this thread: http://setiweb.ssl.berkeley.edu/forum_thread.php?id=3303

and let me know if it's the same problem as mine.

Thanks
Tony
ID: 21479
JAF
Joined: 9 Aug 00
Posts: 289
Credit: 168,721
RAC: 0
United States
Message 21496 - Posted: 2 Sep 2004, 17:15:27 UTC - in response to Message 21468.  

I think I had a similar problem. WinXP, SP2, Boinc 4.05, Seti 4.03. When I checked my notebook computer this morning, I noticed the WU seemed to be stuck at 75.10%. Anyone that has a Dell 600M knows how warm the lower left side of the computer gets when it's crunching full blast, and it was cool. I checked the task manager and the system idle process was running at 95%; Seti 4.03 was at zero.

Exiting Boinc and restarting worked - the WU started crunching normally.
Power options are set to always on. Just the screen saver blanks at 10 minutes. Running on A/C.
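
For anyone who wants to catch this kind of stall without feeling the laptop case, here is a minimal Python sketch. It assumes you have some way to record (time, percent done) pairs for a WU, for example by noting the figure shown in the BOINC manager. The function name is_stalled and the 10-minute threshold are illustrative choices, not anything BOINC provides.

```python
def is_stalled(samples, min_seconds=600.0):
    """Report whether progress has been flat for at least min_seconds.

    samples: list of (unix_time, percent_done) tuples in time order.
    Hypothetical helper; how the samples are collected is up to you.
    """
    if not samples:
        return False
    last_time, last_pct = samples[-1]
    flat_since = last_time
    # Walk backwards while the percentage is unchanged.
    for when, pct in reversed(samples):
        if pct != last_pct:
            break
        flat_since = when
    return last_time - flat_since >= min_seconds


if __name__ == "__main__":
    # Made-up readings: progress sits at 75.10% from t=300 onward.
    readings = [(0, 74.90), (300, 75.10), (600, 75.10), (1200, 75.10)]
    print(is_stalled(readings))
```

A flat percentage alone does not prove the science application is hung, so it is still worth confirming in Task Manager that its CPU use is at zero.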
ID: 21496
Pascal, K G
Volunteer tester
Joined: 3 Apr 99
Posts: 2343
Credit: 150,491
RAC: 0
United States
Message 21499 - Posted: 2 Sep 2004, 17:17:12 UTC - in response to Message 21496.  

> I think I had a similar problem. WinXP, SP2, Boinc 4.05, Seti 4.03. When I
> checked my notebook computer this morning, I noticed the WU seemed to be stuck
> at 75.10%. Anyone that has a Dell 600M knows how warm the lower left side of
> the computer gets when it's crunching full blast, and it was cool. I checked
> the task manager and the system idle process was running at 95%; Seti 4.03 was
> at zero.
>
> Exiting Boinc and restarting work - the WU started crunching normally.
> Power options are set to always on. Just the screen saver blanks at 10
> minutes. Running on A/C.
Seti@h Berkeley's Staff Friends Club ©
ID: 21499
JAF
Joined: 9 Aug 00
Posts: 289
Credit: 168,721
RAC: 0
United States
Message 21574 - Posted: 2 Sep 2004, 21:33:16 UTC

I checked my stderr.txt file and found the last error was on 8/29/04. That kind of surprised me. I think that's the date I updated to 4.05, but I'm not sure.

Here's the error log:
2004-08-29 19:34:31 [SETI@home] Deferring communication with project for 18 minutes and 28 seconds
2004-08-29 19:34:53 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2004-08-29 19:34:53 [SETI@home] No schedulers responded
2004-08-29 19:34:53 [SETI@home] Deferring communication with project for 1 hours, 43 minutes, and 39 seconds
2004-08-29 19:34:56 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2004-08-29 19:34:56 [SETI@home] No schedulers responded
2004-08-29 19:34:56 [SETI@home] Deferring communication with project for 14 minutes and 58 seconds
2004-08-29 19:35:09 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2004-08-29 19:35:09 [SETI@home] No schedulers responded
2004-08-29 19:35:09 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2004-08-29 19:36:15 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2004-08-29 19:36:15 [SETI@home] No schedulers responded
2004-08-29 19:36:15 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2004-08-29 19:37:17 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2004-08-29 19:37:17 [SETI@home] No schedulers responded
2004-08-29 19:37:17 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2004-08-29 19:38:19 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2004-08-29 19:38:19 [SETI@home] No schedulers responded
2004-08-29 19:38:19 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2004-08-29 19:39:29 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2004-08-29 19:39:29 [SETI@home] No schedulers responded
2004-08-29 19:39:29 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2004-08-29 19:41:02 [SETI@home] No work from project
2004-08-29 19:41:02 [SETI@home] Deferring communication with project for 10 minutes and 0 seconds
2004-08-29 19:44:38 [SETI@home] No work from project
2004-08-29 19:44:38 [SETI@home] Deferring communication with project for 10 minutes and 0 seconds
2004-08-29 19:47:18 [SETI@home] No work from project
2004-08-29 19:47:18 [SETI@home] Deferring communication with project for 10 minutes and 0 seconds
2004-08-29 19:47:35 [SETI@home] No work from project
2004-08-29 19:47:35 [SETI@home] Deferring communication with project for 10 minutes and 0 seconds
2004-08-29 19:47:50 [SETI@home] No work from project
2004-08-29 19:47:50 [SETI@home] Deferring communication with project for 10 minutes and 0 seconds
2004-08-29 19:51:59 [SETI@home] No work from project
2004-08-29 19:51:59 [SETI@home] Deferring communication with project for 10 minutes and 0 seconds
2004-08-29 19:56:16 [SETI@home] No work from project
2004-08-29 19:56:16 [SETI@home] Deferring communication with project for 10 minutes and 0 seconds
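
A log like this is easier to read once the repeated lines are counted. The following is a rough Python sketch, assuming the line layout shown above (timestamp, project name in brackets, then the message). The function and category names are made up for illustration and are not part of BOINC.

```python
import re
from collections import Counter

# Matches lines such as:
# 2004-08-29 19:34:53 [SETI@home] No schedulers responded
LINE_RE = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) \[([^\]]+)\] (.*)$")

# Matches deferral lengths such as "1 hours, 43 minutes, and 39 seconds"
# or "18 minutes and 28 seconds".
DEFER_RE = re.compile(
    r"Deferring communication with project for "
    r"(?:(\d+) hours?, )?(?:(\d+) minutes?,? and )?(\d+) seconds"
)


def summarize(lines):
    """Count message kinds and collect deferral lengths in seconds."""
    kinds = Counter()
    deferrals = []
    for line in lines:
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip anything that is not a log line
        message = match.group(3)
        defer = DEFER_RE.search(message)
        if defer:
            hours, minutes, seconds = (int(g) if g else 0 for g in defer.groups())
            deferrals.append(hours * 3600 + minutes * 60 + seconds)
            kinds["deferral"] += 1
        elif message.startswith("Scheduler RPC to") and message.endswith("failed"):
            kinds["rpc_failed"] += 1
        elif message == "No schedulers responded":
            kinds["no_scheduler"] += 1
        elif message == "No work from project":
            kinds["no_work"] += 1
        else:
            kinds["other"] += 1
    return kinds, deferrals
```

Feeding it the lines of stderr.txt gives one count per message type plus the list of back-off intervals, which makes it easier to see whether a hang lines up with a run of failed scheduler requests.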
ID: 21574
LitchKiraly
Joined: 27 Feb 03
Posts: 1
Credit: 8,818
RAC: 0
United States
Message 21755 - Posted: 3 Sep 2004, 5:56:44 UTC
Last modified: 3 Sep 2004, 6:07:13 UTC

I too have seen this happen...

Given the log from the last poster, could it be that when the scheduler goes down, the preemptive scheduling component fails in its attempt to communicate with the scheduler, and that causes the WUs to stop processing?

The reason I think this may be the issue is that I believe the scheduler looks at your profile to see how much work you want to keep on hand and compares it to the amount you have left on your machine to process. Since it cannot do that when the scheduler is down, perhaps the GUI hangs and does not continue to process work because it has been unable to refer back to the CGI?



ID: 21755
Tony Martin
Joined: 5 Dec 99
Posts: 91
Credit: 69,723
RAC: 0
United States
Message 21776 - Posted: 3 Sep 2004, 7:13:57 UTC - in response to Message 21479.  
Last modified: 3 Sep 2004, 19:45:41 UTC

> Read the posts at this thread
> http://setiweb.ssl.berkeley.edu/forum_thread.php?id=3303
>
> and let me know if its the same problem as mine.
>
> Thanks
> Tony
>
>
My problem was caused by an old Win95 game. I've now set XP to run it in compatibility mode for Win95, and that seems to have fixed my problem. You can see my posts in the thread linked above. Hope you find out what is causing your problems. Sorry I wasn't any help.

Tony

9-3-2004
Correction: it didn't fix my problem, as posted in the above thread.
ID: 21776
Bakareth
Joined: 31 Aug 01
Posts: 44
Credit: 7,619,743
RAC: 0
United Kingdom
Message 21799 - Posted: 3 Sep 2004, 9:56:33 UTC - in response to Message 21776.  

Hi,
The same thing happened to me twice in the first 24 hours after I installed 4.05, and both times it required a restart to solve the problem. Oddly enough, it hasn't happened since (I've cursed it now though!), and I haven't changed any settings. Running XP with SP1, by the way.

Robert
S@h Berkeley's Staff Friends Club © member
ID: 21799
M4rtyn
Volunteer tester
Joined: 4 Aug 03
Posts: 48
Credit: 799,965
RAC: 0
United Kingdom
Message 21800 - Posted: 3 Sep 2004, 9:57:07 UTC - in response to Message 21755.  

> I too have seen this happen...
>
> given the text from the last poster could it be that when the scheduler goes
> down the Preemptive scheduler component failing in its attempt to communicate
> with the scheduler causes the w/u's to stop processing...
>
>
> The reason I think this may be an issue is because I believe that the
> scheduler looks to your profile for how much work you want to keep on hand and
> compares it to the amount you have left on your machine to process.. since it
> cannot do that in the event of a downed scheduler the Gui Hangs and does not
> continue to process work since it has been unable to refer back to the CGI?
>
>

I get the same problem too, regardless of the scheduler status.


M4rtyn

ID: 21800
mlcudd
Volunteer tester
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 21817 - Posted: 3 Sep 2004, 11:01:15 UTC
Last modified: 3 Sep 2004, 11:35:12 UTC

Hi All,
On this one machine so far, my units just stopped crunching. I exited, which did not correct it. I restarted, and all WUs waiting to be crunched turned into "Ready to report". I got a code 144 on the message screen, but this is what my "Sched Request Doc" (XML doc) stated:
global_prefs_source_email_hash>688417073152552886ed7d25edbd3629
-
http://setiathome.berkeley.edu/
100.000000
-
0.960481
0.960481
0.929826
-
345.706258
7347.547551
-
-18000
SIGNALSEARCHER
127.0.0.1
<p>1</p>
<p>Intel(R) Celeron(R) CPU 2.80GHz</p>
<p>Pentium</p>
<p>1474345549.738220</p>
<p>3338528048.199998</p>
<p>1000000000.000000</p>
<p>0</p>
<p>0</p>
<p>0</p>
<p>12738307504.656250</p>
Microsoft Windows XP
Home Edition, Service Pack 2, (05.01.2600.00)
527941632.000000
1000000.000000
1543565312.000000
75563532288.000000
48687480832.000000
04my04aa.2277.8080.715898.98_8
29415.109375
0
5
403
-
4.05
-
04my04aa.2277.8080.715898.98_8_0
12794.000000
65536.000000
11b63321703f7bc391f5c36ac16bec25
http://setiboincdata.ssl.berkeley.edu/sah_cgi/file_upload_handler
-
25ap04aa.23553.18256.484658.58_4
29614.171875
0
5
403
-
4.05
-
25ap04aa.23553.18256.484658.58_4_0
15821.000000
65536.000000
ed2f6c120592c63366e3522dddb578fa
http://setiboincdata.ssl.berkeley.edu/sah_cgi/file_upload_handler
-
04my04aa.25870.19664.878390.184_0
0.000000
0
5
403
-
Everything good, I suppose, up to this point. Now here is where the problem began:
4.05
SETI@Home Informational message -9 result_overflow NOTE: The number of results detected exceeds the storage space allocated.
-
04my04aa.25870.19664.878390.184_0_0
21314.000000
65536.000000
0cb691f603b1069e46c12fae6aea21b7
http://setiboincdata.ssl.berkeley.edu/sah_cgi/file_upload_handler
-
04my04aa.25870.19664.878390.175_2
0.000000
-185
5
403
-
4.05
Couldn't restart the app for this result: -144
7
0
-144
-
04my04aa.25870.19664.878390.176_0
0.000000
-185
5
403
-
4.05
Couldn't start the app for this result: error -144
7
0
-144
-
04my04aa.25870.19664.878390.157_2
0.000000
-185
5
403
-
4.05
Couldn't start the app for this result: error -144
7
0
-144
This happened for 56 more WUs; the only difference was the WU name. Then the next message was:
-
4.05
file transfer error: couldn't get input files: 26ap04ab.12624.1298.642322.78: File downloaded was not the correct file or was garbage from bad URL
0
0
-
05my04aa.14696.1840.84666.215_1
0.000000
-185
5
403
-
4.05
Couldn't start the app for this result: error -144
7
0
-144
-
05my04aa.14696.1840.84666.209_2
0.000000
-185
5
403
-
4.05
Couldn't start the app for this result: error -144
7
0
-144
-
05my04ab.18320.1857.1022148.93_0
0.000000
-185
5
403
-
4.05
Couldn't start the app for this result: error -144
7
0
-144
-
26ap04ab.12624.1298.642322.75_1
0.000000
-185
5
403
-
4.05
Couldn't start the app for this result: error -144
7
0
-144
-
05my04aa.14696.1840.84666.205_1
0.000000
0
5
403
-
4.05
file transfer error: couldn't get input files: 05my04aa.14696.1840.84666.205: File downloaded was not the correct file or was garbage from bad URL
0
0
-
05my04ab.18320.1857.1022148.89_0
0.000000
0
5
403
-
4.05
file transfer error: couldn't get input files: 05my04ab.18320.1857.1022148.89: File downloaded was not the correct file or was garbage from bad URL
0
0
-
05my04ab.18320.1857.1022148.81_1
0.000000
0
5
403
-
4.05
file transfer error: couldn't get input files: 05my04ab.18320.1857.1022148.81: File downloaded was not the correct file or was garbage from bad URL
0
0
-
05my04ab.18320.1857.1022148.83_1
0.000000
0
5
403
-
4.05
file transfer error: couldn't get input files: 05my04ab.18320.1857.1022148.83: File downloaded was not the correct file or was garbage from bad URL
0
0
-
26ap04ab.12624.1298.642322.77_1
0.000000
0
5
403
-
4.05
file transfer error: couldn't get input files: 26ap04ab.12624.1298.642322.77: File downloaded was not the correct file or was garbage from bad URL
0
0
-
05my04aa.14696.1840.84666.206_1
0.000000
0
5
403
-
4.05
file transfer error: couldn't get input files: 05my04aa.14696.1840.84666.206: File downloaded was not the correct file or was garbage from bad URL
0
0
-
05my04ab.18320.1857.1022148.77_2
0.000000
0
5
403
-
4.05
file transfer error: couldn't get input files: 05my04ab.18320.1857.1022148.77: File downloaded was not the correct file or was garbage from bad URL
0
0
-
05my04ab.18320.1857.1022148.78_2
0.000000
0
5
403
-
4.05
file transfer error: couldn't get input files: 05my04ab.18320.1857.1022148.78: File downloaded was not the correct file or was garbage from bad URL
0
0
This is the same thing that happened when I thought it could be a conflict with McAfee. The difference is that I now have automatic updates disabled in McAfee, and I had network communication disabled in BOINC. I was signed on to America Online, my computer had 3 connection interruptions during the night, and the only additional program running was the "Eye of The Storm" hurricane tracker, which was uploading tracking information every 20 minutes.


Sorry for so long a post, but this is the complete file. I hope someone can offer some help in resolving this issue.
Is Win XP also involved in the 4.07 update, or is this something new?
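
With this many results in one request document, a quick tally helps show how the failures split between the two error types. This is only a sketch in Python, assuming the message wording seen in the dump above. The function name tally_errors and the category labels are invented for the example.

```python
import re
from collections import Counter

# "Couldn't start the app for this result: error -144"
# "Couldn't restart the app for this result: -144"
START_RE = re.compile(
    r"^Couldn't (?:re)?start the app for this result: (?:error )?(-?\d+)"
)

# "file transfer error: couldn't get input files: <name>: File downloaded ..."
XFER_RE = re.compile(r"^file transfer error: couldn't get input files: ([^:]+):")


def tally_errors(lines):
    """Count start failures by error code and list files that failed to download."""
    counts = Counter()
    bad_files = []
    for line in lines:
        text = line.strip()
        match = START_RE.match(text)
        if match:
            counts["app start failure " + match.group(1)] += 1
            continue
        match = XFER_RE.match(text)
        if match:
            counts["file transfer error"] += 1
            bad_files.append(match.group(1))
    return counts, bad_files
```

Run over the saved request document, it returns the number of -144 start failures and the names of the input files that came down corrupted, which is shorter to post than the whole file.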

Warm Regards To All,

Rocky

ID: 21817
Link
Volunteer tester
Joined: 20 May 99
Posts: 22
Credit: 1,192,239
RAC: 0
United States
Message 21870 - Posted: 3 Sep 2004, 15:18:27 UTC

Nobody has stated, so I'll ask: are these problems occurring with the GUI or CLI clients?

I'm running CLI on 37 separate systems and have not seen this issue. I am not running any of the GUI clients, though, so I have nothing to compare to.
ID: 21870
mlcudd
Volunteer tester
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 21871 - Posted: 3 Sep 2004, 15:23:44 UTC
Last modified: 3 Sep 2004, 15:33:06 UTC

Hi All,
I have noticed that when BOINC stops acting properly, it is always on the computer that I am downloading other things on, i.e. my wife's work files, new programs, hurricane tracker updates, McAfee updates (covered in another thread).
If I enable BOINC's network settings and BOINC attempts to switch over to crunch a new WU while any other file is downloading, BOINC stops crunching.

Results... catastrophic.

Any additional thoughts? I do an awful lot of downloading, and this is apparently the only common denominator that exists.

Let's hope for a better today and tomorrow!

Regards To All,

Rocky Cudd

By the way, Link, I use CLI.

ID: 21871
texasfit
Joined: 11 May 03
Posts: 223
Credit: 500,626
RAC: 0
United States
Message 21873 - Posted: 3 Sep 2004, 15:36:15 UTC - in response to Message 21871.  
Last modified: 3 Sep 2004, 15:36:51 UTC

> Link,
>
> CLI
>
>
> Rocky
>
>

For what it's worth, I have two systems with WinXP running the GUI that have not had any problems running BOINC 4.05. One has SP2 and the other is still running SP1. I have been running v4.05 from the first day that it was available. I am running just SETI at this time and am not attached to any other projects; being attached to other projects may be causing some of the issues that others are seeing.

ID: 21873
M4rtyn
Volunteer tester
Joined: 4 Aug 03
Posts: 48
Credit: 799,965
RAC: 0
United Kingdom
Message 21877 - Posted: 3 Sep 2004, 15:50:51 UTC - in response to Message 21873.  
Last modified: 3 Sep 2004, 15:51:21 UTC

> > Link,
> >
> > CLI
> >
> >
> > Rocky
> >
> >
>
> For what it's worth. I have two systems with WinXP running the GUI that have
> not had any problems running 4.05 BOINC. One has SP2 and the other is still
> running SP1. I have been running v4.05 from the first day that it was
> available. I am running just SETI at this time and not attached to any other
> projects, which may be causing some of the issues that others are seeing.
>

I also only run SETI but still get the same problem.
I have noticed it may be intermittent, as some days it happens and others not,
even though I have not changed anything.


M4rtyn
ID: 21877
JAF
Joined: 9 Aug 00
Posts: 289
Credit: 168,721
RAC: 0
United States
Message 21880 - Posted: 3 Sep 2004, 16:04:44 UTC

Some added information: When my computer stopped crunching, I believe it was during the outage they had yesterday. It was also one of the few times I kept my network access active. Being on dial-up, I usually deactivate it most of the time.

I am using AVG anti-virus, so McAfee doesn't seem to be the cause (though it could be a general anti-virus software problem).

There's an interesting thread on this board:

http://setiweb.ssl.berkeley.edu/forum_thread.php?id=3656

Search for debt - It would be interesting to check that variable when one of us has BOINC Seti hang (before exiting and restarting).

ID: 21880
mlcudd
Volunteer tester
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 21932 - Posted: 3 Sep 2004, 18:43:13 UTC

Hi JAF,
Can you explain to me what I should look for in your statement:

>Search for debt - It would be interesting to check that variable when one of us has BOINC Seti hang (before exiting and restarting).

I will check whatever I can to help.

Regards,

Rocky

ID: 21932
JAF
Joined: 9 Aug 00
Posts: 289
Credit: 168,721
RAC: 0
United States
Message 21942 - Posted: 3 Sep 2004, 18:53:34 UTC - in response to Message 21932.  

> Hi JAf,
> Can you explain to me what i should look for in your statement:
>
> >Search for debt - It would be interesting to check that variable when one
> of us has BOINC Seti hang (before exiting and restarting).
>
> I will check whatever I can to help.
>
> Regards,
>
> Rocky
>
ID: 21942
JAF
Joined: 9 Aug 00
Posts: 289
Credit: 168,721
RAC: 0
United States
Message 21947 - Posted: 3 Sep 2004, 19:02:27 UTC - in response to Message 21942.  

In your Boinc directory, there's a file called client_state.xml. Viewing the contents of the file, you should see a debt entry toward the end of the block whose value looks like: 0.000000

I'm not sure that it has any significance for the freeze-up problem. I wasn't able to check my notebook earlier when I posted the above message. But now I see that debt is 0.0 and my computer is crunching away. My desktop was something like 28435.0 and it was crunching.

I suspect (this is just a guess and someone here can set me straight) that the debt value is zero when one has a full quota of WUs.
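
If you would rather not scroll through the whole file, a few lines of Python can pull out every line that mentions debt. This is only a sketch: it assumes you pass it the path to your own client_state.xml, and it does a plain text search for the word "debt" rather than assuming any particular element name. The function name find_debt_lines is made up for this example.

```python
import sys


def find_debt_lines(path):
    """Return (line_number, text) for every line that mentions 'debt'."""
    hits = []
    # errors="replace" keeps the search going if the file has odd bytes.
    with open(path, encoding="utf-8", errors="replace") as handle:
        for number, line in enumerate(handle, start=1):
            if "debt" in line.lower():
                hits.append((number, line.strip()))
    return hits


if __name__ == "__main__":
    # Usage: python find_debt.py <path to client_state.xml>
    if len(sys.argv) > 1:
        for number, text in find_debt_lines(sys.argv[1]):
            print(number, text)
```

Running it once while BOINC is hung and again after a restart would give two sets of values to compare, which is the check suggested earlier in the thread.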
ID: 21947
mlcudd
Volunteer tester
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 21966 - Posted: 3 Sep 2004, 19:41:20 UTC
Last modified: 3 Sep 2004, 19:59:11 UTC

Ok JAF,
I found what you are talking about, and my debt is at zero right now, but all my WUs are in a "ready to report" state and have not been crunched yet.
I would like to know what the "XML SIG" is. It shows as a 256-character alphanumeric line located on the same page several times.

Regards,

Rocky
ID: 21966
mlcudd
Volunteer tester
Joined: 11 Apr 03
Posts: 782
Credit: 63,647
RAC: 0
United States
Message 21980 - Posted: 3 Sep 2004, 20:01:46 UTC

Hi All,
I just uninstalled SP2. I don't know whether it has anything to do with the hang-ups, but I will say that I am moving through apps and pages at a much faster rate after the uninstall.
I know that it is a large program, but with the space I have it should not make any difference in performance at all.

If SP2 has an issue of slowing down processes, could this have adverse effects on how other programs run?

Warm Regards,

Rocky
ID: 21980