What causes this ERROR and how to prevent?

Message boards : Number crunching : What causes this ERROR and how to prevent?

Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3252
Credit: 31,903,643
RAC: 0
Netherlands
Message 1006125 - Posted: 19 Jun 2010, 9:35:50 UTC - in response to Message 1005515.  

I removed the FLOPS entry some time ago, after installing BOINC 6.10.56.
It wasn't necessary after all: runtimes were estimated quite accurately after 2 or 3
MB (or AP) WUs had run.
(Also no 'room' for errors there.)

ID: 1006125
hiamps
Volunteer tester
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 1006244 - Posted: 19 Jun 2010, 16:04:51 UTC - in response to Message 1005296.  

...<message>
Maximum elapsed time exceeded
</message>


Please remove the flops estimates from your app info & restart Boinc. "someone" seems to be 'fiddling' with the server side fpop estimates ::S

I have never messed with my flops estimates, and I have been gone for a couple of days. Do I need to look for the flop counter?
Official Abuser of Boinc Buttons...
And no good credit hound!
ID: 1006244
jason_gee
Volunteer developer
Volunteer tester
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1006257 - Posted: 19 Jun 2010, 16:23:40 UTC - in response to Message 1006244.  
Last modified: 19 Jun 2010, 16:36:51 UTC

...<message>
Maximum elapsed time exceeded
</message>


Please remove the flops estimates from your app info & restart Boinc. "someone" seems to be 'fiddling' with the server side fpop estimates ::S

I have never messed with my flops estimates, and I have been gone for a couple of days. Do I need to look for the flop counter?


If you don't have any, don't worry about it ;) [Edit: ... Yet... I'm still trying to work out why, when CPU & CUDA multibeam estimates dial in and the task duration correction factor approaches 1, CPU Astropulse tasks with no flops entry are at least an order of magnitude out: about 17 times too long an estimate on a Core2 running optimised apps with no flops entries in place. Hopefully it's some server-side thing converging; otherwise it's really quite broken somewhere.]

Jason
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1006257
S@NL - John van Gorsel
Volunteer tester
Joined: 5 Jul 99
Posts: 193
Credit: 139,673,078
RAC: 0
Netherlands
Message 1006844 - Posted: 21 Jun 2010, 9:57:57 UTC - in response to Message 1005513.  


Removed BOINC and all its directories, including the work directory. Reinstalled everything, and am now waiting for work. It might be 3 hours before that happens.


Any results yet? I seem to have exactly the same problem on one of my PCs. Since June 19, ak_v8_win_ssse3x.exe seems to crash with every CPU unit at exactly 2 hr 6 min 11 sec (around 96% of completion).

For some reason, the failed units were reported and immediately disappeared from the list. There are only 3 CPU units listed under errors for this host, and two of these results appear to be failed GPU units. The Windows Event Log lists a total of 30 crashes of ak_v8_win_ssse3x.exe.

Fortunately (;-) there are upload problems, so the newly crashed CPU tasks are not reported. I looked up the status in the client_state file: they failed with exit code -177, and stderr_out for the failed units starts with: Maximum elapsed time exceeded
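
For anyone wondering why every task dies at the same elapsed time, here is a minimal Python sketch. It assumes (this is a common understanding of the BOINC client, not something confirmed in this thread) that the client aborts a task with -177 once its elapsed time exceeds the workunit's rsc_fpops_bound divided by the flops value in use for the app version. The function names and all numbers are made up for illustration.

```python
def max_elapsed_seconds(rsc_fpops_bound: float, flops: float) -> float:
    """Assumed limit: the task's operation bound divided by the app's claimed speed."""
    if flops <= 0:
        raise ValueError("flops must be positive")
    return rsc_fpops_bound / flops


def would_abort(elapsed_seconds: float, rsc_fpops_bound: float, flops: float) -> bool:
    """True if the (assumed) 'Maximum elapsed time exceeded' check would trigger."""
    return elapsed_seconds > max_elapsed_seconds(rsc_fpops_bound, flops)


# Hypothetical values, not taken from any real workunit or host.
bound = 3.0e14           # rsc_fpops_bound for the task
honest_flops = 2.0e10    # speed the host really achieves
inflated_flops = 8.0e10  # over-optimistic flops figure

print(max_elapsed_seconds(bound, honest_flops))    # limit with a realistic figure
print(max_elapsed_seconds(bound, inflated_flops))  # limit shrinks as flops grows
```

Under that assumption, a flops figure that is too high (or a bound that is scaled too low on the server side) shortens the allowed run time, so a task that would otherwise finish gets killed just before completion.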

The host in question has been running in this configuration for around 14 months without any problems.

Since this host is wasting a lot of tasks anyway I will start by removing the flops from the app_info file and see what happens...
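
Removing the entries by hand in a text editor works fine, but for several hosts it could be scripted. A rough Python sketch follows, assuming app_info.xml keeps its flops values as `<flops>` elements inside `<app_version>` blocks; the sample content and the `strip_flops` helper are illustrative, not a real file or an official tool. Stop BOINC and back up the file before trying anything like this.

```python
import xml.etree.ElementTree as ET


def strip_flops(app_info_xml: str) -> str:
    """Return the XML text with every <flops> child of <app_version> removed."""
    root = ET.fromstring(app_info_xml)
    # list() so the tree is not modified while it is being walked
    for app_version in list(root.iter("app_version")):
        for flops in app_version.findall("flops"):
            app_version.remove(flops)
    return ET.tostring(root, encoding="unicode")


# Hypothetical, heavily trimmed app_info.xml content for illustration only.
SAMPLE = """<app_info>
  <app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>603</version_num>
    <flops>12345678901</flops>
  </app_version>
</app_info>"""

print(strip_flops(SAMPLE))
```

The output keeps everything else in the app_version block and only drops the flops line, which is the same change as deleting it manually.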



Seti@Netherlands website
ID: 1006844
N1LEF
Joined: 29 Jul 10
Posts: 4
Credit: 8,118
RAC: 0
United States
Message 1021032 - Posted: 31 Jul 2010, 1:53:25 UTC

I thought I would add this to this thread instead of starting a new one. I am getting a runtime error, not when starting the program but when trying to bring up the graphics for SETI. I get home.berkeley.edu\setigraphics_6.03_windows_intel86.exe. Any ideas? I am new to this, if you cannot tell :-)
ID: 1021032
Josef W. Segur
Volunteer developer
Volunteer tester
Joined: 30 Oct 99
Posts: 4504
Credit: 1,414,761
RAC: 0
United States
Message 1021052 - Posted: 31 Jul 2010, 3:16:12 UTC - in response to Message 1021032.  

I thought I would add this to this thread instead of starting a new one. I am getting a runtime error, not when starting the program but when trying to bring up the graphics for SETI. I get home.berkeley.edu\setigraphics_6.03_windows_intel86.exe. Any ideas? I am new to this, if you cannot tell :-)

For a BOINC 6.x installation, the fully qualified name of the program which is launched to show SETI@home graphics does indeed end with \projects\setiathome.berkeley.edu\setigraphics_6.03_windows_intelx86.exe. I don't recall other users having a runtime error; can you give any more detail? There are many Windows XP SP3 users here who can probably help.
                                                                   Joe
ID: 1021052
Neil Blaikie
Volunteer tester
Joined: 17 May 99
Posts: 143
Credit: 6,652,341
RAC: 0
Canada
Message 1021544 - Posted: 1 Aug 2010, 4:02:30 UTC

I would appreciate some help with this error as well

http://setiathome.berkeley.edu/result.php?resultid=1671699531.

All Cuda tasks are failing and yet nothing is showing as an error on my graphics card using any test software.

Thanks in advance
ID: 1021544
Gundolf Jahn
Joined: 19 Sep 00
Posts: 3184
Credit: 446,358
RAC: 0
Germany
Message 1021573 - Posted: 1 Aug 2010, 8:59:03 UTC - in response to Message 1021544.  

Does a reboot (power cycle) help? In case something is "stuck" in VRAM.

Regards,
Gundolf
ID: 1021573
MarkJ
Volunteer tester
Joined: 17 Feb 08
Posts: 1139
Credit: 80,854,192
RAC: 5
Australia
Message 1021615 - Posted: 1 Aug 2010, 12:22:04 UTC - in response to Message 1021544.  

I would appreciate some help with this error as well

http://setiathome.berkeley.edu/result.php?resultid=1671699531.

All Cuda tasks are failing and yet nothing is showing as an error on my graphics card using any test software.

Thanks in advance


The error messages point to your drivers/DLL files. I'd suggest you upgrade from the 191 driver to the current (258.96) drivers.

1. Set BOINC so it doesn't auto-start and shut it down.
2. Uninstall the old drivers via control panel, which will require a reboot.
3. When Win 7 comes up it will automatically reinstall basic driver functionality and want another reboot.
4. After that, install the "proper" drivers (download from www.nvidia.com/drivers) and reboot yet again.

You should be right to start BOINC up after all that.
BOINC blog
ID: 1021615
S@NL - eFMer - efmer.com/boinc
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 148,746,305
RAC: 0
United States
Message 1021623 - Posted: 1 Aug 2010, 12:49:38 UTC - in response to Message 1021615.  

Use
http://www.efmer.eu/forum_tt/index.php?topic=428.0
when you see the -177 errors.
Check the -177 in the expert tab and reschedule.
This should eliminate the -177 errors.

The runtime estimate correction on the server side seems to be running with mixed results.
I've seen correction ratios from 0.3 to 40.
Some of these corrections are way off and will cause -177 errors.
This will also move the DCF around a bit: one WU has a correction of 1 and another of 40. This can move the DCF from 0.1 to 3 or something like that.
TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.
ID: 1021623
Neil Blaikie
Volunteer tester
Joined: 17 May 99
Posts: 143
Credit: 6,652,341
RAC: 0
Canada
Message 1021646 - Posted: 1 Aug 2010, 14:26:42 UTC - in response to Message 1021615.  

Thank you for the help but it seems like there is still something wrong.

Still getting computation errors; any further help appreciated.

I might have to revert to my old card and only use the GTS250 for gaming purposes rather than CUDA. Ah well.
ID: 1021646
N1LEF
Joined: 29 Jul 10
Posts: 4
Credit: 8,118
RAC: 0
United States
Message 1021796 - Posted: 1 Aug 2010, 22:03:23 UTC

I fixed my problem. If anyone out there ends up with a runtime error having to do with the graphics and has put their own background image in the custom settings, check the path. I took mine out and it is doing fine now.
ID: 1021796
James Sotherden
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 1021843 - Posted: 2 Aug 2010, 4:31:04 UTC

Here is an error code I haven't seen: -529697949 (0xffffffffe06d7363)

Is there a master list somewhere of codes we can access?

Old James
ID: 1021843
Miep
Volunteer moderator
Joined: 23 Jul 99
Posts: 2412
Credit: 351,996
RAC: 0
Message 1022055 - Posted: 2 Aug 2010, 21:40:14 UTC - in response to Message 1021843.  

The BOINC FAQ holds a partial list, but I couldn't find that one.
Carola
-------
I'm multilingual - I can misunderstand people in several languages!
ID: 1022055
Sutaru Tsureku
Volunteer tester
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1022057 - Posted: 2 Aug 2010, 21:50:43 UTC - in response to Message 1021843.  
Last modified: 2 Aug 2010, 21:57:12 UTC

Here is an error code I haven't seen: -529697949 (0xffffffffe06d7363)

Is there a master list somewhere of codes we can access?


You didn't post a URL.
IIRC this is a bug in Raistmer's CUDA build. Sometimes the detection/output meant to handle the -12 error fails and gives this -529697949 'out of mem' error instead.

But sometimes this 'out of mem' error is also caused by your GPU: overheating, too much OC, faulty GPU/RAM, or whatever.

So if you have an 'out of mem' error and your CUDA wingman also has a -12 or -529697949 error, there is nothing to worry about.


Maybe it would be good to add this error value to the BOINC FAQ.
But I would guess it is already there (I don't have the URL to hand ;-).
ID: 1022057
Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 1022058 - Posted: 2 Aug 2010, 21:52:33 UTC - in response to Message 1022055.  

Not too weird; not all errors are in there. Most of these errors can be found via Google or the Microsoft Knowledge Base, though.

Taking for instance -529697949 (0xffffffffe06d7363): take the number between the parentheses, strip the leading 'f's, and you end up with the error you can look up in Google or the MSKB. In this case it's 0xE06D7363.

The error starts with an E, which indicates an exception; this one is a Microsoft C++ exception code. Where it dropped out is recorded in the stderr.txt that got sent back to the project. It would be nice to have a link to the actual task, if you can find it. It's too bad that the replica database is still offline, which makes tracking this error quite difficult at this time.
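
The conversion described above can also be done mechanically. A small Python sketch, assuming the exit status is a signed 32-bit value that the results page prints with sign-extended leading f's (the helper names are made up for this example):

```python
def exit_code_to_hex(exit_code: int) -> str:
    """Map a signed exit code to its unsigned 32-bit hex form, handy for searching."""
    return f"0x{exit_code & 0xFFFFFFFF:08X}"


def hex_to_exit_code(text: str) -> int:
    """Inverse: read a (possibly sign-extended) hex string as a signed 32-bit int."""
    value = int(text, 16) & 0xFFFFFFFF
    return value - 0x100000000 if value & 0x80000000 else value


print(exit_code_to_hex(-529697949))            # 0xE06D7363
print(hex_to_exit_code("0xffffffffe06d7363"))  # -529697949
```

Masking with 0xFFFFFFFF is just the "take the f's off" step: it keeps the low 32 bits and drops the sign extension.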
ID: 1022058
Jim_S
Posts: 4705
Credit: 64,560,357
RAC: 31
United States
Message 1022059 - Posted: 2 Aug 2010, 21:53:09 UTC - in response to Message 1021843.  

Here is an error code I haven't seen: -529697949 (0xffffffffe06d7363)

Is there a master list somewhere of codes we can access?

I Googled 0xffffffffe06d7363 and found several responses...Too many to put here. You can try that and see if any of them fits your problem.

I Desire Peace and Justice, Jim Scott (Mod-Ret.)
ID: 1022059
Miep
Volunteer moderator
Joined: 23 Jul 99
Posts: 2412
Credit: 351,996
RAC: 0
Message 1022062 - Posted: 2 Aug 2010, 21:58:23 UTC - in response to Message 1022057.  

Maybe it would be good to add this error value to the BOINC FAQ.
But I would guess it is already there (I don't have the URL to hand ;-).


I usually pick it out of Ageless' signature. That poses the slightly smaller problem of remembering where I've seen him post recently :D
Carola
-------
I'm multilingual - I can misunderstand people in several languages!
ID: 1022062
Sutaru Tsureku
Volunteer tester
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1022063 - Posted: 2 Aug 2010, 22:01:59 UTC - in response to Message 1022057.  

With a quick forum search here, I found my old Message 976963.

ID: 1022063
Richard Haselgrove
Volunteer tester
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1022065 - Posted: 2 Aug 2010, 22:07:11 UTC

Anyone like to google me "Exit status -108 (0xffffffffffffff94)" ?

Not one of the usual suspects, but plenty to choose from on host 1791152. Yes, that's the same one I confirmed "HTTP Internal Server Error" on a day or two ago (losing track of time). Also blowing all QuantumFIRE Alpha tasks similarly - Einstein is the only project running normally, though slowly.

Trouble is, I've got a long double cross-country drive tomorrow, and won't have a chance to solve it. All googlers greatly appreciated.

And the strange thing is, it's my daily driver - this post, all message boards, all email, come from it. I see no problems at all as I'm typing this. Only BOINC is affected. And the only thing I've done to BOINC is to remove Astropulse v5.00 from app_info......
ID: 1022065
 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.