What causes this ERROR and how to prevent?



Message boards : Number crunching : What causes this ERROR and how to prevent?

Profile jason_gee
Volunteer developer
Volunteer tester
Joined: 24 Nov 06
Posts: 5051
Credit: 73,832,145
RAC: 12,306
Australia
Message 1005515 - Posted: 18 Jun 2010, 4:21:07 UTC

Alright, let us know how it goes from scratch then.... Surely it shouldn't behave weirdly from a blank slate :-X. ( If it does, there's always some poking around we can do, and find relevant info to report to Berkeley )

Jason
____________
"It is not the strongest of the species that survives, nor the most intelligent that survives. It is the one that is the most adaptable to change."
Charles Darwin

Profile Fred J. Verster
Volunteer tester
Joined: 21 Apr 04
Posts: 3250
Credit: 31,890,317
RAC: 4,151
Netherlands
Message 1006125 - Posted: 19 Jun 2010, 9:35:50 UTC - in response to Message 1005515.

I removed the FLOPS entry some time ago, after installing BOINC 6.10.56;
it wasn't necessary after all. Runtimes were estimated quite correctly after 2 or 3
MB (or AP) WUs had run.
(Also no 'room' for errors there.)

____________

Profile hiamps
Volunteer tester
Joined: 23 May 99
Posts: 4292
Credit: 72,971,319
RAC: 0
United States
Message 1006244 - Posted: 19 Jun 2010, 16:04:51 UTC - in response to Message 1005296.

...<message>
Maximum elapsed time exceeded
</message>


Please remove the flops estimates from your app info & restart Boinc. "someone" seems to be 'fiddling' with the server side fpop estimates ::S

I have never messed with my flops estimates, and I have been gone for a couple of days. Do I need to look for the flop counter?
____________
Official Abuser of Boinc Buttons...
And no good credit hound!

Profile jason_gee
Volunteer developer
Volunteer tester
Joined: 24 Nov 06
Posts: 5051
Credit: 73,832,145
RAC: 12,306
Australia
Message 1006257 - Posted: 19 Jun 2010, 16:23:40 UTC - in response to Message 1006244.
Last modified: 19 Jun 2010, 16:36:51 UTC

...<message>
Maximum elapsed time exceeded
</message>


Please remove the flops estimates from your app info & restart Boinc. "someone" seems to be 'fiddling' with the server side fpop estimates ::S

I have never messed with my flops estimates, and I have been gone for a couple of days. Do I need to look for the flop counter?


If you don't have any, don't worry about it ;) [Edit: ... Yet... I'm still trying to work out why, when the CPU & CUDA multibeam estimates dial in and the task duration correction factor approaches 1, CPU Astropulse tasks with no flops entry are at least an order of magnitude out: about 17 times too long an estimate with a Core2 running optimised and no flops entries in place. Hopefully it's some server-side thing converging; otherwise it's really quite broken somewhere.]

Jason
____________
"It is not the strongest of the species that survives, nor the most intelligent that survives. It is the one that is the most adaptable to change."
Charles Darwin

S@NL - John van Gorsel
Project donor
Volunteer tester
Joined: 5 Jul 99
Posts: 190
Credit: 137,592,747
RAC: 6,728
Netherlands
Message 1006844 - Posted: 21 Jun 2010, 9:57:57 UTC - in response to Message 1005513.


Removed BOINC and all its directories, including the work directory. Reinstalled everything, waiting now for work. Might be 3 hours before that happens.


Any results yet? I seem to have exactly the same problem on one of my PCs. Since June 19, ak_v8_win_ssse3x.exe seems to crash with every CPU unit at exactly 2 hr 6 min 11 sec (around 96% of completion).

For some reason, the failed units were reported and immediately disappeared from the list. There are only 3 CPU units listed under errors for this host, and two of these results appear to be failed GPU units. The Windows Event Log lists a total of 30 crashes of ak_v8_win_ssse3x.exe.

Fortunately (;-) there are upload problems, so the newly crashed CPU tasks are not reported. I looked up the status in the client_state file: they failed with exit code -177, and stderr_out for the failed units starts with: Maximum elapsed time exceeded

This host in question has been running in this configuration for around 14 months without any problems.

Since this host is wasting a lot of tasks anyway I will start by removing the flops from the app_info file and see what happens...
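
For anyone wanting to do the same lookup without scrolling through the file by hand, here is a minimal sketch. It assumes the client state is XML with one `<result>` element per task, each carrying a `<name>` and an `<exit_status>`; those element names, the sample data, and the helper `results_with_exit_status` are assumptions made for illustration, not something confirmed in this thread. A real client_state file may also not parse cleanly with a strict XML parser.

```python
import xml.etree.ElementTree as ET

# Made-up sample in the ASSUMED layout; task names are hypothetical.
SAMPLE = """\
<client_state>
  <result>
    <name>hypothetical_task_0</name>
    <exit_status>-177</exit_status>
  </result>
  <result>
    <name>hypothetical_task_1</name>
    <exit_status>0</exit_status>
  </result>
</client_state>
"""


def results_with_exit_status(xml_text, wanted):
    """Return the names of all <result> entries whose exit status equals `wanted`."""
    root = ET.fromstring(xml_text)
    names = []
    for result in root.iter("result"):
        status = result.findtext("exit_status")
        if status is not None and int(status.strip()) == wanted:
            names.append(result.findtext("name", default="(unnamed)"))
    return names


print(results_with_exit_status(SAMPLE, -177))
```

To try it on a real file, read the file's text and pass it in place of `SAMPLE`; work on a copy, and only while BOINC is shut down.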

____________


Seti@Netherlands website

N1LEF
Joined: 29 Jul 10
Posts: 4
Credit: 8,118
RAC: 0
United States
Message 1021032 - Posted: 31 Jul 2010, 1:53:25 UTC

I thought I would add this to this thread instead of making a new one. I am getting a runtime error, but mine occurs not when starting the program but when trying to bring up the graphics for SETI. I get home.berkeley.edu\setigraphics_6.03_windows_intel86.exe. Any ideas? I am new to this, if you cannot tell :-)

Josef W. Segur
Project donor
Volunteer developer
Volunteer tester
Joined: 30 Oct 99
Posts: 4301
Credit: 1,070,204
RAC: 1,104
United States
Message 1021052 - Posted: 31 Jul 2010, 3:16:12 UTC - in response to Message 1021032.

I thought I would add this to this thread instead of making a new one. I am getting a runtime error, but mine occurs not when starting the program but when trying to bring up the graphics for SETI. I get home.berkeley.edu\setigraphics_6.03_windows_intel86.exe. Any ideas? I am new to this, if you cannot tell :-)

For a BOINC 6.x installation, the fully qualified name of the program launched to show SETI@home graphics does indeed end with \projects\setiathome.berkeley.edu\setigraphics_6.03_windows_intelx86.exe. I don't recall other users having a runtime error; can you give any more detail? There are many Windows XP SP3 users here who can probably help.
Joe

Profile Neil Blaikie
Volunteer tester
Joined: 17 May 99
Posts: 142
Credit: 6,587,948
RAC: 8,235
Canada
Message 1021544 - Posted: 1 Aug 2010, 4:02:30 UTC

I would appreciate some help with this error as well

http://setiathome.berkeley.edu/result.php?resultid=1671699531.

All CUDA tasks are failing, and yet no test software shows any error on my graphics card.

Thanks in advance
____________

Profile Gundolf Jahn
Joined: 19 Sep 00
Posts: 3184
Credit: 359,239
RAC: 27
Germany
Message 1021573 - Posted: 1 Aug 2010, 8:59:03 UTC - in response to Message 1021544.

Does a reboot (power cycle) help? In case something is "stuck" in VRAM.

Gruß,
Gundolf

Profile MarkJ
Project donor
Volunteer tester
Joined: 17 Feb 08
Posts: 941
Credit: 25,100,093
RAC: 29,235
Australia
Message 1021615 - Posted: 1 Aug 2010, 12:22:04 UTC - in response to Message 1021544.

I would appreciate some help with this error as well

http://setiathome.berkeley.edu/result.php?resultid=1671699531.

All CUDA tasks are failing, and yet no test software shows any error on my graphics card.

Thanks in advance


The error messages point to your drivers/DLL files. I'd suggest you upgrade from the 191 driver to the current (258.96) drivers.

1. Set BOINC so it doesn't auto-start and shut it down.
2. Uninstall the old drivers via control panel, which will require a reboot.
3. When Win 7 comes up it will automatically reinstall basic driver functionality and want another reboot.
4. After that, install the "proper" drivers (download from www.nvidia.com/drivers) and reboot yet again.

You should be right to start BOINC up after all that.
____________
BOINC blog

Profile S@NL - eFMer - efmer.com/boinc
Project donor
Volunteer tester
Joined: 7 Jun 99
Posts: 512
Credit: 130,027,012
RAC: 34,977
United States
Message 1021623 - Posted: 1 Aug 2010, 12:49:38 UTC - in response to Message 1021615.

Use the tool at
http://www.efmer.eu/forum_tt/index.php?topic=428.0
when you see the -177 errors.
Check the -177 in the expert tab and reschedule.
This should eliminate the -177 errors.

The runtime estimate correction on the server side seems to be running with mixed results.
I've seen correction ratios from 0.3 to 40.
Some of these corrections are way off and will cause -177 errors.
This will also move the DCF around a bit: one WU has a correction of 1 and another of 40. This can move the DCF from 0.1 to 3 or something like that.
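
A rough sketch of why a correction that is way off can end in a -177. It assumes the client aborts a task once its elapsed time passes the work unit's fpops bound divided by the speed (flops) assumed for the application; that model, the function names, and every number below are illustrative assumptions, not taken from the BOINC source or from any real work unit.

```python
def max_elapsed_seconds(rsc_fpops_bound: float, flops: float) -> float:
    """Time limit under the ASSUMED model: operation bound / assumed speed."""
    return rsc_fpops_bound / flops


def would_hit_177(actual_seconds: float, rsc_fpops_bound: float, flops: float) -> bool:
    """True if the task would still be running when the assumed limit is reached."""
    return actual_seconds > max_elapsed_seconds(rsc_fpops_bound, flops)


# Hypothetical work unit: the bound is set to 10x the estimated operation count.
FPOPS_EST = 3.0e13
FPOPS_BOUND = 10 * FPOPS_EST

REAL_FLOPS = 5.0e9                        # what the host really sustains (made up)
ACTUAL_SECONDS = FPOPS_EST / REAL_FLOPS   # 6000 s of real crunching

# Speed assumed correctly: the limit is 60000 s, so the task finishes in time.
print(would_hit_177(ACTUAL_SECONDS, FPOPS_BOUND, REAL_FLOPS))

# Speed overstated 40x (the largest ratio mentioned above): the limit shrinks to
# 1500 s, well short of the 6000 s the task needs.
print(would_hit_177(ACTUAL_SECONDS, FPOPS_BOUND, 40 * REAL_FLOPS))
```

Under these assumptions the second case is the one that fails: nothing is wrong with the task itself, only with the speed the limit was derived from.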
____________
TThrottle Control your temperatures. BoincTasks The best way to view BOINC. Anza Borrego Desert hiking.

Profile Neil Blaikie
Volunteer tester
Joined: 17 May 99
Posts: 142
Credit: 6,587,948
RAC: 8,235
Canada
Message 1021646 - Posted: 1 Aug 2010, 14:26:42 UTC - in response to Message 1021615.

Thank you for the help, but it seems like there is still something wrong.

Still getting computation errors; any further help appreciated.

Might have to revert to my old card and only use the GTS250 for gaming purposes rather than CUDA. Ah well
____________

N1LEF
Joined: 29 Jul 10
Posts: 4
Credit: 8,118
RAC: 0
United States
Message 1021796 - Posted: 1 Aug 2010, 22:03:23 UTC

I fixed my problem. If anyone out there ends up with a runtime error having to do with the graphics and has put their own background image in the custom settings, check the path. I took mine out and it is doing fine now.

Profile James Sotherden
Joined: 16 May 99
Posts: 8907
Credit: 35,870,732
RAC: 43,346
United States
Message 1021843 - Posted: 2 Aug 2010, 4:31:04 UTC

Here is an error code I haven't seen: -529697949 (0xffffffffe06d7363)

Is there a master list of codes somewhere that we can access?
____________

Old James

Profile Miep
Project donor
Volunteer moderator
Joined: 23 Jul 99
Posts: 2411
Credit: 351,996
RAC: 0
Message 1022055 - Posted: 2 Aug 2010, 21:40:14 UTC - in response to Message 1021843.

The boinc faq holds a partial list, but I couldn't find that one.
____________
Carola
-------
I'm multilingual - I can misunderstand people in several languages!

Profile [seti.international] Dirk Sadowski
Project donor
Volunteer tester
Joined: 6 Apr 07
Posts: 7101
Credit: 60,895,985
RAC: 17,203
Germany
Message 1022057 - Posted: 2 Aug 2010, 21:50:43 UTC - in response to Message 1021843.
Last modified: 2 Aug 2010, 21:57:12 UTC

Here is an error code I haven't seen: -529697949 (0xffffffffe06d7363)

Is there a master list of codes somewhere that we can access?


You didn't post a URL.
IIRC this is a bug in Raistmer's CUDA build. Sometimes the detection/output meant to handle the -12 error fails and gives this -529697949 'out of mem' error instead.

But sometimes this 'out of mem' error also happens because of your GPU: overheating, too much OC, faulty GPU/RAM, or whatever.

So if you have an 'out of mem' error and your CUDA wingman also has a -12 or -529697949 error, then there is nothing to worry about.


Maybe it would be good to add this error value to the BOINC FAQ.
But I would guess it is already there (I don't have the URL in my head ;-).
____________
BR

SETI@home Needs your Help ... $10 & U get a Star!

Team seti.international

Das Deutsche Cafe. The German Cafe.

Profile Ageless
Joined: 9 Jun 99
Posts: 12324
Credit: 2,629,532
RAC: 1,085
Netherlands
Message 1022058 - Posted: 2 Aug 2010, 21:52:33 UTC - in response to Message 1022055.

Not too weird; not all errors are in there. Most of these errors can be found via Google or the Microsoft Knowledge Base, though.

Taking for instance -529697949 (0xffffffffe06d7363): you take the number between the parentheses, take all those leading 'f's off, and you end up with the error you can look up in Google or the MSKB. In this case it's 0xE06D7363.

The error starts with an E, which is an Exception, so it's a Microsoft C++ exception code. Where it dropped is in the stderr.txt that got sent back to the project. It would be nice to have a link to the actual task, if you can find it. It's too bad that the replica database is still offline, which makes tracking this error quite difficult at this time.
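
The conversion described above can be checked in a few lines. This is only a sketch: the helper names `to_unsigned32` and `low_bytes_as_ascii` are made up for illustration. The last step relies on something not stated in this thread, namely that the low three bytes of this particular code are the ASCII letters "msc".

```python
def to_unsigned32(exit_code: int) -> int:
    """Reinterpret a signed exit code as an unsigned 32-bit value."""
    return exit_code & 0xFFFFFFFF


def low_bytes_as_ascii(code: int) -> str:
    """Decode the low three bytes of a 32-bit code as ASCII text."""
    return (code & 0xFFFFFF).to_bytes(3, "big").decode("ascii")


# The decimal form reported by BOINC and the sign-extended hex form
# both reduce to the same 32-bit value.
from_decimal = to_unsigned32(-529697949)
from_hex = to_unsigned32(int("0xffffffffe06d7363", 16))

print(hex(from_decimal))                 # 0xe06d7363
print(from_decimal == from_hex)          # True
print(low_bytes_as_ascii(from_decimal))  # msc
```

Masking with 0xFFFFFFFF does the same job as stripping the leading 'f's by hand, and it also works when only the negative decimal number is available.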
____________
Jord

Fighting for the correct use of the apostrophe, together with Weird Al Yankovic

Profile Jim_S
Project donor
Joined: 23 Feb 00
Posts: 4526
Credit: 18,813,286
RAC: 8,739
United States
Message 1022059 - Posted: 2 Aug 2010, 21:53:09 UTC - in response to Message 1021843.

Here is an error code I haven't seen: -529697949 (0xffffffffe06d7363)

Is there a master list of codes somewhere that we can access?

I Googled 0xffffffffe06d7363 and found several responses... too many to put here. You can try that and see if any of them fit your problem.
____________

I Desire Peace and Justice, Jim Scott (Mod-Ret.)

Profile Miep
Project donor
Volunteer moderator
Joined: 23 Jul 99
Posts: 2411
Credit: 351,996
RAC: 0
Message 1022062 - Posted: 2 Aug 2010, 21:58:23 UTC - in response to Message 1022057.

Maybe it would be good to add this error value to the BOINC FAQ.
But I would guess it is already there (I don't have the URL in my head ;-).


I usually pick it out of Ageless' signature. Poses the slightly smaller problem where I've seen him post recently :D
____________
Carola
-------
I'm multilingual - I can misunderstand people in several languages!

Profile [seti.international] Dirk Sadowski
Project donor
Volunteer tester
Joined: 6 Apr 07
Posts: 7101
Credit: 60,895,985
RAC: 17,203
Germany
Message 1022063 - Posted: 2 Aug 2010, 22:01:59 UTC - in response to Message 1022057.

With a quick forum search here, I found my old Message 976963.

____________
BR

SETI@home Needs your Help ... $10 & U get a Star!

Team seti.international

Das Deutsche Cafe. The German Cafe.


Copyright © 2014 University of California