194 (0xc2) EXIT_ABORTED_BY_CLIENT - "finish file present too long"

Message boards : Number crunching : 194 (0xc2) EXIT_ABORTED_BY_CLIENT - "finish file present too long"
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3

AuthorMessage
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1494026 - Posted: 23 Mar 2014, 15:03:09 UTC - in response to Message 1494025.  
Last modified: 23 Mar 2014, 15:10:37 UTC

Maybe it´s better to install Win 8.1 directly? Since it must have the most updated drivers or no?

Thanks for your tips and usual help.


If you do that, STILL make sure it puts proper Intel(R) chipset drivers ;)

[Edit:] Oh, SATA/RAID controller driver too...

on top of previous example from my mobo:
[Example Only] Here's the ones I had to force update here ( all Intel(R) devices ) :
- G33/G31/P35/P31 Express Chipset PCI express root port
- G33/G31/P35/P31 Express Chipset Processor to I/O controller
- ICH9 Family PCI express root port, 1, 5 & 6 (3 devices)
- SMBus Controller
- LPC Interface Controller
- Lots of Intel(R) USB Universal Host COntrollers

"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1494026 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1494031 - Posted: 23 Mar 2014, 15:27:23 UTC

If it has the same instalation bug then forget the Win 8.1 for now.

There is no other easy way to do that? Some kind of automatic general intel driver update program? Remember if that is the problem, i will need to do that on several hosts... and not all uses the same MB.
ID: 1494031 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1494096 - Posted: 23 Mar 2014, 18:57:05 UTC - in response to Message 1494031.  
Last modified: 23 Mar 2014, 18:59:01 UTC

If it has the same instalation bug then forget the Win 8.1 for now.

There is no other easy way to do that? Some kind of automatic general intel driver update program? Remember if that is the problem, i will need to do that on several hosts... and not all uses the same MB.


The auto installer will tend to only add the inf, and even the web detection thing say older drivers there are 'valid'. The logic with that is possibly along the lines of if the system is booting/running already, then make no automatic assumptions that could make the situation worse. i.e. update those if you know you need them.

Another possibility is that with such chipsets being so ubiquitous in many small variations, they (Intel) made a quality control nightmare for themselves, and have to limit potential spread of accident or misadventure.

The general situation is that many hosts may never need these looked at, and for the bulk of chipsets that predate the OS release, the MS ones would be fine.

I'd argue that special/unusual circumstances exist in gaming/enthusiast and other circles, such as yours for crunching with multiple GPUs, that warrant the extra attention, and push resources closer to limits at every level.

Probably you could compare the situation to something like a factory car can do just fine in a cross country race with much of the same original componentry, but you will want to toss a lot of extra weight, tweak a bit, and reflash the computer in it with something better than a Generic Microsoft fuel curve from 10 years ago.
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1494096 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1494167 - Posted: 23 Mar 2014, 21:20:56 UTC

Ok will do that manualy host by host, but sure not now to much alcohol in my system.
ID: 1494167 · Report as offensive
Profile Jeff Buck Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester

Send message
Joined: 11 Feb 00
Posts: 1441
Credit: 148,764,870
RAC: 0
United States
Message 1494659 - Posted: 24 Mar 2014, 18:30:15 UTC - in response to Message 1493999.  

I'm not sure whether that's the same problem or not. It looks as if the program reached the normal completion point, but couldn't close down properly for some reason. So, it started again, and crashed on the restart.

Still, lots of lovely debug information logged, so I'll save that for Jason - looks like the replacement wingmate is a very fast host, so it may disappear too soon.

Got one last evening similar to Juan's, on a stock cuda42 task, 3453302106, if you you want to grab the debugging info before it vanishes.

Based on the timing, I think I caused this one myself, by shutting down and restarting BOINC several times while testing a theory related to the "Zombie" AP tasks that started this thread in the first place. (I'll probably post more on that theory in a separate thread when I can.) It seems as if BOINC actually shuts down before the applications do, and this task must have managed to "finish" after BOINC was already terminated. It then restarted at 86.16% but with the "finish file" already present.
ID: 1494659 · Report as offensive
Profile jason_gee
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 24 Nov 06
Posts: 7489
Credit: 91,093,184
RAC: 0
Australia
Message 1494667 - Posted: 24 Mar 2014, 18:44:44 UTC - in response to Message 1494659.  
Last modified: 24 Mar 2014, 19:06:57 UTC

I'm not sure whether that's the same problem or not. It looks as if the program reached the normal completion point, but couldn't close down properly for some reason. So, it started again, and crashed on the restart.

Still, lots of lovely debug information logged, so I'll save that for Jason - looks like the replacement wingmate is a very fast host, so it may disappear too soon.

Got one last evening similar to Juan's, on a stock cuda42 task, 3453302106, if you you want to grab the debugging info before it vanishes.

Based on the timing, I think I caused this one myself, by shutting down and restarting BOINC several times while testing a theory related to the "Zombie" AP tasks that started this thread in the first place. (I'll probably post more on that theory in a separate thread when I can.) It seems as if BOINC actually shuts down before the applications do, and this task must have managed to "finish" after BOINC was already terminated. It then restarted at 86.16% but with the "finish file" already present.


Yeah, Boinc client likes to aggressively terminate the processes if it doesn't hear what it wants to hear in fixed time periods (i.e. That's correct termination if the client killed the app, aborting the task.).

Unfortunately that means rapidly starting/stopping can induce modes of failure totally outside of the scope of the application or boincapi embedded in it. This case is solely in the Boinc client's court (complete with some design issues).

That's especially likely when we're talking multiple tasks being asked to shut down while having been still ramping up. That's simply because the system will be under high contention (loading from disk etc), and possibly not yet even reached main code where it can listen for exit requests.

That's where Boinc has some issues with 'thread-safety', and fixed assumptions about time, that really don't apply except on 'Real-Time' operating systems like those used in car computers.
"Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions.
ID: 1494667 · Report as offensive
juan BFP Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 16 Mar 07
Posts: 9786
Credit: 572,710,851
RAC: 3,799
Panama
Message 1497277 - Posted: 29 Mar 2014, 22:08:54 UTC
Last modified: 29 Mar 2014, 22:09:13 UTC

Could be related or no, but this is not the first time one WU end on this error:

ERROR: both checkpoint files are damaged, aborting task

In this WU: http://setiathome.berkeley.edu/result.php?resultid=3461617408

Nothing apparently is wrong with the host, HDD allready checked, no error.
Before and after that the hosts crunch a lot more WU with no error.

Any clues?
ID: 1497277 · Report as offensive
Previous · 1 · 2 · 3

Message boards : Number crunching : 194 (0xc2) EXIT_ABORTED_BY_CLIENT - "finish file present too long"


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.