BOINC is NOT robust against BSOD / lockups

Message boards : Number crunching : BOINC is NOT robust against BSOD / lockups

YeshuaAgapao
Joined: 23 Oct 00
Posts: 13
Credit: 5,627,678
RAC: 0
United States
Message 238675 - Posted: 28 Jan 2006, 7:14:20 UTC

X-Plane (8.32 this time) caused a BSOD in the video driver (nv4disp) instead of locking up the machine. This seems to happen about twice a week. The BSOD took out most of BOINC's settings: SETI is back to the corrupt banner.jpg thing, LHC's .exe is missing, and both LHC and Einstein did resets. I managed to get my LHC work copied. BOINC seems to leave the orphaned work in the directories; I found 3 orphaned CPDN workunits. Is there any way I can get these reconnected? And can the dev people make BOINC more robust against hard crashes caused by bad games and buggy video drivers?

ID: 238675
Grant (SSSF)
Volunteer tester
Joined: 19 Aug 99
Posts: 13961
Credit: 208,696,464
RAC: 304
Australia
Message 238676 - Posted: 28 Jan 2006, 7:16:46 UTC - in response to Message 238675.  

and can the dev people make BOINC more robust against hard crashes by bad games and buggy video drivers?

No.
Just don't use buggy software, problem solved.
Grant
Darwin NT
ID: 238676
YeshuaAgapao
Joined: 23 Oct 00
Posts: 13
Credit: 5,627,678
RAC: 0
United States
Message 238680 - Posted: 28 Jan 2006, 7:28:24 UTC

Is there any way to get orphaned workunits (CPDN and LHC) reattached to BOINC?

ID: 238680
Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 238681 - Posted: 28 Jan 2006, 7:29:40 UTC

It would be nice to know which nVidia card you used...
ID: 238681
YeshuaAgapao
Joined: 23 Oct 00
Posts: 13
Credit: 5,627,678
RAC: 0
United States
Message 238682 - Posted: 28 Jan 2006, 7:33:57 UTC

Nvidia GeForce 6800 GO on an Alienware Area51m 7700 (computer name 'Inca' in my computer listings).

ID: 238682
Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 238684 - Posted: 28 Jan 2006, 7:44:55 UTC - in response to Message 238682.  
Last modified: 28 Jan 2006, 7:45:55 UTC

We don't see videocards. We don't see Computer names. We don't see IP addresses. We can only handle what information you give us.

So it would be nice to start from the beginning. So you have an nVidia GeForce 6800 with what drivers? Which Windows/Linux version? What DirectX version? Is it an AGP card or a PCI-E? Have you updated your motherboard drivers lately? Your videocard drivers? Tried the ones from nVidia? Are you overclocking the GPU on your videocard? Tried it at stock speed?

Just little simple questions like that.
ID: 238684
YeshuaAgapao
Joined: 23 Oct 00
Posts: 13
Credit: 5,627,678
RAC: 0
United States
Message 238686 - Posted: 28 Jan 2006, 7:48:45 UTC
Last modified: 28 Jan 2006, 7:54:22 UTC

I checked the video driver version. It was old, so I updated it. DirectX is 9.0c and it's on PCI-E 16x. Maybe that will fix it.

You can see the computer name 'Inca' here -- http://setiathome.berkeley.edu/hosts_user.php?userid=51993 . It has apparently been duplicated four-fold by the last BSOD. Yes, I know I can merge the copies, but I don't feel like checking all 5 projects right now.

Can I fix my orphaned LHC workunits? There's about 5 days of total work, with 2 of them completed. There are also 3 'lost' CPDN workunits: 2 from this crash and 1 apparently from the last crash.

ID: 238686
Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 238688 - Posted: 28 Jan 2006, 8:03:37 UTC - in response to Message 238686.  
Last modified: 28 Jan 2006, 8:08:12 UTC

You can see the computer name 'Inca' here

No, we cannot see your computer names. Click on my name and try to figure out what my computer is called, please. You can't. Only I can see that info; no one else can. And thus no one here can see your computer names and info either, only how much RAM it has, what CPU, what OS and the CPU benchmarks.

And no, you cannot "fix" your orphaned results. They are no longer recognized as being part of "your computer" since your computer has gotten a new hostID. Results are bound to the hostID of the computer they were downloaded on. No other hostID can upload them.
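
As a rough illustration of that binding, here is a toy model in Python. It is only a sketch of the rule described above, not BOINC's actual server code: the names `Result`, `assigned_host_id` and `can_upload`, and the host IDs, are all made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    """Toy stand-in for a downloaded result; not a real BOINC structure."""
    name: str
    assigned_host_id: int  # hostID the work was originally downloaded on


def can_upload(result: Result, current_host_id: int) -> bool:
    """Accept an upload only from the hostID the result was issued to."""
    return result.assigned_host_id == current_host_id


# Hypothetical IDs: after the crash the client registered as a new host.
old_host_id, new_host_id = 1001, 1002
orphan = Result(name="example_wu_0", assigned_host_id=old_host_id)

print(can_upload(orphan, old_host_id))  # True
print(can_upload(orphan, new_host_id))  # False
```

The files are still on disk, but in this model the machine now presents a different hostID, so the check fails for every result issued before the crash.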

So you have one videocard running at PCI-E 16x? Why? Isn't that mostly useful when you have 2 to 4 of them running in SLI? I'm fairly sure you can turn it down to standard and test again, or else refuse to run graphics or the screen saver for now. But have you updated your motherboard drivers lately?




ID: 238688
YeshuaAgapao
Joined: 23 Oct 00
Posts: 13
Credit: 5,627,678
RAC: 0
United States
Message 238689 - Posted: 28 Jan 2006, 8:08:15 UTC

Oh ok. Looks like you can only see your own account's computer names.

It's a laptop. I've never seen a laptop with two video cards.

ID: 238689
Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 238690 - Posted: 28 Jan 2006, 8:13:43 UTC - in response to Message 238689.  

It's a laptop.

This is why we ask details about your setup. Or why you should give details about your setup to begin with. How are we to guess what kind of computer you have, desktop, tower or laptop?

Laptops overheat pretty easily. Especially when running 24/7 with CPU intensive tasks. Like the science applications running in BOINC. And more so even when you run graphics.

I had already edited my post above to add "Have you tried updating your motherboard drivers?", and I will add "Have you looked for a new BIOS?" On top of that, you have to cool the laptop properly: off a flat surface, on top of some books, but so that the fan on the underside can suck in air. And make sure the outlets on the top and back aren't blocked!
ID: 238690
YeshuaAgapao
Joined: 23 Oct 00
Posts: 13
Credit: 5,627,678
RAC: 0
United States
Message 238695 - Posted: 28 Jan 2006, 8:28:58 UTC

It's on a desk, but I have the front lifted up, sitting on Matchbox cars; otherwise the video card overheats (the computer shuts off). The CPU fans are fine.

Looks like I cannot run the main X-Plane game and Plane-Maker (or the airfoil maker) simultaneously with the new driver version. The screen flickers and then the main X-Plane crashes (an app crash, not a BSOD or system crash). Both of the last two big crashes on my old video drivers that corrupted my BOINC came from me switching between X-Plane and Plane-Maker.

ID: 238695
MJKelleher
Volunteer tester
Joined: 1 Jul 99
Posts: 2048
Credit: 1,575,401
RAC: 0
United States
Message 238788 - Posted: 28 Jan 2006, 15:11:27 UTC

Because of the nature of what BOINC/SETI do and the nature of what games and other video-intensive software do, they sometimes don't get along well. Add in the potential of overheating a laptop, and you're more likely to see problems there, even if it's well set up.

One workaround I've seen and suggested for gamers: when you're using your game, go to the BOINC icon in the system tray, right-click it and choose Suspend. When you're done with the game, right-click again and choose Run always or Run based on preferences, whichever is more appropriate to your setup. While a game is running, BOINC isn't going to get many CPU cycles anyway, so you're not losing much SETI processing, and you're eliminating the conflicts.
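
If you'd rather not click through the tray menu every time, the same suspend-and-resume routine could be scripted. The following is only a sketch under assumptions: it assumes a BOINC client that ships the `boinccmd` command-line tool with its `--set_run_mode always|auto|never` option (older clients may name the tool differently or lack it), that `boinccmd` is on your PATH, and that "auto" corresponds to "Run based on preferences". The helper names `run_mode_command`, `boinc_suspended` and `play` are made up for this example.

```python
import subprocess
from contextlib import contextmanager

BOINCCMD = "boinccmd"  # assumption: the BOINC command-line tool is on PATH


def run_mode_command(mode, tool=BOINCCMD):
    """Build the command line that switches BOINC's run mode."""
    if mode not in ("always", "auto", "never"):
        raise ValueError(f"unknown run mode: {mode}")
    return [tool, "--set_run_mode", mode]


@contextmanager
def boinc_suspended(resume_mode="auto", runner=subprocess.run):
    """Suspend BOINC on entry, restore the given run mode on exit."""
    runner(run_mode_command("never"), check=True)
    try:
        yield
    finally:
        # Runs even if the wrapped block raises, so BOINC is not left suspended.
        runner(run_mode_command(resume_mode), check=True)


def play(game_command):
    """Run a game (hypothetical command line) with BOINC suspended."""
    with boinc_suspended():
        subprocess.run(game_command, check=False)
```

Calling something like `play(["X-Plane.exe"])` (the executable name is a placeholder) would suspend BOINC, wait for the game to exit, and then hand control back to your preferences. It cannot help with a BSOD, of course, since nothing runs after the machine goes down.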

MJ

ID: 238788
roguebfl
Volunteer tester
Joined: 21 May 99
Posts: 129
Credit: 223,953
RAC: 0
New Zealand
Message 238892 - Posted: 28 Jan 2006, 18:24:26 UTC

Make sure the laptop is properly vented; for most models that means free-flowing air below the laptop.
uninstall dyslexica.o : Permission denied


AMD Athlon 64 3000+ w/Windows
AMD Athlon 1800+ w/Linux
ID: 238892
 