Posts by MajorTom

21) Message boards : Number crunching : Cuda memory leak and freezes and other issues // lunatics 0.44 (Message 1802397)
Posted 13 Jul 2016 by MajorTom
Post:
Hi AMDave

I don't think it has anything to do with the number of instances I'm running. The major issue is still the same: I can't leave the rig unattended.

If it were only the bloating and the lags, then maybe. But I'm also unable to get a display signal once the screen goes blank after the screen saver, usually after about 10 hours in this state.

And the CUDA memory leaks or nv_driver_reset occur in a totally random pattern, and when I give the GPUs more CPU cycles they are left unused and the tasks are still bloated anyway.

On the old mainboard with the GTX 580 I didn't have these issues with the same number of instances; I can't imagine it's so different with the GTX 970.

But yes, the whole thing started around the time I swapped the gfx card & display, RAM and PSU, and since then nothing has been normal, only problems.

And when I now put the old RAM and gfx card back in, I have the same issues as with the new parts; that's the reason I have no clue what it might be.

When I check the task manager now I have about 10% idle CPU. I think that should suffice, because about 2 years ago it ran SETI fine with 7cpu 3nvgpu 1igpu.

OFC I can try it with 6cpu 2(3)nv 1igpu and report whether it makes any difference; I think I will even go back to stock to check again.
I'm pretty sure it's not the reason for the misbehaving, but I've already tried almost everything, so why not try that too.

The biggest problem is that when I leave the rig alone for more than about 8 hours it locks up and I can't get a display output on either of my displays. If the old display would wake up then the case would be clear, but here I really don't know what it could be.

That's the reason I'm writing here. I've been battling this for so long now and really nothing helps. Before I sent in the new MB, I ran the stock apps for a while and the major issue remained the same, so again, I can't imagine it has to do with the number of instances I'm running, as long as they are reasonable.
22) Message boards : Number crunching : Cuda memory leak and freezes and other issues // lunatics 0.44 (Message 1802338)
Posted 13 Jul 2016 by MajorTom
Post:
FurMark burn-in test runs through with no issue, CPU stress test no issue
win7 mem check, 2 passes extended, no errors
standalone memtest86+ extended check, no errors reported after 12 hours
sfc reports no issues in the sys

cooling is no issue, CPU runs at 55 °C at full load

I got the CUDA memory leak with the GTX 580 too, and it crashes unattended the same way as the 970. The nv_driver resets happen on both cards, so I think it has to be something different.

The really odd thing is, when I'm using my rig it's no problem, besides the lags. When I leave the sys running for a while and the screen goes blank, it doesn't wake up, or it's frozen in the screensaver.
I should mention that I'm using dual displays, but honestly that shouldn't be the source of all this trouble.

All in all it's a really odd and strange thing; I've never had this in all the time I've been dealing with PCs and SETI, and that's already a very long time.

I've been going through this with a friend of mine and he has no clue either. I've rebuilt my whole rig about 4 or 5 times in the last 9 months, so he asked whether it might have something to do with the SETI apps. I didn't believe that, but at least it would be a possibility, and it's the reason I'm asking what you think ;-)

I've already said as a joke that this system is totally jinxed and I need an exorcist =D It's that kind of annoying.
23) Message boards : Number crunching : Cuda memory leak and freezes and other issues // lunatics 0.44 (Message 1802337)
Posted 13 Jul 2016 by MajorTom
Post:
I notice you're running Windows 7, and many people have been having issues with Windows Update chewing up lots of CPU time.

I'd suggest Process Explorer to see just what is running & using system resources.
My system has 2 GTX 750Tis & runs 3 WUs at a time on each of them with no system sluggishness using CUDA50.
However I don't have an iGPU, and even if I did I wouldn't use it for crunching- sharing memory, power & heat limits with the CPU results in significantly reduced work output compared to just running the CPU alone.


Yes, I've noticed that odd behavior with Windows Update on my fresh win7 install: the service hangs at about 12% CPU load along with the svchost. I installed the KB fix but it still occurs from time to time. If I meet someone from MS I'll punch them straight in the face for doing this; IMHO they do it on purpose to urge ppl to upgrade to win10.

But I've just checked the task manager and there's no hanging svchost. I will check again when it starts lagging that awfully again. But there is no working solution for this issue, which has been public for a year or even longer.

It's a shame on MS; I'm never ever gonna buy an OS from them again.
24) Message boards : Number crunching : Cuda memory leak and freezes and other issues // lunatics 0.44 (Message 1802335)
Posted 13 Jul 2016 by MajorTom
Post:
thx for all the kind replies =)

hmmm... so my biggest concern might be true and it doesn't have to do with SETI at all, but then it's really difficult to say what might be causing this.

here are the specs not shown

It's a GTX 970 from Asus, the Strix model, which is factory overclocked; that's the reason for the 1304 MHz default clock. The card has already been sent in under warranty twice; the last time, Asus replaced it with a completely new card.

No OC at all done on my side, neither CPU nor RAM nor GPUs. CPU is at 3.9 GHz max default turbo and RAM at 1866 MHz, but the issue persists if I underclock the RAM to 1800 and 1600; it makes no difference. It's a Z97 chipset whose max default RAM speed is 1866, but in theory it shouldn't be a problem clocking the RAM above that.

Like on the GTX 580 that I used for crunching for a long time, I run these instances: 7cpu 3nv_gpu 1igpu. Because all the issues persist, I changed it 2 days ago to 7cpu 2nv 1igpu. No difference: cuda50 still shows the bloated behavior and the lagging of the whole sys is even worse than with 3 instances. Running only 6 CPU instances doesn't change the issue either.

From your responses I'm more clueless than before, but thx; so I know there's really something strange going on that only affects my system. This was not the answer I hoped to hear, but at least it might be something completely different that I have to look for.

My last possible guess: it might be something with the in-house power connection. I tried different wall sockets and it made no difference, but if there is more evidence pointing in this direction, I'll have to try it with an APC PDU / UPS.

The odd thing is, if it really has something to do with the power grid quality, it didn't affect me 3 or 4 years ago; it started being really disturbing and nagging soon after the wow competition '15.

No other software is running on this sys. Since these issues started I've reinstalled the OS thrice and kept it barebone since then; only the antivirus SW is running additionally, and no monitoring SW is running.

One thing that might be worth trying would be to switch off the iGPU crunching and see what happens then, like Grant mentioned. But IMHO, if that were it, then why is it only affecting my system? It shouldn't be a problem using the iGPU and dGPU at the same time, though.

thx for all the suggestions
25) Message boards : Number crunching : Cuda memory leak and freezes and other issues // lunatics 0.44 (Message 1802309)
Posted 13 Jul 2016 by MajorTom
Post:
Hello Folks

I have a quick question: is anyone having the same issues as me?

For almost nine months now I've been having extreme issues with all the SETI apps: constant freezes, NVIDIA driver resets, almost no performance on GPU tasks.

I've changed every part in my rig since then to rule out faulty HW, but the errors remain absolutely the same =( Burn-in tests show no problem at all.

So my conclusion is that the SETI apps have really big issues, and I'm really in the mood to stop my contribution to this project. It's sad but I don't have the nerves anymore; it's like a brutal kick in the nuts... to see how this project is going down the drain... it's a shame... corporate USA, you're ruining everything... stop waging stupid wars and give science & education the funding they need.

It's a big shame =( to see how the corrupt sociopathic US corp. government treats the world, and our computations don't even get an official analysis. No ntpkcr and that stupid billionaire is taking funds away from SETI. From my view the SETI staff is barely able to keep the project alive, but it's not their fault; they are enthusiasts doing the best they can with a small crew and small budget, like most of us. Okay, that's the gossip part...

back on topic

Running SETI since New Year '16 has been frustrating as hell. Almost every time I leave my rig unattended for a day or 2 it's 99% sure to have crashed, with no log entries at all. RAM, MB, CPU, PSU and gfx card were changed several times, and I still have the same issues as before... no change at all in the error description. But a lot of downtime, about 2 1/2 months this year alone. I got my sys back up and running a few days ago and it's the same problem as in January: constantly crashing.

When a guppi is running on my nv GPU, it almost locks up my system. The lags are so awful, several seconds, mostly about 5-15 secs, that I'm not able to write this text without shutting down SETI. It's really a pain in the "*&&!

Does anyone have the same experiences or issues as I have? I would be glad to hear from you.

Any recommendations? Going back to stock until lunatics 0.45 is released?

If someone needs the specs of my rig, you will find them in my active hosts ;-)

I see Raistmer is pretty active working on something that might be the right direction, but I'm not sure if it has to do with the issues I'm describing.

But what I conclude is that there is pretty surely a bug in the iGPU SETI v7/8 app that occurs at exit when a task finishes... there it's prone to crash the whole sys with a permanent freeze that requires a reboot.

Kind regards
MajorTom
26) Message boards : Number crunching : new error message (Message 1005361)
Posted 17 Jun 2010 by MajorTom
Post:
Ok, I'll try out the update when my cache has run dry =) that might take a day or 2.
I think I'll make a backup of the whole boinc folder anyway

I'll report then what happened ;-) I hope the opt app still works afterwards, with boinc installed as a service
27) Message boards : Cafe SETI : Seti usr profile changes, How? (Message 1005186)
Posted 17 Jun 2010 by MajorTom
Post:
Is there a way to change the usr profile?
I can only find a way to look at it or delete it; I would just like to make some updates and delete some old, outdated text content.

Your help is welcome
thx in advance


Lol, I've found it myself. Geez, why does it go via community/usr profile and not via account ;-P
From my point of view both would be good; then nobody would have to search
28) Message boards : Cafe SETI : Seti usr profile changes, How? (Message 1005184)
Posted 17 Jun 2010 by MajorTom
Post:
Is there a way to change the usr profile?
I can only find a way to look at it or delete it; I would just like to make some updates and delete some old, outdated text content.

Your help is welcome
thx in advance
29) Message boards : Number crunching : new error message (Message 1005182)
Posted 17 Jun 2010 by MajorTom
Post:
Small update: got new work, but it's seti enhanced, though that isn't optimized either


I checked it today, and it is crunching with AK_v8_win_x64_SSE41.exe, so it looks perfect to me.

Has anyone updated the BOINC platform? I'm still on 6.2.19. Any inconvenience with version 6.10.56 and optimized apps?
30) Message boards : Number crunching : new error message (Message 1005049)
Posted 17 Jun 2010 by MajorTom
Post:
Small update: got new work, but it's seti enhanced, though that isn't optimized either
31) Message boards : Number crunching : new error message (Message 1005043)
Posted 17 Jun 2010 by MajorTom
Post:
Same for me. After that hint, I removed Astropulse AP 5.00 from the app_info.xml too and the error msg disappeared, but I still don't get any new work for SETI.

Anyway, a few days ago I decided to try another project, because SETI has no work.
I tried rosetta@home (folding@home doesn't use BOINC); I liked the medical aspect. The downside is that they have no optimized binaries =( so a WU takes about 3 hours.

Does somebody know something cool with optimized files that doesn't do science for astronomical research? I only know of climate prediction. But I don't see much sense in that since BP spilled oil into the sea (maybe better than burning it in cars, sarcastic thought)

I would like to crunch for health care, and I can't use GPUGRID, I think, with an ATI GPU. Something that would use the SSE4.1 capability of my CPU would be nice.

Any hints?

Kind regards, keep on crunching
32) Message boards : Number crunching : new error message (Message 1002468)
Posted 10 Jun 2010 by MajorTom
Post:
Hi, here I am again,

my machine now acts pretty much like yours, Bill.

I still get the error msg since I rebooted yesterday evening, but at least after the reboot my host is able to download WUs again; the error msg remained.

It looks strange, but at least it's crunching again =) I think the whole thing is server-side and we can't really do anything. Modifying app_info might be a way, but with a larger version than "603" nothing helped in my case, and I've only tried adding a newer app version to it, based on trial and error ;-)

Happily, idle is now at 0% again =) and I'm still on SSE4.1

kind regards
MajorTom
33) Message boards : Number crunching : new error message (Message 1002116)
Posted 9 Jun 2010 by MajorTom
Post:
I get the same error msg, I wonder why? It seems to have something to do with the updates during the weekly outage.

Tried to modify the app_info.xml but it didn't help

</app_version>
<app_version>
<app_name>setiathome_enhanced</app_name>
<version_num>603</version_num>
<file_ref>
<file_name>AK_v8_win_x64_SSE41.exe</file_name>
<main_program/>
</file_ref>
</app_version>

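For anyone hand-editing an app_info.xml like the fragment above, here is a minimal Python sketch of a local sanity check before restarting BOINC. The complete file in `APP_INFO` is hypothetical: the post only shows one `<app_version>` block, so the `<app_info>`, `<app>` and `<file_info>` wrapper elements are assumed from the usual anonymous-platform layout, and `check_app_info` is an illustrative helper, not part of BOINC. It only checks that the file is well-formed and internally consistent; it cannot tell what the server means by "Invalid app version description".

```python
import xml.etree.ElementTree as ET

# Hypothetical complete file built around the fragment from the post.
# The wrapper elements are assumptions; only <app_version> was quoted.
APP_INFO = """<app_info>
  <app>
    <name>setiathome_enhanced</name>
  </app>
  <file_info>
    <name>AK_v8_win_x64_SSE41.exe</name>
    <executable/>
  </file_info>
  <app_version>
    <app_name>setiathome_enhanced</app_name>
    <version_num>603</version_num>
    <file_ref>
      <file_name>AK_v8_win_x64_SSE41.exe</file_name>
      <main_program/>
    </file_ref>
  </app_version>
</app_info>"""


def check_app_info(xml_text):
    """Return a list of problems; an empty list means the basic checks passed."""
    # ET.fromstring raises ParseError if the XML is not well-formed,
    # e.g. when a closing tag has no matching opening tag.
    root = ET.fromstring(xml_text)
    problems = []
    app_names = {e.text.strip() for e in root.findall("app/name") if e.text}
    files = {e.text.strip() for e in root.findall("file_info/name") if e.text}
    for av in root.findall("app_version"):
        name = (av.findtext("app_name") or "").strip()
        if name not in app_names:
            problems.append(f"app_version refers to undeclared app {name!r}")
        version = (av.findtext("version_num") or "").strip()
        if not version.isdigit():
            problems.append(f"version_num {version!r} is not a plain integer")
        for ref in av.findall("file_ref"):
            fname = (ref.findtext("file_name") or "").strip()
            if fname not in files:
                problems.append(f"file_ref names undeclared file {fname!r}")
    return problems


print(check_app_info(APP_INFO))
```

If the list comes back empty, the file is at least consistent on the client side, which fits the guess that the error is server-side.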
09.06.2010 15:26:23||Starting BOINC client version 6.2.19 for windows_x86_64
09.06.2010 15:26:23||log flags: task, file_xfer, sched_ops
09.06.2010 15:26:23||Libraries: libcurl/7.18.0 OpenSSL/0.9.8g zlib/1.2.3
09.06.2010 15:26:23||Running as a daemon
09.06.2010 15:26:23||Data directory: C:\Documents and Settings\All Users\Application Data\BOINC
09.06.2010 15:26:23||Running under account boinc_master
09.06.2010 15:26:23|SETI@home|Found app_info.xml; using anonymous platform
09.06.2010 15:26:23||Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q9550 @ 2.83GHz [EM64T Family 6 Model 23 Stepping 10]
09.06.2010 15:26:23||Processor features: fpu tsc pae nx sse sse2
09.06.2010 15:26:23||OS: Microsoft Windows XP: Professional x64 Editon, Service Pack 2, (05.02.3790.00)
09.06.2010 15:26:23||Memory: 4.00 GB physical, 5.74 GB virtual

09.06.2010 15:26:23|SETI@home|URL: http://setiathome.berkeley.edu/; Computer ID: 4630864; location: home; project prefs: home
09.06.2010 15:26:23||General prefs: from SETI@home (last modified 19-Dec-2005 16:35:10)
09.06.2010 15:26:23||Computer location: home
09.06.2010 15:26:23||General prefs: no separate prefs for home; using your defaults

09.06.2010 15:26:56|SETI@home|Sending scheduler request: Requested by user. Requesting 639360 seconds of work, reporting 0 completed tasks
09.06.2010 15:27:01|SETI@home|Scheduler request succeeded: got 0 new tasks
09.06.2010 15:27:01|SETI@home|Message from server: Invalid app version description
09.06.2010 15:27:01|SETI@home|Message from server: Project has no jobs available



09.06.2010 15:27:01|SETI@home|Message from server: Invalid app version description
09.06.2010 15:27:01|SETI@home|Message from server: Project has no jobs available
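To pull the server replies out of a longer log, a small Python sketch like the following could be used. It assumes the `date time|project|message` layout seen in the lines above (with dates as day.month.year); `parse_line` and `server_messages` are illustrative names, not BOINC tools.

```python
from datetime import datetime

# Scheduler lines copied from the log above.
LOG = """09.06.2010 15:26:56|SETI@home|Sending scheduler request: Requested by user. Requesting 639360 seconds of work, reporting 0 completed tasks
09.06.2010 15:27:01|SETI@home|Scheduler request succeeded: got 0 new tasks
09.06.2010 15:27:01|SETI@home|Message from server: Invalid app version description
09.06.2010 15:27:01|SETI@home|Message from server: Project has no jobs available"""


def parse_line(line):
    """Split one log line into (timestamp, project, message)."""
    stamp, project, message = line.split("|", 2)
    return datetime.strptime(stamp, "%d.%m.%Y %H:%M:%S"), project, message


def server_messages(log_text):
    """Collect (project, text) for every 'Message from server' line."""
    prefix = "Message from server: "
    found = []
    for line in log_text.splitlines():
        if line.count("|") < 2:
            continue  # skip blank or malformed lines
        _, project, message = parse_line(line)
        if message.startswith(prefix):
            found.append((project, message[len(prefix):]))
    return found


for project, text in server_messages(LOG):
    print(project, "->", text)
```

As a side note, the 639360 seconds of work requested in the first line corresponds to 7.4 days (639360 / 86400).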

hope they can fix it soon

Kind regards CU




 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.