GTX 480 crunch card

Fred W
Volunteer tester
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 984159 - Posted: 27 Mar 2010, 0:03:28 UTC - in response to Message 984124.  
Last modified: 27 Mar 2010, 0:03:51 UTC

I also am down to one 295 to boot; it was running at 100% fan in less than three minutes from a cold start, and even at 100% fan, the temp climbed from 65C to 90C before I could get it shut down.

Ah well, it'll be interesting to see what this rig can do over the weekend; hopefully the project will cooperate. :)

I set the fan on my 295 manually to 100% anyway and it still runs at 96C on S@H WU's and 88C on Milkyway WU's. Need to take it apart and replace the thermal compound with Ceramique. Prior to the last time I took it apart (in the vain hope that I would be able to clean the inside of the fan out) it had run 24/7 for over 12 months at 89 - 91C; whenever it got above 93C I knew it was time to take it out of the case and apply the compressed air. A couple of weeks ago it got up to 102C for quite a period without generating any errors.

F.
ID: 984159 · Report as offensive
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 984162 - Posted: 27 Mar 2010, 0:08:41 UTC - in response to Message 984159.  
Last modified: 27 Mar 2010, 0:10:16 UTC

I also am down to one 295 to boot; it was running at 100% fan in less than three minutes from a cold start, and even at 100% fan, the temp climbed from 65C to 90C before I could get it shut down.

Ah well, it'll be interesting to see what this rig can do over the weekend; hopefully the project will cooperate. :)

I set the fan on my 295 manually to 100% anyway and it still runs at 96C on S@H WU's and 88C on Milkyway WU's. Need to take it apart and replace the thermal compound with Ceramique. Prior to the last time I took it apart (in the vain hope that I would be able to clean the inside of the fan out) it had run 24/7 for over 12 months at 89 - 91C; whenever it got above 93C I knew it was time to take it out of the case and apply the compressed air. A couple of weeks ago it got up to 102C for quite a period without generating any errors.

F.


I have never been able to get my 295 hotter than 69°C. I am about to try again as my cache is about 45 minutes from being drained, so I can load the optimized apps again. This will be interesting as my memory is at 900, clock at 670, and shader at 1560. I definitely expect some fan noise once it starts crunching on those optimized apps. If it still chokes, I'll try reducing the memory again.
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 984162 · Report as offensive
Rasputin
Volunteer tester
Joined: 13 Jun 02
Posts: 1764
Credit: 6,132,221
RAC: 0
Russia
Message 984164 - Posted: 27 Mar 2010, 0:11:47 UTC - in response to Message 984146.  


Why would a cranky old squirrel give a rat's a.........?


Because if you get banned, you'll regret it in the morning. Just trying to save you from yourself. :)

And I think you're a good person, you just get a little sidetracked at times.

Nuff off topic for the night
ID: 984164 · Report as offensive
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 984170 - Posted: 27 Mar 2010, 0:33:00 UTC

Dang Nabbit! I just loaded the optimized apps, and got 14 CUDA units. All 14 errored out in about 10 seconds. I reset the 295 to stock, and underclocked the memory again, and the server says it has no jobs for me. This is much harder than it seems.
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 984170 · Report as offensive
perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 984177 - Posted: 27 Mar 2010, 0:44:16 UTC - in response to Message 984170.  

SciManStev,

Hold your horses! Those errors were -6s. That means they were VLARs and the killer got them. That was the V12 VLARKiller doing its job.


PROUD MEMBER OF Team Starfire World BOINC
ID: 984177 · Report as offensive
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 984182 - Posted: 27 Mar 2010, 0:54:07 UTC - in response to Message 984177.  

SciManStev,

Hold your horses! Those errors were -6s. That means they were VLARs and the killer got them. That was the V12 VLARKiller doing its job.


I hate to continue to appear ignorant, but I haven't actually been able to find out what exactly VLARs are. I have read all kinds of references to these things, but not an explanation as to what they are, or why they need to be killed. That was the result I got a couple of months ago, and it said the VLARKiller got those as well. I thought that with my screen blanking, and me seeing those errors, my 295 just wouldn't work as it should. I did get some more units at stock speeds with the memory at 900, but it blanked the screen again (it just did it again) and the driver recovered. I reduced the memory to 885 on the GTX 295, and as I just mentioned it did it again.
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 984182 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65791
Credit: 55,293,173
RAC: 49
United States
Message 984184 - Posted: 27 Mar 2010, 1:03:17 UTC - in response to Message 984182.  

SciManStev,

Hold your horses! Those errors were -6s. That means they were VLARs and the killer got them. That was the V12 VLARKiller doing its job.


I hate to continue to appear ignorant, but I haven't actually been able to find out what exactly VLARs are. I have read all kinds of references to these things, but not an explanation as to what they are, or why they need to be killed. That was the result I got a couple of months ago, and it said the VLARKiller got those as well. I thought that with my screen blanking, and me seeing those errors, my 295 just wouldn't work as it should. I did get some more units at stock speeds with the memory at 900, but it blanked the screen again (it just did it again) and the driver recovered. I reduced the memory to 885 on the GTX 295, and as I just mentioned it did it again.

If You want to do them on the 295, You should switch to the No-kill Optimized app, VLAR is short for "Very Low Angle Range" and they take longer than others on the gpu to do. Otherwise the app Yer using will generate an error -6 as You've seen.
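
To make the idea concrete, here is a rough Python sketch of that kind of decision. It is only an illustration: the `WorkUnit` class, the `choose_device` helper and the 0.13 cutoff are assumptions made up for this example, not values taken from the optimized app, which aborts a VLAR task with error -6 rather than routing it anywhere.

```python
from dataclasses import dataclass

# Hypothetical cutoff below which a task counts as "very low angle range".
# The real threshold used by the apps is not stated in this thread.
VLAR_CUTOFF = 0.13


@dataclass
class WorkUnit:
    name: str
    angle_range: float  # angle range of the task


def choose_device(wu: WorkUnit, cutoff: float = VLAR_CUTOFF) -> str:
    """Return "cpu" for VLAR tasks (slow on the GPU), "gpu" otherwise."""
    return "cpu" if wu.angle_range < cutoff else "gpu"


if __name__ == "__main__":
    # Made-up task names and angle ranges, just to exercise the helper.
    for wu in (WorkUnit("task_a", 0.01), WorkUnit("task_b", 0.42)):
        print(wu.name, "->", choose_device(wu))
```

A VLAR-killing app makes the same test but throws the task back with an error instead of handing it to the CPU; the No-kill app skips the test and crunches everything on the GPU.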
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 984184 · Report as offensive
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 984200 - Posted: 27 Mar 2010, 1:26:29 UTC - in response to Message 984184.  


If You want to do them on the 295, You should switch to the No-kill Optimized app, VLAR is short for "Very Low Angle Range" and they take longer than others on the gpu to do. Otherwise the app Yer using will generate an error -6 as You've seen.


OK. Overall they are weeded out, as perhaps less credit is given, and the GPU time is better used crunching other units? Thank you again for the great information. I have read volumes of posts, and didn't know that. My goal is to do as much science as I can, and I am doing my best to figure out how to do that. What about the screen blanking and recovery, which sometimes results in a reboot? I can try to go even lower on the memory if that will help. My 295 is running at stock speeds now, except for the underclocked memory, and the screen still blanks out whether I am doing anything or not. All Aero Peek and fancy stuff is turned off.
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 984200 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65791
Credit: 55,293,173
RAC: 49
United States
Message 984219 - Posted: 27 Mar 2010, 1:55:56 UTC - in response to Message 984200.  


If You want to do them on the 295, You should switch to the No-kill Optimized app, VLAR is short for "Very Low Angle Range" and they take longer than others on the gpu to do. Otherwise the app Yer using will generate an error -6 as You've seen.


OK. Overall they are weeded out, as perhaps less credit is given, and the GPU time is better used crunching other units? Thank you again for the great information. I have read volumes of posts, and didn't know that. My goal is to do as much science as I can, and I am doing my best to figure out how to do that. What about the screen blanking and recovery, which sometimes results in a reboot? I can try to go even lower on the memory if that will help. My 295 is running at stock speeds now, except for the underclocked memory, and the screen still blanks out whether I am doing anything or not. All Aero Peek and fancy stuff is turned off.

Yer welcome, Screen Blank sounds like a Screen Saver maybe(I have XP, So I'm makin a guess, pointed ears or not;)), If Yer OS is 7 or Vista, I'd say turn It off(not the OS), As LCD monitors don't need It, I don't know about the reboot(I can guess, see below), It might be Yer PC is going to sleep and sometimes doesn't wake up(Power Options in XP x64), Try and disable that too, If You haven't already that is. Or You'd have to tell Me more about Yer PC's cpu and/or ram, Are they overclocked any? What's the cpu temp? cpu & ram speed?
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 984219 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65791
Credit: 55,293,173
RAC: 49
United States
Message 984235 - Posted: 27 Mar 2010, 2:39:25 UTC
Last modified: 27 Mar 2010, 2:47:09 UTC

Hey there's a GTX470/480 Review over on the [H]ardOCP, Here's the Link.

So far a pair of GTX480 cards in SLI mode slays an HD5970 and by a good margin too. :D

I'd sure like to see a dual gpu version of the 470/480. :D
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 984235 · Report as offensive
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 984242 - Posted: 27 Mar 2010, 2:51:28 UTC - in response to Message 984219.  


Yer welcome, Screen Blank sounds like a Screen Saver maybe(I have XP, So I'm makin a guess, pointed ears or not;)), If Yer OS is 7 or Vista, I'd say turn It off(not the OS), As LCD monitors don't need It, I don't know about the reboot(I can guess, see below), It might be Yer PC is going to sleep and sometimes doesn't wake up(Power Options in XP x64), Try and disable that too, If You haven't already that is. Or You'd have to tell Me more about Yer PC's cpu and/or ram, Are they overclocked any? What's the cpu temp? cpu & ram speed?


I read almost the entire Lunatics site, and I understand more about the VLARs. They can cause severe screen issues, and are removed except for people who are willing to take the risk. This happens to me only with the optimized apps., regardless of whether my CPU is overclocked or not. Until now, I had always thought it had to do with those errors I was getting. I had stopped using the optimized apps as I didn't want to throw errors back at the servers. Currently my CPU is very stable at 4.2 GHz, with the memory at 1774 MHz. This is interesting as I use the Throttle plug in to back throttle my CPU if it overheats. That has a default junction temperature of 100 deg. C, and it is reporting my core temps at 62 deg. C. The interesting thing is that my motherboard is reporting my core temp at 48 deg. C. My coolant temp is 12 deg C. I am not sure which temperature measurement is correct. I would use an infrared thermometer to measure the core, except for the water cooling hides it. I have Corsair DDR3 2000 RAM, so I still have room to increase it. Just by itself the screen blanking is bad, but when it causes a reboot is what worries me. I do not have BOINC to load on reboot, as I work an hour away, and if there is a problem, I don't want to fry a system or burn down my house during the day while I am gone. This happens at overclock or stock speed. I had the 295 overclocked nicely with the standard apps, and it performed very well, except if I tried to play a game of solitaire while crunching. It took forever to move a card from one place to another, and usually caused a reboot. I quickly quit playing on my cruncher, and moved my gaming to this laptop. There are no screen savers enabled, and no sleep, hibernation, or anything else I am aware of. Let me know anything else you need to know, as I really want to get to the bottom of this, and floor the throttle.
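
One possible reason for two different core readings, sketched under an assumption: many monitoring tools (RealTemp, for example) don't read an absolute temperature but the CPU's reported margin below its maximum junction temperature (TjMax), and subtract that margin from whatever TjMax they assume. If two tools assume different TjMax values, the same sensor reading gives two temperatures. Whether that explains the 62 vs 48 readings here is a guess, since the board may simply be reading a different sensor; the function names below are made up for illustration.

```python
def core_temp_c(tj_max_c: float, margin_c: float) -> float:
    """Core temperature implied by an assumed TjMax and a sensor margin."""
    return tj_max_c - margin_c


def implied_tj_max_c(reported_temp_c: float, margin_c: float) -> float:
    """TjMax a tool would have to assume to report this temperature."""
    return reported_temp_c + margin_c


# With the default TjMax of 100 and a reported 62, the margin is 38.
margin = 100 - 62

print(core_temp_c(100, margin))      # 62
print(implied_tj_max_c(48, margin))  # 86
```

So if both numbers came from the same sensor, the 48 reading would correspond to an assumed TjMax of 86 rather than 100.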
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 984242 · Report as offensive
perryjay
Volunteer tester
Joined: 20 Aug 02
Posts: 3377
Credit: 20,676,751
RAC: 0
United States
Message 984259 - Posted: 27 Mar 2010, 3:03:35 UTC - in response to Message 984200.  

SciManStev,

How are your temps on your GPUs? Could one of them be running a bit hot? You've only reported one of the errors from the screen blanking but it is device-1 and it is an error code 1. I've been getting some of those on mine too. Haven't figured out what it is yet. Whatever it is it's causing your graphics driver to fail and recover. Some people have said it could be because of overheating.


PROUD MEMBER OF Team Starfire World BOINC
ID: 984259 · Report as offensive
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 984270 - Posted: 27 Mar 2010, 3:09:13 UTC - in response to Message 984259.  
Last modified: 27 Mar 2010, 3:09:46 UTC

SciManStev,

How are your temps on your GPUs? Could one of them be running a bit hot? You've only reported one of the errors from the screen blanking but it is device-1 and it is an error code 1. I've been getting some of those on mine too. Haven't figured out what it is yet. Whatever it is it's causing your graphics driver to fail and recover. Some people have said it could be because of overheating.


I have never gotten my GPU over 69 deg. C. I am told that that is not hot for a GTX 295. I am running the optimized apps now, and it is 67 and 53. That's odd. They are usually the same temperature. It was actually running hotter when I had it overclocked with the standard apps.
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 984270 · Report as offensive
Fred W
Volunteer tester
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 984273 - Posted: 27 Mar 2010, 3:12:59 UTC - in response to Message 984242.  

... I really want to get to the bottom of this ...

As you are still having problems with re-boots etc. it sounds like something more is amiss than just a memory weakness on the 295. But to eliminate that possibility first, have a look at this old post of mine and check the memory on the 295 as a starter.

I'm running only a quaddie rather than an i7 but I have had problems not totally dissimilar from an overheating northbridge on my Asus MoBo (I had to turn the CPU overclock down 10% to cure this) so it could be motherboard weakness; or similar effects could occur if you were reaching the limit of your PSU capability.

I know this is highly frustrating to you but, from this end, it is quite intriguing.

F.
ID: 984273 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65791
Credit: 55,293,173
RAC: 49
United States
Message 984284 - Posted: 27 Mar 2010, 3:18:49 UTC - in response to Message 984242.  


Yer welcome, Screen Blank sounds like a Screen Saver maybe(I have XP, So I'm makin a guess, pointed ears or not;)), If Yer OS is 7 or Vista, I'd say turn It off(not the OS), As LCD monitors don't need It, I don't know about the reboot(I can guess, see below), It might be Yer PC is going to sleep and sometimes doesn't wake up(Power Options in XP x64), Try and disable that too, If You haven't already that is. Or You'd have to tell Me more about Yer PC's cpu and/or ram, Are they overclocked any? What's the cpu temp? cpu & ram speed?


I read almost the entire Lunatics site, and I understand more about the VLARs. They can cause severe screen issues, and are removed except for people who are willing to take the risk. This happens to me only with the optimized apps., regardless of whether my CPU is overclocked or not. Until now, I had always thought it had to do with those errors I was getting. I had stopped using the optimized apps as I didn't want to throw errors back at the servers. Currently my CPU is very stable at 4.2 GHz, with the memory at 1774 MHz. This is interesting as I use the Throttle plug in to back throttle my CPU if it overheats. That has a default junction temperature of 100 deg. C, and it is reporting my core temps at 62 deg. C. The interesting thing is that my motherboard is reporting my core temp at 48 deg. C. My coolant temp is 12 deg C. I am not sure which temperature measurement is correct. I would use an infrared thermometer to measure the core, except for the water cooling hides it. I have Corsair DDR3 2000 RAM, so I still have room to increase it. Just by itself the screen blanking is bad, but when it causes a reboot is what worries me. I do not have BOINC to load on reboot, as I work an hour away, and if there is a problem, I don't want to fry a system or burn down my house during the day while I am gone. This happens at overclock or stock speed. I had the 295 overclocked nicely with the standard apps, and it performed very well, except if I tried to play a game of solitaire while crunching. It took forever to move a card from one place to another, and usually caused a reboot. I quickly quit playing on my cruncher, and moved my gaming to this laptop. There are no screen savers enabled, and no sleep, hibernation, or anything else I am aware of. Let me know anything else you need to know, as I really want to get to the bottom of this, and floor the throttle.

The error isn't a real problem, As It's merely saying It's a rejected WU that would be more suitable elsewhere(like on a cpu), So I wouldn't worry about It or I'd run the stock cuda app instead of the optimized app. Oh and the software to measure the core temp, Is It RealTemp? The forum for RealTemp is Here, As there is a beta 3.56 and 3.58 there and a good amount of info too. Listen to Fred and yeah gpu temps at stock video card speed(fan 100%) I've seen them and I'm not too worried, Then I can still play a game, But I usually turn Boinc off, Just in case. then When I'm done with the game, turn Boinc back on.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 984284 · Report as offensive
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 984295 - Posted: 27 Mar 2010, 3:25:23 UTC - in response to Message 984273.  

... I really want to get to the bottom of this ...

As you are still having problems with re-boots etc. it sounds like something more is amiss than just a memory weakness on the 295. But to eliminate that possibility first, have a look at this old post of mine and check the memory on the 295 as a starter.

I'm running only a quaddie rather than an i7 but I have had problems not totally dissimilar from an overheating northbridge on my Asus MoBo (I had to turn the CPU overclock down 10% to cure this) so it could be motherboard weakness; or similar effects could occur if you were reaching the limit of your PSU capability.

I know this is highly frustrating to you but, from this end, it is quite intriguing.

F.


This happens regardless of any overclock or none on the CPU. It only happens with the optimized apps, and it happens regardless of whether the 295 is overclocked or not. It is frustrating, but I know I will get through it. I do have two GTX 480 cards coming, and I can't wait to see what they do. I downloaded MemtestG80, and just ran it. I put it into the C drive root directory, and ran it from the command line as it said in the readme. It errored and said that cudart.dll was not found, and to reinstall it. That is in the SETI@Home directory under BOINC. Am I doing something wrong?
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 984295 · Report as offensive
zoom3+1=4
Volunteer tester
Joined: 30 Nov 03
Posts: 65791
Credit: 55,293,173
RAC: 49
United States
Message 984358 - Posted: 27 Mar 2010, 5:04:04 UTC - in response to Message 984295.  

... I really want to get to the bottom of this ...

As you are still having problems with re-boots etc. it sounds like something more is amiss than just a memory weakness on the 295. But to eliminate that possibility first, have a look at this old post of mine and check the memory on the 295 as a starter.

I'm running only a quaddie rather than an i7 but I have had problems not totally dissimilar from an overheating northbridge on my Asus MoBo (I had to turn the CPU overclock down 10% to cure this) so it could be motherboard weakness; or similar effects could occur if you were reaching the limit of your PSU capability.

I know this is highly frustrating to you but, from this end, it is quite intriguing.

F.


This happens regardless of any overclock or none on the CPU. It only happens with the optimized apps, and it happens regardless of whether the 295 is overclocked or not. It is frustrating, but I know I will get through it. I do have two GTX 480 cards coming, and I can't wait to see what they do. I downloaded MemtestG80, and just ran it. I put it into the C drive root directory, and ran it from the command line as it said in the readme. It errored and said that cudart.dll was not found, and to reinstall it. That is in the SETI@Home directory under BOINC. Am I doing something wrong?


What version of Boinc are You using? As Boinc from a certain version and up doesn't store cudart.dll in the Boinc folder, So I'd guess You had the file in the hidden location If Yer using 6.10.43 or so, I put My cudart.dll and the Boinc files in a folder in the My Docs folder(G:\My Docs\Boinc-Files), Otherwise You'd have to unhide the folder along the path to the files where Your dll file is located at.
The T1 Trust, PRR T1 Class 4-4-4-4 #5550, 1 of America's First HST's
ID: 984358 · Report as offensive
Fred W
Volunteer tester
Joined: 13 Jun 99
Posts: 2524
Credit: 11,954,210
RAC: 0
United Kingdom
Message 984366 - Posted: 27 Mar 2010, 5:19:50 UTC - in response to Message 984295.  

It errored and said that cudart.dll was not found, and to reinstall it. That is in the SETI@Home directory under BOINC. Am I doing something wrong?

Copy the cudart.dll that you found into the folder where you put the MemtestG80. It should run then?
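
If you'd rather script it, a minimal Python sketch of that copy is below. The helper name and the example paths are placeholders, since where BOINC keeps its project files depends on the BOINC version and OS; substitute wherever you actually found cudart.dll.

```python
import shutil
from pathlib import Path


def copy_runtime_dll(src_dir, dest_dir, name="cudart.dll"):
    """Copy `name` from src_dir into dest_dir and return the new path.

    Raises FileNotFoundError if the DLL is not in src_dir, which is a
    hint that you are looking in the wrong (or a hidden) folder.
    """
    source = Path(src_dir) / name
    if not source.is_file():
        raise FileNotFoundError(f"{name} not found in {src_dir}")
    target = Path(dest_dir) / name
    shutil.copy2(source, target)  # copy2 also keeps the file timestamps
    return target


# Example call with placeholder paths (adjust both to your machine):
# copy_runtime_dll("C:/path/to/seti/project/folder", "C:/")
```

Windows looks in the program's own folder first when loading a DLL, which is why dropping the file next to the MemtestG80 executable is enough.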

F.
ID: 984366 · Report as offensive
hbomber
Volunteer tester
Joined: 2 May 01
Posts: 437
Credit: 50,852,854
RAC: 0
Bulgaria
Message 984419 - Posted: 27 Mar 2010, 9:49:37 UTC
Last modified: 27 Mar 2010, 9:50:05 UTC

Can we finally stay on topic, please?

http://www.anandtech.com/video/showdoc.aspx?i=3783&p=6
ID: 984419 · Report as offensive
SciManStev
Volunteer tester
Joined: 20 Jun 99
Posts: 6653
Credit: 121,090,076
RAC: 0
United States
Message 984439 - Posted: 27 Mar 2010, 12:18:32 UTC - in response to Message 984419.  

Can we finally stay on topic, please?

http://www.anandtech.com/video/showdoc.aspx?i=3783&p=6


It's too bad they didn't include the GTX 295 in their reviews. Either way, I can hardly wait to load these up, and go for it.
Warning, addicted to SETI crunching!
Crunching as a member of GPU Users Group.
GPUUG Website
ID: 984439 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.