Panic Mode On (76) Server Problems?

Message boards : Number crunching : Panic Mode On (76) Server Problems?
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · 15 . . . 20 · Next

AuthorMessage
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1273783 - Posted: 21 Aug 2012, 22:54:10 UTC - in response to Message 1273779.  

My GTX560 Ti & dual GTX550 Ti rigs don't mind the VLARs at all but overnight my triple 9800 rig started on its lot with very bad results eventually locking it up while I slept, so they all got aborted this morning on that setup.

Cheers.
ID: 1273783 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1273784 - Posted: 21 Aug 2012, 23:01:43 UTC - in response to Message 1273782.  

<snippage occured>

*EDIT* - Bill, out of curiosity, when was the last time you saw a VLAR running on one of your nVidia GPUs?[/quote]

3 are running concurrently right now. Ran out of regular
gpu work units during the outage so shipped a few over from the cpu.

Why? And yes I did receive some for the Nvidia gpu during the last server hiccup. Did those too on the gpu. Run times were posted in an earlier post.
ID: 1273784 · Report as offensive
Profile Sunny129
Avatar

Send message
Joined: 7 Nov 00
Posts: 190
Credit: 3,163,755
RAC: 0
United States
Message 1273789 - Posted: 21 Aug 2012, 23:26:17 UTC - in response to Message 1273784.  

Why? And yes I did receive some for the Nvidia gpu during the last server hiccup. Did those too on the gpu. Run times were posted in an earlier post.

b/c a number of us haven't seen any VLARs crunch on our GPUs since the "fix" was put in place. in the Bug in server affecting older BOINC clients with NVIDIA GPUs thread, someone was asking troubleshooting questions a few days ago. after solving his problem, he made mention of some VLARs, which briefly led Erik Corpela to think that the VLARs were broken again. but it was quickly pointed out that his VLARs occurred on August 11th, before the fixes were put into place. so it seemed like a false alarm, but then you mentioned having VLARs on your nVidia GPU, so now i'm wondering again...

to be clear, i'm not having problems with this phenomenon myself.

ID: 1273789 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1273795 - Posted: 21 Aug 2012, 23:43:47 UTC - in response to Message 1273789.  

For anyone whom I might have confused, I have received
zero vlars for my Nvidia gpu since Eric's last fix on the
server bug. I will and do transfer some to my gpu to keep
it busy if I run out of regular work units and cannot get
any from the servers.
My apologies If I gave any other impression.
ID: 1273795 · Report as offensive
Profile Akio
Avatar

Send message
Joined: 18 May 11
Posts: 375
Credit: 32,129,242
RAC: 0
United States
Message 1273796 - Posted: 21 Aug 2012, 23:56:16 UTC

8/21/2012 7:50:44 PM | SETI@home | Started download of astropulse_6.04_opencl_nvidia_AUTHORS
8/21/2012 7:50:45 PM | SETI@home | Finished download of astropulse_6.04_opencl_nvidia_AUTHORS
8/21/2012 7:50:45 PM | SETI@home | Started download of astropulse_6.04_opencl_nvidia_COPYING
8/21/2012 7:50:48 PM | SETI@home | Finished download of astropulse_6.04_opencl_nvidia_COPYING
8/21/2012 7:50:48 PM | SETI@home | Started download of astropulse_6.04_opencl_nvidia_COPYRIGHT
8/21/2012 7:50:50 PM | SETI@home | Finished download of astropulse_6.04_opencl_nvidia_COPYRIGHT
8/21/2012 7:50:50 PM | SETI@home | Started download of astropulse_6.04_windows_intelx86__opencl_nvidia_100.pdb
8/21/2012 7:53:13 PM | SETI@home | Temporarily failed download of astropulse_6.04_windows_intelx86__opencl_nvidia_100.pdb: transient HTTP error
8/21/2012 7:53:13 PM | SETI@home | Backing off 3 min 33 sec on download of astropulse_6.04_windows_intelx86__opencl_nvidia_100.pdb
8/21/2012 7:53:13 PM | SETI@home | Started download of libfftw3f-3.dll


Ok, I'm stumped. What's with the nvidia AUTHORS/COPYING/COPYRIGHT, etc stuff? Also: What the heck is the last line there? (the .dll)
ID: 1273796 · Report as offensive
Profile Fred E.
Volunteer tester

Send message
Joined: 22 Jul 99
Posts: 768
Credit: 24,140,697
RAC: 0
United States
Message 1273805 - Posted: 22 Aug 2012, 0:14:14 UTC
Last modified: 22 Aug 2012, 0:14:58 UTC

8/21/2012 7:50:44 PM | SETI@home | Started download of astropulse_6.04_opencl_nvidia_AUTHORS
8/21/2012 7:50:45 PM | SETI@home | Finished download of astropulse_6.04_opencl_nvidia_AUTHORS
8/21/2012 7:50:45 PM | SETI@home | Started download of astropulse_6.04_opencl_nvidia_COPYING
8/21/2012 7:50:48 PM | SETI@home | Finished download of astropulse_6.04_opencl_nvidia_COPYING
8/21/2012 7:50:48 PM | SETI@home | Started download of astropulse_6.04_opencl_nvidia_COPYRIGHT
8/21/2012 7:50:50 PM | SETI@home | Finished download of astropulse_6.04_opencl_nvidia_COPYRIGHT
8/21/2012 7:50:50 PM | SETI@home | Started download of astropulse_6.04_windows_intelx86__opencl_nvidia_100.pdb
8/21/2012 7:53:13 PM | SETI@home | Temporarily failed download of astropulse_6.04_windows_intelx86__opencl_nvidia_100.pdb: transient HTTP error
8/21/2012 7:53:13 PM | SETI@home | Backing off 3 min 33 sec on download of astropulse_6.04_windows_intelx86__opencl_nvidia_100.pdb
8/21/2012 7:53:13 PM | SETI@home | Started download of libfftw3f-3.dll

Ok, I'm stumped. What's with the nvidia AUTHORS/COPYING/COPYRIGHT, etc stuff? Also: What the heck is the last line there? (the .dll)


Looks like the new AP-for-Nvidia stock application was released today. You are getting all the files necessary to run it. That's a first-time only event for a new application. You are running stock, so you get it automatically if your prefs are set for "use Nvidia" and AP6.
Another Fred
Support SETI@home when you search the Web with GoodSearch or shop online with GoodShop.
ID: 1273805 · Report as offensive
Profile Akio
Avatar

Send message
Joined: 18 May 11
Posts: 375
Credit: 32,129,242
RAC: 0
United States
Message 1273807 - Posted: 22 Aug 2012, 0:17:23 UTC - in response to Message 1273805.  

Oh cool! Thanks for the help, bud :D
ID: 1273807 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1273823 - Posted: 22 Aug 2012, 1:32:44 UTC - in response to Message 1273805.  

So, how to get this new app for Nvidia OpenCL AstroPulse if you are running current Lunatics stuff on an anonymous platform?

Keith

Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1273823 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1273854 - Posted: 22 Aug 2012, 4:40:32 UTC - in response to Message 1273823.  

So, how to get this new app for Nvidia OpenCL AstroPulse if you are running current Lunatics stuff on an anonymous platform?

Keith

Why would you want to change, you're already running r1316, Stock 6.04 is r1316 too.

Claggy
ID: 1273854 · Report as offensive
Profile Sunny129
Avatar

Send message
Joined: 7 Nov 00
Posts: 190
Credit: 3,163,755
RAC: 0
United States
Message 1273874 - Posted: 22 Aug 2012, 5:29:20 UTC - in response to Message 1273854.  
Last modified: 22 Aug 2012, 6:21:48 UTC

So, how to get this new app for Nvidia OpenCL AstroPulse if you are running current Lunatics stuff on an anonymous platform?

Keith

Why would you want to change, you're already running r1316, Stock 6.04 is r1316 too.

Claggy

i'm not sure i understand what you're saying...the r1316 build didn't appear until 7/6/12, long after the most current Lunatics installer v0.40 was released (which included no OpenCL apps, let alone any apps on the r1316 build). if Keith is running the latest Lunatics installer, then he has an app_info.xml and is running anonymous platform. won't that prevent his host from automatically DLing OpenCL AP tasks for his nVidia GPU, even if APv6 are checked in his web preferences? doesn't this new OpenCL AP app have to be manually placed in the SETI@Home data directory, and isn't an entry required in the app_info.xml file in order to reference this new app and receive work for it?

Keith, see Raistmer's first post in THIS thread for a link to the Lunatics webpage that has the new OpenCL app available for download. that thread i linked you to also have a sample app_info.xml entry for nVidia GPUs in the 2nd post.
ID: 1273874 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1273886 - Posted: 22 Aug 2012, 6:51:02 UTC - in response to Message 1273874.  


He already have the app, one of his tasks:
http://setiathome.berkeley.edu/result.php?resultid=2566706424

AstroPulse v.6
Non-graphics FFTW USE_CONVERSION_OPT
Windows x86 rev 1316, V6 match, by Raistmer with support of Lunatics.kwsn.net team. SSE2


Obviously he manually added it.


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1273886 · Report as offensive
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1273931 - Posted: 22 Aug 2012, 11:27:01 UTC - in response to Message 1273854.  
Last modified: 22 Aug 2012, 11:30:04 UTC

I did not know that. I only saw the News announcement of new Nvidia OpenCL AstroPulse apps to be made available shortly after the ATI ones were sent out as stock apps. Would have been less confusing if someone, somewhere had mentioned that the stock 6.04 app was r1316.

Cheers, Keith
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1273931 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1274109 - Posted: 22 Aug 2012, 20:46:25 UTC
Last modified: 22 Aug 2012, 20:46:39 UTC

The ucb Routers page, along with all the Cricket Graphs are down.

Claggy
ID: 1274109 · Report as offensive
Profile Sutaru Tsureku
Volunteer tester

Send message
Joined: 6 Apr 07
Posts: 7105
Credit: 147,663,825
RAC: 5
Germany
Message 1274140 - Posted: 22 Aug 2012, 22:19:14 UTC
Last modified: 22 Aug 2012, 22:20:40 UTC

I would like to make one message because of the .vlar WUs.

I let run CUDA WUs since the project admins decided to send out a CUDA application.
Immediately I bought only because of SETI@home NVIDIA GPUs. ;-D

The project admins decided not to send .vlar WUs to NVIDIA GPUs (this are SETI@home Enhanced (MultiBeam) WUs with < 0.12 AR).
Because they slow down (other daily work on) the whole machine.

I think maybe because of: The member could think: "Hey, what's up?" - and will never again crunch SETI@home WUs.

The most members of SETI@home (maybe 95 %) let run the stock applications.

The stock 6.08_cuda up to 6.10_cuda_fermi applications calculate very bad/long this .vlar WUs.

Also on pre-Fermi GPUs the calculation is very slow.

I can't remember which application it was, maybe CUDA_V12 or x32f..
On my machine E7600 with GTX260 O/C:
A shorty: 2 minutes (120 seconds)
A normal 0.448x AR WU: 8 minutes and 20 seconds (500 seconds)
A 0.12x AR WU (which will be send): 25 minutes and 8 seconds (1508 seconds)
< 0.12 AR WU (.vlar WU, not send to NVIDIA GPUs): 1 hour and 30 minutes (5400 seconds)

This mean between normal (0.448x) and nearly VLAR WU (0.12x) the time is x ~ 3.

A .vlar WU would mean x ~ 10.8 longer than a normal AR WU. Wasted performance.


Maybe in future the project admins decide to send out .vlar WUs to Fermi and later GPUs - and still not to pre-Fermi GPUs (if this is possible with the server software).
If the upcoming stock S@h v7 CUDA application calculate very well this kind of WUs on Fermi+ GPUs.


* Best regards! :-) * Sutaru Tsureku, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. *
ID: 1274140 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1274146 - Posted: 22 Aug 2012, 22:34:36 UTC - in response to Message 1274109.  
Last modified: 22 Aug 2012, 22:34:45 UTC

The Cricket Graphs are back, but are empty of fresh data,

Claggy
ID: 1274146 · Report as offensive
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 34744
Credit: 261,360,520
RAC: 489
Australia
Message 1274154 - Posted: 22 Aug 2012, 23:19:39 UTC - in response to Message 1274140.  
Last modified: 22 Aug 2012, 23:20:49 UTC

Maybe in future the project admins decide to send out .vlar WUs to Fermi and later GPUs - and still not to pre-Fermi GPUs (if this is possible with the server software).
If the upcoming stock S@h v7 CUDA application calculate very well this kind of WUs on Fermi+ GPUs.

Even though Fermi and later don't seem to have to many problems with them, their increased computation times (3-5 times longer than average) really throw the DCF out a lot, so I can't see the point in allowing them back ATM.

Cheers.
ID: 1274154 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1274164 - Posted: 22 Aug 2012, 23:57:35 UTC - in response to Message 1274140.  

A reply to some points made by Sutaru

I would like to make one message because of the .vlar WUs.

I let run CUDA WUs since the project admins decided to send out a CUDA application.
Immediately I bought only because of SETI@home NVIDIA GPUs. ;-D

The project admins decided not to send .vlar WUs to NVIDIA GPUs (this are SETI@home Enhanced (MultiBeam) WUs with < 0.12 AR).
Because they slow down (other daily work on) the whole machine.

bill- If this is a problem then yes, the offending work units should not be crunched on the gpu.

I think maybe because of: The member could think: "Hey, what's up?" - and will never again crunch SETI@home WUs.

bill - If those people are so easily dissuaded from crunching for SETI
without even investigating as to why this is happening and how to fix the problem then it's probably for the best that they don't do any Boinc project work at all because they will eventually run into problems with any project they choose.

The most members of SETI@home (maybe 95 %) let run the stock applications.

The stock 6.08_cuda up to 6.10_cuda_fermi applications calculate very bad/long this .vlar WUs.

bill - Yes the work units take longer. That is not bad unless pc sluggishness
or errors are a problem. Back in the beginning of the project all work units used to take a long time. The work still needs to be done though.

Also on pre-Fermi GPUs the calculation is very slow.

I can't remember which application it was, maybe CUDA_V12 or x32f..
On my machine E7600 with GTX260 O/C:
A shorty: 2 minutes (120 seconds)
A normal 0.448x AR WU: 8 minutes and 20 seconds (500 seconds)
A 0.12x AR WU (which will be send): 25 minutes and 8 seconds (1508 seconds)
< 0.12 AR WU (.vlar WU, not send to NVIDIA GPUs): 1 hour and 30 minutes (5400 seconds)

This mean between normal (0.448x) and nearly VLAR WU (0.12x) the time is x ~ 3.

A .vlar WU would mean x ~ 10.8 longer than a normal AR WU. Wasted performance.

bill - Can't agree. The vlar will take even longer to work on a cpu than a gpu.
To me crunching a work unit faster is better. If your gpu is not busy doing any other work units, why not let it do vlars if they cause no problems.

Maybe in future the project admins decide to send out .vlar WUs to Fermi and later GPUs - and still not to pre-Fermi GPUs (if this is possible with the server software).

bill - That would be excellent

If the upcoming stock S@h v7 CUDA application calculate very well this kind of WUs on Fermi+ GPUs.

bill - Hopefully. I have no objection to aborting vlars, or any other type of work unit, if they cause problems
on the client computer. Taking longer to run is not a problem in and of itself,
though.

* Best regards! :-) * Sutaru Tsureku, team seti.international founder. * Optimize your PC for higher RAC. * SETI@home needs your help. *

ID: 1274164 · Report as offensive
.clair.

Send message
Joined: 4 Nov 04
Posts: 1300
Credit: 55,390,408
RAC: 69
United Kingdom
Message 1274167 - Posted: 23 Aug 2012, 0:06:46 UTC

I accept that VLAR wu do take longer than `normal`
Though the longest i have seen one take on a 7970 is 25 minits (after i got it set up right)
I am willing to crunch nothing but VLAR if it was possible to do it.
Though i have not seen any heer since the latest problem was fixed.
ID: 1274167 · Report as offensive
bill

Send message
Joined: 16 Jun 99
Posts: 861
Credit: 29,352,955
RAC: 0
United States
Message 1274170 - Posted: 23 Aug 2012, 0:14:21 UTC - in response to Message 1274154.  


Even though Fermi and later don't seem to have to many problems with them, their increased computation times (3-5 times longer than average) really throw the DCF out a lot, so I can't see the point in allowing them back ATM.

Cheers.


I don't think anyone, including myself, was advocating allowing them "back in"
at the moment. It would be nice if they could be allowed by choice, like the AP's.

As for screwing up the DCF to the point where it causes problems, that could probably be gotten around with a bit of programming. OTOH, if it remains that
vlars never get sent to Nvidia gpu's, I can live with that too, they'll get done eventually.

ID: 1274170 · Report as offensive
Profile Link
Avatar

Send message
Joined: 18 Sep 03
Posts: 834
Credit: 1,807,369
RAC: 0
Germany
Message 1274259 - Posted: 23 Aug 2012, 6:56:40 UTC - in response to Message 1274164.  

A .vlar WU would mean x ~ 10.8 longer than a normal AR WU. Wasted performance.

bill - Can't agree. The vlar will take even longer to work on a cpu than a gpu.
To me crunching a work unit faster is better. If your gpu is not busy doing any other work units, why not let it do vlars if they cause no problems.

Sure, a GPU doing a VLAR might be better than an idle GPU (depending from your point of view), but if for a CPU it pretty much doesn't matter, if it's cruching a VLAR or a normal-AR task while the GPU is x times slower on VLAR, you are waisting performance if you send VLARs to a GPU. It's better for the project that a GPU do few 0.44 WUs instead of 1 VLAR. And it's up to the user to set his cache high enough that his card never idle (OK, not easy with the current load on servers, but that's another thing).
ID: 1274259 · Report as offensive
Previous · 1 . . . 9 · 10 · 11 · 12 · 13 · 14 · 15 . . . 20 · Next

Message boards : Number crunching : Panic Mode On (76) Server Problems?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.