Error 1 vs. error 12 on Seti@home Enhanced CUDA tasks.


Message boards : Number crunching : Error 1 vs. error 12 on Seti@home Enhanced CUDA tasks.

musicplayer
Joined: 17 May 10
Posts: 1431
Credit: 687,186
RAC: 5
Message 1367581 - Posted: 13 May 2013, 17:28:02 UTC

After attaching to Seti@home with BOINC Manager 7.0.64 on Windows 7 Ultimate (32-bit), 18 CUDA tasks finished with an error before I noticed that I had once again forgotten to set the CUDA_GRID_SIZE_COMPAT environment variable and therefore had to restart BOINC Manager once again.

This environment variable is still needed for my GTX 680 4 GB cards, I assume.

The nVidia graphics driver version is now 314.22, by the way.

If I am not wrong, forgetting to set this environment variable leads to error 12.

Why did many of these tasks result in error 1 instead? Is the card (or really graphics driver) working and functional when getting this error, or is this perhaps a computational error which I may receive at other times when the environment variable has already been set?

Thanks for any explanation!

Josef W. Segur
Project donor
Volunteer developer
Volunteer tester
Joined: 30 Oct 99
Posts: 4221
Credit: 1,040,523
RAC: 418
United States
Message 1367585 - Posted: 13 May 2013, 17:42:32 UTC - in response to Message 1367581.

...I noticed that I had once again forgotten to set the CUDA_GRID_SIZE_COMPAT environment variable and therefore had to restart BOINC Manager once again.

This environment variable is still needed for my GTX 680 4 GB cards, I assume.

The nVidia graphics driver version is now 314.22, by the way.

If I am not wrong, forgetting to set this environment variable leads to error 12.
...

No, error -12 is a data-dependent error indication meaning "Unsupported function". When NVIDIA ported the code to CUDA, it was judged that data leading to two triplets in one array or more than 10 bins above threshold in a triplet array would be so rare that allocating additional Video RAM to allow for those cases wasn't sensible.

Yes, the GTX 680 is a Kepler card and needs the environment variable to successfully run the NVIDIA ported CUDA applications.
Joe
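
To confirm that the variable is actually visible before starting BOINC, a minimal Python sketch along these lines may help. It only inspects the environment of the process it runs in. The helper name `grid_compat_is_set` and the expected value `"1"` are assumptions made for illustration and are not stated in this thread.

```python
import os

VAR_NAME = "CUDA_GRID_SIZE_COMPAT"


def grid_compat_is_set(env=None, expected="1"):
    """Return True if the variable is present with the expected value.

    expected="1" is an assumption; adjust it if your setup uses another value.
    """
    if env is None:
        env = os.environ  # environment of the current process only
    return env.get(VAR_NAME, "").strip() == expected


if __name__ == "__main__":
    if grid_compat_is_set():
        print(VAR_NAME, "is set for this process")
    else:
        print(VAR_NAME, "is NOT set; set it, then restart BOINC")
```

A process inherits its environment when it starts, so a variable set afterwards is not seen by an already running BOINC client. That is why a restart is needed after setting it.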

musicplayer
Joined: 17 May 10
Posts: 1431
Credit: 687,186
RAC: 5
Message 1367589 - Posted: 13 May 2013, 18:00:48 UTC - in response to Message 1367585.
Last modified: 13 May 2013, 18:01:59 UTC

Thanks Joe.

Have a look at http://setiathome.berkeley.edu/workunit.php?wuid=1243943382 for example.

My errored task there (Task 2994419633) has a slightly longer run time than some of the others.

Error 1 is apparently reported there, but possibly the driver was already functional (perhaps even before the environment variable had been set).

In fact I think I should be able to tell when the driver is not working correctly.

Or am I perhaps still misreading it once again?

William
Project donor
Volunteer tester
Joined: 14 Feb 13
Posts: 1580
Credit: 9,461,062
RAC: 6,564
Message 1367778 - Posted: 14 May 2013, 9:03:19 UTC

*spock* [For several reasons]

Looks like slight differences in startup time to me - CPU time is mostly higher as well.
A Kepler/stock setup will only work with the variable set.
Tasks that error out straight away almost always point to some problem with the setup (e.g. wrong app, bad driver, not enough memory...)

____________
A person who won't read has no advantage over one who can't read. (Mark Twain)

Wiggo
Joined: 24 Jan 00
Posts: 6703
Credit: 92,327,873
RAC: 73,934
Australia
Message 1367796 - Posted: 14 May 2013, 9:42:02 UTC - in response to Message 1367589.

Thanks Joe.

Have a look at http://setiathome.berkeley.edu/workunit.php?wuid=1243943382 for example.

My errored task there (Task 2994419633) has a slightly longer run time than some of the others.

Error 1 is apparently reported there, but possibly the driver was already functional (perhaps even before the environment variable had been set).

In fact I think I should be able to tell when the driver is not working correctly.

Or am I perhaps still misreading it once again?

The error shown in the Stderr output for that workunit is "CUFFT error in file 'd:/Projects/SETI/seti_boinc/client/cuda/cudaAcc_fft.cu' in line 62.", which shows that the environment variable had not been set.

Did you restart the PC after setting the environment variable?

Cheers.

musicplayer
Joined: 17 May 10
Posts: 1431
Credit: 687,186
RAC: 5
Message 1367854 - Posted: 14 May 2013, 13:37:11 UTC
Last modified: 14 May 2013, 13:39:31 UTC

Back again after a closer check: error 1 is definitely the error reported when the CUDA_GRID_SIZE_COMPAT environment variable has not been set, while error 12 is the "cudaAcc_find_triplets erroneously found a triplet twice in find_triplets_kernel" error.

So I have now understood and solved this problem.

The only thing that actually confused me was the message in the result output, for both error 1 and error 12, indicating that both GTX 680 cards were working.

Apparently they were not working as they should where error 1 appears, because the environment variable had not been set.
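
The distinction worked out in this thread can be sketched as a small Python helper that sorts a task's Stderr text into the two cases. The function name `classify_stderr` and the returned labels are hypothetical. The matching relies only on the two messages quoted in this thread, so other causes producing similar text would be misclassified.

```python
def classify_stderr(stderr_text):
    """Give a rough diagnosis based on the two messages quoted in this thread."""
    # FFT failure in cudaAcc_fft.cu: reported here as the sign that
    # CUDA_GRID_SIZE_COMPAT was not set. Matching on "FFT error in file"
    # tolerates a missing or truncated prefix before "FFT".
    if "FFT error in file" in stderr_text and "cudaAcc_fft.cu" in stderr_text:
        return "setup: CUDA_GRID_SIZE_COMPAT probably not set (error 1)"
    # Triplet message: the data-dependent "Unsupported function" case.
    if "erroneously found a triplet twice" in stderr_text:
        return "data-dependent: unsupported function (error -12)"
    return "unknown"


if __name__ == "__main__":
    sample = ("cudaAcc_find_triplets erroneously found a triplet twice "
              "in find_triplets_kernel")
    print(classify_stderr(sample))
```

The first case points to a setup problem that affects every task until it is fixed, while the second depends on the data in a particular task.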


Copyright © 2014 University of California