留言板 :
Number crunching :
Bug "inside long FFA"?
留言板合理
| 作者 | 消息 |
|---|---|
Richard Haselgrove ![]() 发送消息 已加入:4 Jul 99 贴子:14152 积分:200,643,578 近期平均积分:874
|
There's a fix for that being pushed through testing, Test results for these instances added to the test matrix. |
|
Josef W. Segur 发送消息 已加入:30 Oct 99 贴子:4504 积分:1,414,761 近期平均积分:0
|
I have two AP tasks that have been looping with repeated "exited with zero status but no 'finished' file." That "ERROR: some exception inside long FFA..." is what shows when the app runs out of memory while trying to handle thousands of repetitive pulses above threshold. There's a fix for that being pushed through testing, meanwhile reducing how many signals have to be handled simultaneously by reducing the -ffa_block and -ffa_block_fetch is the way to go. Joe |
JohnDK ![]() 发送消息 已加入:28 May 00 贴子:1200 积分:451,243,443 近期平均积分:1,127
|
Had a WU like that last night, don't know how many restarts it had, but I decided to abort it... http://setiathome.berkeley.edu/workunit.php?wuid=1627971982 |
Richard Haselgrove ![]() 发送消息 已加入:4 Jul 99 贴子:14152 积分:200,643,578 近期平均积分:874
|
That's my point exactly, Richard. I don't have a debugger, but I do have a GTX 670 - close match - and I'm starting to run tests under bench conditions, starting with default cmdline parameters. One thing to watch out for will be abnormally high memory usage on these tasks. Edit - seemed to settle at 244 MB usage after a couple of minutes. High, but not as high as we've seen under bug conditions. Edit2 - memory consumption over a gigabyte with the same command line, single instance. This does look like the bug which was already under investigation - will run with the bugfix version already under test next. So, Mike's suggestion might be a temporary palliative while we wait for the bugfix to complete acceptance testing - or leave the commandline as it is if you're prepared to risk the same thing happening again, and supply additional test cases for the testing pool. |
Mike 发送消息 已加入:17 Feb 01 贴子:32233 积分:79,922,639 近期平均积分:80
|
That's my point exactly, Richard. So i dont have to worry any longer. With each crime and every kindness we birth our future. |
Oddbjornik ![]() 发送消息 已加入:15 May 99 贴子:220 积分:349,610,548 近期平均积分:1,728
|
That's my point exactly, Richard. I'm a software engineer myself, and when I see a reoccurring error like this, I know that a developer with a debugger can normally find out exactly what goes wrong. |
Mike 发送消息 已加入:17 Feb 01 贴子:32233 积分:79,922,639 近期平均积分:80
|
Reduce unroll to 12 and ffa_block to 12288 6144. Do you want to tell me how the app works ? With each crime and every kindness we birth our future. |
Richard Haselgrove ![]() 发送消息 已加入:4 Jul 99 贴子:14152 积分:200,643,578 近期平均积分:874
|
Reduce unroll to 12 and ffa_block to 12288 6144. With 1061 valid AP v7 tasks so far, I think he knows how to drive the application. The question was why these two (and only these two, as I understand him) should have behaved differently. |
Mike 发送消息 已加入:17 Feb 01 贴子:32233 积分:79,922,639 近期平均积分:80
|
Reduce unroll to 12 and ffa_block to 12288 6144. See if this helps. With each crime and every kindness we birth our future. |
Richard Haselgrove ![]() 发送消息 已加入:4 Jul 99 贴子:14152 积分:200,643,578 近期平均积分:874
|
|
Oddbjornik ![]() 发送消息 已加入:15 May 99 贴子:220 积分:349,610,548 近期平均积分:1,728
|
I have two AP tasks that have been looping with repeated "exited with zero status but no 'finished' file." Looking inside the result reports, they both say this: ERROR: some exception inside long FFA, probably video-driver restart, restarting app... The command line options are as follows: -use_sleep -hp -unroll 16 -oclFFT_plan 256 16 256 -ffa_block 16384 -ffa_block_fetch 8192 -tune 1 64 8 1 -tune 2 64 8 1 Since the GTX680 card that these tasks have run on behaves quite well with all other tasks, and still repeatedly denies to finish just these two, I think I may have stumbled upon a reproduceable error. Anyone want to investigate? Any additional info needed? Edit: Lunatics 0.43, running three tasks on the card. |
©2020 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.