I am puzzled... |
![]() |
| log in |
Message boards : Number crunching : I am puzzled...
| Author | Message |
|---|---|
|
I am puzzled by the behavoir of my Q6600 system overnight. Lastnight when I last looked all the ATI WU's had a remaining time of 7 hours or so. This morning they are all 90+ hours. The CPU WU's are showing 1-3 hours remaining which looks to be the same as lastnight. Everything in the log looks ok to me and work is being done in expected amount of time. Just not sure why the DCF for the GPU went whacko. | |
| ID: 1202091 · | |
|
You had one unit running over 16000 seconds. | |
| ID: 1202093 · | |
You had one unit running over 16000 seconds. And several VLAR running over 25,000 seconds - like task 2334375975. | |
| ID: 1202094 · | |
You had one unit running over 16000 seconds. Sounds like the ATI low GPU use Bug, Claggy | |
| ID: 1202098 · | |
|
Ok, so they should come back down over time? The system has only been up and crunching for 38 hours since it got overhualed. I know the 2 ATI 6670 GPUs aren't the fastest crunchers but 90 hours is alittle out of the ballpark. | |
| ID: 1202099 · | |
|
How do I check for this bug and is it correctable? | |
| ID: 1202101 · | |
|
each time a gpu one is ticking you can lower 30sec to 1 mins each, but when you get a cpu one ticking, specially big long ones, you can jump 8 to 12 mins (on the estimations for each WU in the queue) | |
| ID: 1202103 · | |
|
The Application details for that host are showing a pretty insane APR for the ATI app. | |
| ID: 1202107 · | |
How do I check for this bug and is it correctable? Not much you can do. First thing is to restart the computer. Reduce instances to 3 better would be 2 on your card. Each time you notice a slow down suspend GPU for 30 seconds and resume again. ____________ | |
| ID: 1202117 · | |
|
I have the same problem with ATI GPU. | |
| ID: 1202210 · | |
|
Actually you want it in a different format than that. | |
| ID: 1202230 · | |
|
Actually you can use both formats to give this value. | |
| ID: 1202235 · | |
Actually you want it in a different format than that. According to Joe Segur, the parser that handles that value can cope with most reasonable numeric and scientific formats. You do, however, need to keep your wits about you when dealing with numbers as large as that. Not many of us (except the bankers) regularly deal with a couple of hundred billion - of anything, even flops. If you're used to exponential notation, 200e9 is indeed easier to get right than 200000000000. But then, why not 2e11? For this purpose, the minute fractional bits after the decimal point really don't matter. There is a nice example of this in today's New Scientist magazine. The website is subscription-only, so I'll have to quote instead of link: BANKS are under orders to tighten their accounting, but we fear HSBC may be paying too much attention to the small things rather than the gigapounds. Andrew Beggs wanted to know the distance from his home to the bank's nearest branch. He consulted the bank's website, which came up with the answer 0.9904670356841079 miles (1.5940021810076005 kilometres). This is supposedly accurate to around 10-13 metres, much less than the radius of a hydrogen atom. Moisture condensing on the branch's door would bring it several significant digits closer. | |
| ID: 1202237 · | |
|
Testing superscripts for DA. BANKS are under orders to tighten their accounting, but we fear HSBC may be paying too much attention to the small things rather than the gigapounds. Andrew Beggs wanted to know the distance from his home to the bank's nearest branch. He consulted the bank's website, which came up with the answer 0.9904670356841079 miles (1.5940021810076005 kilometres). This is supposedly accurate to around 10-13 metres, much less than the radius of a hydrogen atom. Moisture condensing on the branch's door would bring it several significant digits closer. Superscripts seem OK, but don't click the Use BBCode tags to format your text link while editing - you'll lose your changes. I'll tell him. | |
| ID: 1202321 · | |
|
I can read many articles in New Scientist without even registering. I am a registered non paying guest in Nature magazine and can read all its editorials and even some articles, same in Nature Communications. | |
| ID: 1202325 · | |
I can read many articles in New Scientist without even registering. I am a registered non paying guest in Nature magazine and can read all its editorials and even some articles, same in Nature Communications. I tried the section link on the front page for that story (Feedback: Highway exit with no return) before posting, but it told me I had to log in first. Not everybody here will want to create even a guest registration just to read one silly story. | |
| ID: 1202327 · | |
|
I have just read and printed a short article "Oceans acidifying at unprecedented speed". I grab what I can without registering from New Scientist, but Nature is a more reliable source. | |
| ID: 1202328 · | |
The Application details for that host are showing a pretty insane APR for the ATI app. So it was indicating 1.4 TeraFlops and has since grown to 2.2 TeraFlops. I'm no ATI specialist but need not be to say that's ridiculously high. The cause is result_overflow tasks which haven't been marked as runtime_outlier by a sah_validate process, because the BOINC core client API all too often truncates the stderr. Task 2334954485 is a current example though due to be purged soon. It has Run time 39.31, CPU time 30.91, Credit 0.33, but only the first 8 lines of stderr.txt captured and the result_overflow line would be much later. The Validator has always looked for that result_overflow keyword so the assimilated result is marked, and now it is also used to tell BOINC not to include the runtime in its averages. In January, Matt Arsenault (Milkyway) noted on the boinc_dev list that about 2% of reports had truncated stderr sections, and provided a patch to fix the problem. Dr. Anderson apparently wasn't convinced it needs fixing, and even if he had implemented the patch it would only be effective for those alpha testing BOINC 7.0.x clients. As a related note, the runtime_outlier detection for Astropulse v6 will be based on information in the uploaded result file rather than the stderr which is supposed to go back in requests to the Scheduler. Perhaps it would be a good idea to make a similar change for SETI@home v7. Joe | |
| ID: 1202411 · | |
|
I dropped to 2 task per GPU and the times have dropped from 90 hours to around 30. This is still 5-6 times higher than I have seen most task finish in. I'll just let it keep crunching and watch it. | |
| ID: 1202422 · | |
Superscripts seem OK, but don't click the Use BBCode tags to format your text link while editing - you'll lose your changes. I'll tell him. It's safe to use again now, though I'm not sure I remember all the page headers being in the formatting pop-up before. | |
| ID: 1202477 · | |
Message boards : Number crunching : I am puzzled...
| Copyright © 2013 University of California |