Don't know where it should go? Stick it here!

Message boards : Number crunching : Don't know where it should go? Stick it here!
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 118 · 119 · 120 · 121 · 122 · 123 · 124 . . . 147 · Next

AuthorMessage
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2047474 - Posted: 28 Apr 2020, 23:20:46 UTC

Development news from the Khronos Group that they are getting close to finalizing the OpenCL 3.0 standard.

That may make things interesting down the line since the new standard will allow Nvidia to support the standard on their devices. Nvidia has been stuck on OpenCL 1.2 because the OpenCL 2.0 standard didn't allow them any leeway in supporting their list of features. The new 3.0 standard will allow Nvidia to ignore features that their cards don't accept.

https://www.phoronix.com/scan.php?page=article&item=opencl-30-spec&num=1
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2047474 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13932
Credit: 208,696,464
RAC: 304
Australia
Message 2047502 - Posted: 29 Apr 2020, 6:46:54 UTC - in response to Message 2043765.  

I guess so. That is what actually happens with a "finish file present too long" error. They just dumbed down the description instead of tying it directly to the error description. This is DA's description of the error and the fix in the pull request.

When an app finishes, it writes a "finish file",
which ensures the client that the app really finished.

If the app process is still there N seconds after the finish file appears,
the client assumes that something went wrong, and it aborts the job.
Yeah, i saw his description.
And what he describes (which is the "Finish file present too long" problem) is different to "If output file is missing on startup, flag task as error."
The second one is "if a file is missing, then and error has occurred" the first one "if a process is still going a certain time after the file is written, the Task is aborted". To me they are two different issues- one is a missing file, the other the file is there. One is the result of an error, the other results in an error.


Will just have to see if any "Finish file present too long" errors show up or not on systems with the new Manager.
Looks like it is still an issue under certain conditions.

<core_client_version>7.16.6</core_client_version>
<![CDATA[
<stderr_txt>
command: rosetta_4.16_x86_64-apple-darwin -run:protocol jd2_scripting -parser:protocol predictor_v11_boinc--fuse--il1r_design_boinc_v1_mod.xml @flags_il6r3 -in:file:silent Mini_Protein_binds_IL6R_COVID-19_1bqu_v2_1_SAVE_ALL_OUT_IGNORE_THE_REST_7hi8gw7a.silent -in:file:silent_struct_type binary -silent_gz -mute all -write_failures false -silent_read_through_errors true -out:file:silent_struct_type binary -out:file:silent default.out -in:file:boinc_wu_zip Mini_Protein_binds_IL6R_COVID-19_1bqu_v2_1_SAVE_ALL_OUT_IGNORE_THE_REST_7hi8gw7a.zip @Mini_Protein_binds_IL6R_COVID-19_1bqu_v2_1_SAVE_ALL_OUT_IGNORE_THE_REST_7hi8gw7a.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 2312473
Starting watchdog...
Watchdog active.
======================================================
DONE ::  1064 starting structures    28787 cpu seconds
This process generated   1064 decoys from    1064 attempts
======================================================
BOINC :: WS_max 3.32947e+09

BOINC :: Watchdog shutting down...
01:29:09 (24997): called boinc_finish(0)

</stderr_txt>
<message>
Process still present 5 min after writing finish file; aborting</message>
]]>


CPU type Intel(R) Xeon(R) CPU X5690 @ 3.47GHz [x86 Family 6 Model 44 Stepping 2]
24 cores/threads.
Operating System Darwin 18.7.0
BOINC version 7.16.6
Memory 98304 MB
Storage is an SSD.

When a whole bunch of Tasks finish at the same time, the error occurs on from 1-4 of them (which apparently is an improvement).


Rosetta are taking Raistmer's advice & re-working the application so the database isn't written to & cleaned out of each slot, which should hep resolve the problem anyway.
Grant
Darwin NT
ID: 2047502 · Report as offensive     Reply Quote
Sirius B Project Donor
Volunteer tester
Avatar

Send message
Joined: 26 Dec 00
Posts: 24929
Credit: 3,081,182
RAC: 7
Ireland
Message 2048113 - Posted: 4 May 2020, 15:46:18 UTC

1st time I've seen this error.

04/05/2020 02:08:45 | World Community Grid | [error] File mip1.MIP1_00294466.1 has wrong size: expected 4648936, got 4648926
04/05/2020 02:08:45 | World Community Grid | [error] Checksum or signature error for mip1.MIP1_00294466.1
04/05/2020 02:10:35 | World Community Grid | Started download of mip1.MIP1_00294466.1
04/05/2020 02:10:38 | World Community Grid | Finished download of mip1.MIP1_00294466.1
04/05/2020 02:10:38 | World Community Grid | [error] File mip1.MIP1_00294466.1 has wrong size: expected 4648936, got 4648926
04/05/2020 02:10:38 | World Community Grid | [error] Checksum or signature error for mip1.MIP1_00294466.1
ID: 2048113 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3462
Credit: 715,342
RAC: 4
United States
Message 2048163 - Posted: 5 May 2020, 3:45:46 UTC - in response to Message 2048113.  

1st time I've seen this error.

04/05/2020 02:08:45 | World Community Grid | [error] File mip1.MIP1_00294466.1 has wrong size: expected 4648936, got 4648926
04/05/2020 02:08:45 | World Community Grid | [error] Checksum or signature error for mip1.MIP1_00294466.1
04/05/2020 02:10:35 | World Community Grid | Started download of mip1.MIP1_00294466.1
04/05/2020 02:10:38 | World Community Grid | Finished download of mip1.MIP1_00294466.1
04/05/2020 02:10:38 | World Community Grid | [error] File mip1.MIP1_00294466.1 has wrong size: expected 4648936, got 4648926
04/05/2020 02:10:38 | World Community Grid | [error] Checksum or signature error for mip1.MIP1_00294466.1

Missing 10 bytes. Have seen this happen recently at Rosetta when they had issues with a over reactivate firewall. It's not your www connection or computers fault. Hope you have reported it to WCG.
ID: 2048163 · Report as offensive     Reply Quote
EdwardPF
Volunteer tester

Send message
Joined: 26 Jul 99
Posts: 389
Credit: 236,772,605
RAC: 374
United States
Message 2048189 - Posted: 5 May 2020, 14:02:39 UTC

my computer # 5947710 just crashed and it looks bad ...

since it's a remote system from where I am I will not be able to get there for some time ...

If someone can "re-cycle" the 180 WUs it has ... that would be fine with me ... keep things moving

Ed F
ID: 2048189 · Report as offensive     Reply Quote
Ian&Steve C.
Avatar

Send message
Joined: 28 Sep 99
Posts: 4267
Credit: 1,282,604,591
RAC: 6,640
United States
Message 2048190 - Posted: 5 May 2020, 14:11:09 UTC - in response to Message 2048189.  

unfortunately no one here can do that. and they project folks are unlikely to single out a system to manually move stuff around. if you don't get the system back up before their June deadline, they will expire at that time. So nothing will happen until then.
Seti@Home classic workunits: 29,492 CPU time: 134,419 hours

ID: 2048190 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 29 May 99
Posts: 5414
Credit: 85,281,665
RAC: 126
Finland
Message 2048214 - Posted: 5 May 2020, 17:50:05 UTC

Einstein@home web site is down for me. Tasks are downloading and uploading just fine. Anybody else seeing this? https://downforeveryoneorjustme.com/einsteinathome.org says it is fine but not for me.
ID: 2048214 · Report as offensive     Reply Quote
Profile Siran d'Vel'nahr
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 7381
Credit: 44,181,323
RAC: 238
United States
Message 2048215 - Posted: 5 May 2020, 17:50:23 UTC

Greetings,

Holy crap! My RAC is below 10K! :( Haven't seen it that low in years.

Have a great day! :)

Siran
CAPT Siran d'Vel'nahr - L L & P _\\//
Winders 11 OS? "What a piece of junk!" - L. Skywalker
"Logic is the cement of our civilization with which we ascend from chaos using reason as our guide." - T'Plana-hath
ID: 2048215 · Report as offensive     Reply Quote
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11451
Credit: 29,581,041
RAC: 66
United States
Message 2048218 - Posted: 5 May 2020, 18:08:58 UTC - in response to Message 2048214.  

Einstein@home web site is down for me. Tasks are downloading and uploading just fine. Anybody else seeing this? https://downforeveryoneorjustme.com/einsteinathome.org says it is fine but not for me.

E@H message boards are working for me.
ID: 2048218 · Report as offensive     Reply Quote
Profile Keith Myers Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 2048227 - Posted: 5 May 2020, 19:32:26 UTC - in response to Message 2048214.  

Einstein@home web site is down for me. Tasks are downloading and uploading just fine. Anybody else seeing this? https://downforeveryoneorjustme.com/einsteinathome.org says it is fine but not for me.

Flush your arp and dns caches and then retry.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 2048227 · Report as offensive     Reply Quote
Harri Liljeroos
Avatar

Send message
Joined: 29 May 99
Posts: 5414
Credit: 85,281,665
RAC: 126
Finland
Message 2048236 - Posted: 5 May 2020, 20:22:26 UTC - in response to Message 2048227.  

Einstein@home web site is down for me. Tasks are downloading and uploading just fine. Anybody else seeing this? https://downforeveryoneorjustme.com/einsteinathome.org says it is fine but not for me.

Flush your arp and dns caches and then retry.

Did that with no help. Then I rebooted the computer but no success. Then rebooted the router but without success. Then I decided to re-select url from my bookmarks and suddenly it worked. I usually keep my favorite sites open on tabs and when browser starts they will be opened automatically. Today einstein didn't open until now.
ID: 2048236 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2048257 - Posted: 6 May 2020, 0:32:52 UTC

Is this the oldest WU that hasn't timed out yet ?

https://setiathome.berkeley.edu/workunit.php?wuid=3818342749
ID: 2048257 · Report as offensive     Reply Quote
Profile Wiggo
Avatar

Send message
Joined: 24 Jan 00
Posts: 38031
Credit: 261,360,520
RAC: 489
Australia
Message 2048259 - Posted: 6 May 2020, 0:36:49 UTC - in response to Message 2048257.  

Is this the oldest WU that hasn't timed out yet ?

https://setiathome.berkeley.edu/workunit.php?wuid=3818342749
No. I have 2 that will expire on the 22nd May and 1 for the 23rd. ;-)

Cheers.
ID: 2048259 · Report as offensive     Reply Quote
Scrooge McDuck
Avatar

Send message
Joined: 26 Nov 99
Posts: 1625
Credit: 1,674,173
RAC: 54
Germany
Message 2048395 - Posted: 7 May 2020, 10:47:47 UTC - in response to Message 2048259.  

No. I have 2 that will expire on the 22nd May and 1 for the 23rd. ;-)

Keith is looking for the oldest (long living) WU. Creation date of https://setiathome.berkeley.edu/workunit.php?wuid=3818342749 is 1 Jan 2020, 19:33:57 UTC
ID: 2048395 · Report as offensive     Reply Quote
Profile Keith T.
Volunteer tester
Avatar

Send message
Joined: 23 Aug 99
Posts: 962
Credit: 537,293
RAC: 9
United Kingdom
Message 2048405 - Posted: 7 May 2020, 14:22:16 UTC - in response to Message 2048395.  

Thanks, there are at least 2 others with the same creation date

https://setiathome.berkeley.edu/results.php?hostid=8837426&offset=0&show_names=1&state=0&appid=

They're all Validated, but will be around until 15 May, unless https://setiathome.berkeley.edu/results.php?hostid=7989306 comes back to return or abort them.

These are all 10 week deadline tasks, that got sent to 2 unreliable hosts, so they are probably some of the longest.
ID: 2048405 · Report as offensive     Reply Quote
Grant (SSSF)
Volunteer tester

Send message
Joined: 19 Aug 99
Posts: 13932
Credit: 208,696,464
RAC: 304
Australia
Message 2048711 - Posted: 9 May 2020, 21:58:58 UTC
Last modified: 9 May 2020, 22:09:23 UTC

For those of us old enough to remember dialup modems, here are some photos from the the early days of the original modem company, Hayes.


And for those interested in the handshake, here is a description of what it was all about (with a very good, and large, image of the process).
Grant
Darwin NT
ID: 2048711 · Report as offensive     Reply Quote
Speedy
Volunteer tester
Avatar

Send message
Joined: 26 Jun 04
Posts: 1646
Credit: 12,921,799
RAC: 89
New Zealand
Message 2048800 - Posted: 10 May 2020, 23:28:22 UTC

Out of curiosity and interest what projects are people putting their GPU's to use on?
ID: 2048800 · Report as offensive     Reply Quote
Profile betreger Project Donor
Avatar

Send message
Joined: 29 Jun 99
Posts: 11451
Credit: 29,581,041
RAC: 66
United States
Message 2048801 - Posted: 10 May 2020, 23:48:42 UTC - in response to Message 2048800.  

GTX660, GTX1060 X 2, GTX1660 super = E@H
ID: 2048801 · Report as offensive     Reply Quote
Profile tullio
Volunteer tester

Send message
Joined: 9 Apr 04
Posts: 8797
Credit: 2,930,782
RAC: 1
Italy
Message 2048802 - Posted: 11 May 2020, 0:01:36 UTC - in response to Message 2048800.  

My GTX 1060 is used in GPUGRID and Einstein@home but there it fails on the Gravitational Waves 02 tasks because its 3 GB of video RAM are not enough.
Tullio
ID: 2048802 · Report as offensive     Reply Quote
Profile Tom M
Volunteer tester

Send message
Joined: 28 Nov 02
Posts: 5126
Credit: 276,046,078
RAC: 462
Message 2048846 - Posted: 11 May 2020, 13:15:09 UTC - in response to Message 2048800.  

Out of curiosity and interest what projects are people putting their GPU's to use on?


I just looked at your system. It should be able to run nearly anything out there.

You could easily run E@H's Gravity Waves gpu or Pulsar Search #1. PrimeGrid. MilkyWay. GpuGrid (Nvidia only).

My Nvidia gpus are running Gravity Wave. I have some RX 570's that are running Pulsar Search#1.
While I usually run PrimeGrid on a cpu thread the project has gpu tasks too.
I kept getting Inconclusive results on MilkyWay so it may be a Amd/Ati only.

Tom M
A proud member of the OFA (Old Farts Association).
ID: 2048846 · Report as offensive     Reply Quote
Previous · 1 . . . 118 · 119 · 120 · 121 · 122 · 123 · 124 . . . 147 · Next

Message boards : Number crunching : Don't know where it should go? Stick it here!


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.