Computation errors on Seti Enhanced

Profile Clyde C. Phillips, III

Joined: 2 Aug 00
Posts: 1851
Credit: 5,955,047
RAC: 0
United States
Message 322246 - Posted: 1 Jun 2006, 9:32:08 UTC

I saw another one this morning - it stopped after being 96 percent complete. In a couple minutes it called itself 100 percent complete and with a computation error. Maybe it aborted itself automatically. Another computation-error workunit was listed alongside. Total time lost for that core: 8-1/2 hours overnight. I just hope it's not my computer!
ID: 322246 · Report as offensive
arthurm

Joined: 17 May 99
Posts: 2
Credit: 392,066
RAC: 0
United Kingdom
Message 322444 - Posted: 1 Jun 2006, 14:20:30 UTC

I have a workunit which has been showing 97.690% complete for the last 8 hours - "CPU Time" is 24 hours, "To completion" is 30 mins and increasing slowly.

Taskmanager shows it to be consuming 100% CPU, its graphical display shows that something's happening - although the rate of display change is VERY slow compared with other active workunits.

Will it finish or is it stuck in a loop? Others with the workunit haven't returned it so I can't use their results as a guide.

arthurm
ID: 322444 · Report as offensive
billmiranda

Joined: 12 Jan 01
Posts: 4
Credit: 38,742
RAC: 0
United States
Message 322499 - Posted: 1 Jun 2006, 15:47:00 UTC

Still? ANY Ideas? Other than the fact that for some reason, a "_0" (or _1, _2 etc) is being added to each unit... is this why it says that the unit can't be found?

6/1/2006 10:58:19 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
6/1/2006 10:58:19 AM|SETI@home|Reason: Requested by user
6/1/2006 10:58:19 AM|SETI@home|Requesting 51477 seconds of new work, and reporting 2 completed tasks
6/1/2006 10:58:34 AM|SETI@home|Scheduler request succeeded
6/1/2006 10:58:36 AM|SETI@home|Started download of file 20mr99aa.15250.25346.467332.3.241
6/1/2006 10:58:36 AM|SETI@home|Started download of file 28fe99aa.18483.18002.136088.3.240
6/1/2006 10:59:36 AM|SETI@home|Finished download of file 20mr99aa.15250.25346.467332.3.241
6/1/2006 10:59:36 AM|SETI@home|Throughput 6067 bytes/sec
6/1/2006 10:59:36 AM|SETI@home|Finished download of file 28fe99aa.18483.18002.136088.3.240
6/1/2006 10:59:36 AM|SETI@home|Throughput 6140 bytes/sec
6/1/2006 10:59:37 AM||Rescheduling CPU: files downloaded
6/1/2006 10:59:37 AM||Rescheduling CPU: files downloaded
6/1/2006 10:59:37 AM|SETI@home|Started download of file 20mr99aa.15250.22658.286082.3.105
6/1/2006 10:59:37 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:38 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:38 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:39 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:40 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:40 AM|SETI@home|Unrecoverable error for result 20mr99aa.15250.25346.467332.3.241_0 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 10:59:40 AM||Rescheduling CPU: start failed
6/1/2006 10:59:40 AM|SETI@home|Computation for task 20mr99aa.15250.25346.467332.3.241_0 finished
6/1/2006 10:59:40 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:41 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:41 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:42 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:42 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:43 AM|SETI@home|Unrecoverable error for result 28fe99aa.18483.18002.136088.3.240_2 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 10:59:43 AM||Rescheduling CPU: start failed
6/1/2006 10:59:43 AM|SETI@home|Computation for task 28fe99aa.18483.18002.136088.3.240_2 finished
6/1/2006 10:59:55 AM|SETI@home|Finished download of file 20mr99aa.15250.22658.286082.3.105
6/1/2006 10:59:55 AM|SETI@home|Throughput 6003 bytes/sec
6/1/2006 10:59:56 AM||Rescheduling CPU: files downloaded
6/1/2006 10:59:56 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:56 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:57 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:58 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:58 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 10:59:59 AM|SETI@home|Unrecoverable error for result 20mr99aa.15250.22658.286082.3.105_0 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 10:59:59 AM||Rescheduling CPU: start failed
6/1/2006 10:59:59 AM|SETI@home|Computation for task 20mr99aa.15250.22658.286082.3.105_0 finished
6/1/2006 11:08:41 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
6/1/2006 11:08:41 AM|SETI@home|Reason: To report completed tasks
6/1/2006 11:08:41 AM|SETI@home|Requesting 86400 seconds of new work, and reporting 3 completed tasks
6/1/2006 11:09:03 AM||Project communication failed: attempting access to reference site
6/1/2006 11:09:06 AM|SETI@home|Scheduler request failed: couldn't connect to server
6/1/2006 11:09:06 AM|SETI@home|Deferring scheduler requests for 1 minutes and 0 seconds
6/1/2006 11:09:19 AM||Access to reference site failed - check network connection or proxy configuration.
6/1/2006 11:10:06 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
6/1/2006 11:10:06 AM|SETI@home|Reason: To report completed tasks
6/1/2006 11:10:06 AM|SETI@home|Requesting 86400 seconds of new work, and reporting 3 completed tasks
6/1/2006 11:10:31 AM|SETI@home|Scheduler request failed: couldn't connect to server
6/1/2006 11:10:31 AM|SETI@home|Deferring scheduler requests for 1 minutes and 0 seconds
6/1/2006 11:11:31 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
6/1/2006 11:11:31 AM|SETI@home|Reason: To report completed tasks
6/1/2006 11:11:31 AM|SETI@home|Requesting 86400 seconds of new work, and reporting 3 completed tasks
6/1/2006 11:12:16 AM|SETI@home|Scheduler request succeeded
6/1/2006 11:12:18 AM|SETI@home|Started download of file 20mr99aa.15250.25394.136088.3.58
6/1/2006 11:12:18 AM|SETI@home|Started download of file 23mr99aa.2562.7986.623562.3.135
6/1/2006 11:16:22 AM|SETI@home|Finished download of file 23mr99aa.2562.7986.623562.3.135
6/1/2006 11:16:22 AM|SETI@home|Throughput 1488 bytes/sec
6/1/2006 11:16:22 AM|SETI@home|Started download of file 28fe99aa.18483.18049.861064.3.122
6/1/2006 11:16:24 AM||Rescheduling CPU: files downloaded
6/1/2006 11:16:24 AM|SETI@home|Finished download of file 20mr99aa.15250.25394.136088.3.58
6/1/2006 11:16:24 AM|SETI@home|Throughput 1479 bytes/sec
6/1/2006 11:16:24 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:24 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:25 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:25 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:26 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:26 AM|SETI@home|Unrecoverable error for result 23mr99aa.2562.7986.623562.3.135_2 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 11:16:26 AM||Rescheduling CPU: start failed
6/1/2006 11:16:26 AM||Rescheduling CPU: files downloaded
6/1/2006 11:16:26 AM|SETI@home|Computation for task 23mr99aa.2562.7986.623562.3.135_2 finished
6/1/2006 11:16:26 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:27 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:27 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:27 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:27 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:16:27 AM|SETI@home|Unrecoverable error for result 20mr99aa.15250.25394.136088.3.58_3 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 11:16:27 AM||Rescheduling CPU: start failed
6/1/2006 11:16:27 AM|SETI@home|Computation for task 20mr99aa.15250.25394.136088.3.58_3 finished
6/1/2006 11:18:03 AM|SETI@home|Finished download of file 28fe99aa.18483.18049.861064.3.122
6/1/2006 11:18:03 AM|SETI@home|Throughput 3598 bytes/sec
6/1/2006 11:18:04 AM||Rescheduling CPU: files downloaded
6/1/2006 11:18:05 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:18:05 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:18:05 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:18:06 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:18:06 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:18:06 AM|SETI@home|Unrecoverable error for result 28fe99aa.18483.18049.861064.3.122_1 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 11:18:06 AM||Rescheduling CPU: start failed
6/1/2006 11:18:06 AM|SETI@home|Computation for task 28fe99aa.18483.18049.861064.3.122_1 finished
6/1/2006 11:22:23 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
6/1/2006 11:22:23 AM|SETI@home|Reason: To report completed tasks
6/1/2006 11:22:23 AM|SETI@home|Requesting 86400 seconds of new work, and reporting 3 completed tasks
6/1/2006 11:22:45 AM||Project communication failed: attempting access to reference site
6/1/2006 11:22:48 AM|SETI@home|Scheduler request failed: couldn't connect to server
6/1/2006 11:22:48 AM|SETI@home|Deferring scheduler requests for 1 minutes and 0 seconds
6/1/2006 11:23:00 AM||Access to reference site succeeded - project servers may be temporarily down.
6/1/2006 11:23:49 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
6/1/2006 11:23:49 AM|SETI@home|Reason: To report completed tasks
6/1/2006 11:23:49 AM|SETI@home|Requesting 86400 seconds of new work, and reporting 3 completed tasks
6/1/2006 11:24:59 AM|SETI@home|Scheduler request succeeded
6/1/2006 11:25:01 AM|SETI@home|Started download of file 20mr99aa.22885.23233.479804.3.184
6/1/2006 11:25:01 AM|SETI@home|Started download of file 14ja99aa.5585.21986.542316.3.137
6/1/2006 11:29:45 AM|SETI@home|Finished download of file 14ja99aa.5585.21986.542316.3.137
6/1/2006 11:29:45 AM|SETI@home|Throughput 1279 bytes/sec
6/1/2006 11:29:45 AM|SETI@home|Started download of file 20mr99aa.22885.23233.479804.3.172
6/1/2006 11:29:46 AM||Rescheduling CPU: files downloaded
6/1/2006 11:29:46 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:29:47 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:29:48 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:29:48 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:29:49 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:29:49 AM|SETI@home|Unrecoverable error for result 14ja99aa.5585.21986.542316.3.137_1 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 11:29:49 AM||Rescheduling CPU: start failed
6/1/2006 11:29:49 AM|SETI@home|Computation for task 14ja99aa.5585.21986.542316.3.137_1 finished
6/1/2006 11:31:13 AM|SETI@home|Finished download of file 20mr99aa.22885.23233.479804.3.184
6/1/2006 11:31:13 AM|SETI@home|Throughput 974 bytes/sec
6/1/2006 11:31:14 AM||Rescheduling CPU: files downloaded
6/1/2006 11:31:14 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:31:15 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:31:15 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:31:16 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:31:17 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:31:17 AM|SETI@home|Unrecoverable error for result 20mr99aa.22885.23233.479804.3.184_0 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 11:31:17 AM||Rescheduling CPU: start failed
6/1/2006 11:31:17 AM|SETI@home|Computation for task 20mr99aa.22885.23233.479804.3.184_0 finished
6/1/2006 11:35:05 AM|SETI@home|Sending scheduler request to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
6/1/2006 11:35:05 AM|SETI@home|Reason: To report completed tasks
6/1/2006 11:35:05 AM|SETI@home|Requesting 51504 seconds of new work, and reporting 2 completed tasks
6/1/2006 11:35:55 AM|SETI@home|Scheduler request succeeded
6/1/2006 11:35:57 AM|SETI@home|Started download of file 20mr99aa.15250.25441.848592.3.187
6/1/2006 11:36:03 AM|SETI@home|Finished download of file 20mr99aa.22885.23233.479804.3.172
6/1/2006 11:36:03 AM|SETI@home|Throughput 961 bytes/sec
6/1/2006 11:36:03 AM|SETI@home|Started download of file 28fe99aa.18483.18097.523582.3.98
6/1/2006 11:36:04 AM||Rescheduling CPU: files downloaded
6/1/2006 11:36:04 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:36:04 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:36:05 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:36:05 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:36:05 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:36:05 AM|SETI@home|Unrecoverable error for result 20mr99aa.22885.23233.479804.3.172_1 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 11:36:05 AM||Rescheduling CPU: start failed
6/1/2006 11:36:06 AM|SETI@home|Computation for task 20mr99aa.22885.23233.479804.3.172_1 finished
6/1/2006 11:36:19 AM||Project communication failed: attempting access to reference site
6/1/2006 11:36:21 AM|SETI@home|Temporarily failed download of 20mr99aa.15250.25441.848592.3.187: http error
6/1/2006 11:36:21 AM|SETI@home|Backing off 1 minutes and 0 seconds on download of file 20mr99aa.15250.25441.848592.3.187
6/1/2006 11:36:29 AM|SETI@home|Temporarily failed download of 28fe99aa.18483.18097.523582.3.98: http error
6/1/2006 11:36:29 AM|SETI@home|Backing off 1 minutes and 0 seconds on download of file 28fe99aa.18483.18097.523582.3.98
6/1/2006 11:37:21 AM|SETI@home|Started download of file 20mr99aa.15250.25441.848592.3.187
6/1/2006 11:37:25 AM||Access to reference site succeeded - project servers may be temporarily down.
6/1/2006 11:37:29 AM|SETI@home|Started download of file 28fe99aa.18483.18097.523582.3.98
6/1/2006 11:40:21 AM|SETI@home|Finished download of file 20mr99aa.15250.25441.848592.3.187
6/1/2006 11:40:21 AM|SETI@home|Throughput 2026 bytes/sec
6/1/2006 11:40:22 AM||Rescheduling CPU: files downloaded
6/1/2006 11:40:22 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:40:23 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:40:23 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:40:23 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:40:24 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)
6/1/2006 11:40:25 AM|SETI@home|Unrecoverable error for result 20mr99aa.15250.25441.848592.3.187_3 (CreateProcess() failed - The system cannot find the file specified. (0x2))
6/1/2006 11:40:25 AM||Rescheduling CPU: start failed
6/1/2006 11:40:25 AM|SETI@home|Computation for task 20mr99aa.15250.25441.848592.3.187_3 finished
6/1/2006 11:41:26 AM|SETI@home|Finished download of file 28fe99aa.18483.18097.523582.3.98
6/1/2006 11:41:26 AM|SETI@home|Throughput 1531 bytes/sec
6/1/2006 11:41:27 AM||Rescheduling CPU: files downloaded
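
The log lines above share a pipe-delimited layout (timestamp|project|message), with an empty project field for client-level messages. As a minimal sketch, assuming that layout holds for every line, the following Python pulls out the names of results that ended in an unrecoverable error. The function names are hypothetical and are not part of BOINC.

```python
def parse_line(line):
    """Split one message-log line into (timestamp, project, message).

    Assumes the 'timestamp|project|message' layout seen in the pasted log.
    Returns None for lines that do not fit that layout.
    """
    parts = line.rstrip("\n").split("|", 2)
    if len(parts) != 3:
        return None
    return tuple(parts)


def failed_results(lines):
    """Return result names taken from 'Unrecoverable error for result' lines."""
    prefix = "Unrecoverable error for result "
    names = []
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue
        message = parsed[2]
        if message.startswith(prefix):
            # The result name is the first token after the prefix.
            names.append(message[len(prefix):].split(" ", 1)[0])
    return names


# Three lines copied from the log above.
sample = [
    "6/1/2006 10:59:40 AM|SETI@home|Process creation failed: The system cannot find the file specified. (0x2)",
    "6/1/2006 10:59:40 AM|SETI@home|Unrecoverable error for result 20mr99aa.15250.25346.467332.3.241_0 (CreateProcess() failed - The system cannot find the file specified. (0x2))",
    "6/1/2006 10:59:40 AM||Rescheduling CPU: start failed",
]
print(failed_results(sample))
```

Run over the whole paste, a list like this makes it easy to see that every downloaded task fails at process creation, which is a different symptom from a task that runs and then errors out.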

ID: 322499 · Report as offensive
Profile Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 322519 - Posted: 1 Jun 2006, 16:04:19 UTC - in response to Message 322499.  

Still? ANY Ideas? Other than the fact that for some reason, a "_0" (or _1, _2 etc) is being added to each unit...

The _0, _1, _2 etc. at the end of the result name just tells you which release of the result it is: _0 is the 1st release, _1 the 2nd, _2 the 3rd, and so on. If a result doesn't reach quorum, it is sent out to another person or persons. That copy then gets _1 behind it, and so on, until the result has been returned by 3 computers, all validated correctly and credits granted.
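
As a small illustration of the naming described above, the sketch below splits a result name into the workunit name and the numeric suffix. It assumes only what the post states, namely that the suffix is the digits after the final underscore. The function name is hypothetical.

```python
def split_result_name(result_name):
    """Split 'workunit_N' into (workunit, N), where N is the trailing index.

    Raises ValueError if the name has no numeric suffix after the last '_'.
    """
    workunit, sep, suffix = result_name.rpartition("_")
    if not sep or not workunit or not suffix.isdigit():
        raise ValueError("no numeric suffix in result name: %r" % result_name)
    return workunit, int(suffix)


# A result name taken from the log earlier in the thread.
workunit, index = split_result_name("20mr99aa.15250.25346.467332.3.241_0")
print(workunit, index)
```

The workunit part is the same string as the downloaded file name in the log, which fits the explanation that the suffix identifies the copy and is not part of the file on disk.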
ID: 322519 · Report as offensive
Alinator
Volunteer tester

Joined: 19 Apr 05
Posts: 4178
Credit: 4,647,982
RAC: 0
United States
Message 322528 - Posted: 1 Jun 2006, 16:21:32 UTC - in response to Message 322499.  
Last modified: 1 Jun 2006, 16:23:23 UTC

Still? ANY Ideas? Other than the fact that for some reason, a "_0" (or _1, _2 etc) is being added to each unit... is this why it says that the unit can't be found?

<snip>



What's the story with the 5.19 science app you appear to be using?

What happened when you were running "stock" stuff?

Don't know what you have tried so far, but at this point it would seem you have two choices:

1.) You could poof all the science app related files in the project directory and then update.

2.) Set Boinc to No New Work, abort all work onboard, update to get rid of them, and then reset the project.

Either way you should get back to ground zero with stock SE and then go from there.

To answer your other question, the _X at the end just indicates which specific result you were for the WU in question and shouldn't be the problem here (in theory). ;-)

HTH,

Alinator
ID: 322528 · Report as offensive
Profile Clyde C. Phillips, III

Joined: 2 Aug 00
Posts: 1851
Credit: 5,955,047
RAC: 0
United States
Message 322637 - Posted: 1 Jun 2006, 18:56:46 UTC - in response to Message 322444.  

I have a workunit which has been showing 97.690% complete for the last 8 hours - "CPU Time" is 24 hours, "To completion" is 30 mins and increasing slowly.

Taskmanager shows it to be consuming 100% CPU, its graphical display shows that something's happening - although the rate of display change is VERY slow compared with other active workunits.

Will it finish or is it stuck in a loop? Others with the workunit haven't returned it so I can't use their results as a guide.

arthurm


The fact that "time to completion" is increasing very slowly makes sense because (1.00 - 0.9769) * (24 hours + "To Completion") is also increasing. If 97.690% is not increasing it would make sense that the computation is not progressing. I may be wrong but I would abort the unit. I don't know about the graph tray but if "percent complete" is not increasing I would take that as "progress halted", possibly for good. I've aborted two units like that, and some abort automatically, apparently, after progress is stopped.
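
The relationship in the post above can be written as T = (1 - f) * (E + T), where E is the CPU time so far, f is the fraction done and T is the time to completion. Solving for T gives T = E * (1 - f) / f. The sketch below works that through. It reflects the relationship proposed in the post, which is an assumption and not necessarily the way the BOINC client computes its estimate. The function name is hypothetical.

```python
def time_to_completion(cpu_hours, fraction_done):
    """Estimate remaining hours from T = (1 - f) * (E + T), i.e. T = E * (1 - f) / f."""
    if not 0.0 < fraction_done <= 1.0:
        raise ValueError("fraction_done must be in (0, 1]")
    return cpu_hours * (1.0 - fraction_done) / fraction_done


# Figures from the stuck workunit: 24 hours of CPU time at 97.690% done.
remaining_minutes = time_to_completion(24.0, 0.9769) * 60.0
print(round(remaining_minutes, 1))

# With the fraction frozen, every extra CPU hour raises the estimate slightly.
print(time_to_completion(25.0, 0.9769) > time_to_completion(24.0, 0.9769))
```

With E = 24 hours and f = 0.9769 this comes to roughly 34 minutes, in line with the reported "30 mins and increasing slowly": while the percentage stays frozen, the estimate can only creep upward as CPU time accumulates.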

ID: 322637 · Report as offensive
Profile Jord
Volunteer tester
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 322645 - Posted: 1 Jun 2006, 19:05:16 UTC - in response to Message 322637.  

I have a workunit which has been showing 97.690% complete for the last 8 hours - "CPU Time" is 24 hours, "To completion" is 30 mins and increasing slowly.

Taskmanager shows it to be consuming 100% CPU, its graphical display shows that something's happening - although the rate of display change is VERY slow compared with other active workunits.

Will it finish or is it stuck in a loop? Others with the workunit haven't returned it so I can't use their results as a guide.

arthurm


The fact that "time to completion" is increasing very slowly makes sense because (1.00 - 0.9769) * (24 hours + "To Completion") is also increasing. If 97.690% is not increasing it would make sense that the computation is not progressing. I may be wrong but I would abort the unit. I don't know about the graph tray but if "percent complete" is not increasing I would take that as "progress halted", possibly for good. I've aborted two units like that, and some abort automatically, apparently, after progress is stopped.

Don't abort the result yet!

Try to exit Boinc first, then restart it. See what that does.
Else try to reboot your computer.

Why is everyone micromanaging their computers these days? I run all results without even looking at them.
ID: 322645 · Report as offensive
Miklos M.

Joined: 5 May 99
Posts: 955
Credit: 136,115,648
RAC: 73
Hungary
Message 322660 - Posted: 1 Jun 2006, 19:26:37 UTC - in response to Message 322645.  



Why is everyone micromanaging their computers these days? I run all results without even looking at them.
I think it may be due to dissatisfaction with the reliability of the units we are receiving.

Nick

ID: 322660 · Report as offensive
Profile Crunch3r
Volunteer tester
Joined: 15 Apr 99
Posts: 1546
Credit: 3,438,823
RAC: 0
Germany
Message 322661 - Posted: 1 Jun 2006, 19:27:17 UTC - in response to Message 322645.  
Last modified: 1 Jun 2006, 19:27:37 UTC


Why is everyone micromanaging their computers these days? I run all results without even looking at them.


Hmm well, I would guess because of computation errors on Seti Enhanced and WU's that get stuck, error out etc. ???

Could that be the cause ? ;)

Join BOINC United now!
ID: 322661 · Report as offensive
Miklos M.

Joined: 5 May 99
Posts: 955
Credit: 136,115,648
RAC: 73
Hungary
Message 322768 - Posted: 1 Jun 2006, 21:25:43 UTC

And another piece of garbage and time wasted on it:
27mr99aa.18824.18256.497156.3.133_2

In the future, if I get one more error on a unit with the June 6 due date, I shall abort all units due June 6. Wasted nearly an hour of electricity on this one.

Nick
ID: 322768 · Report as offensive
Brian Silvers

Joined: 11 Jun 99
Posts: 1681
Credit: 492,052
RAC: 0
United States
Message 322943 - Posted: 2 Jun 2006, 2:46:40 UTC - in response to Message 322645.  
Last modified: 2 Jun 2006, 2:51:29 UTC


Why is everyone micromanaging their computers these days? I run all results without even looking at them.


I look at my results fairly frequently, now that my credit has nosedived... I will admit that I liked seeing my ranking climbing upward. I think that is only natural...even though I know my credits won't get me any material thing in this world. I guess that if I do find "THE" signal, then the aliens will give me something though...?

At any rate, the time spent on crunching a few WUs has hurt tremendously. Take this WU for example: Nearly 9 hours for nothing

Now, take into account that I'm the FASTEST host reporting in so far on that WU (my computer is 2049482), and perhaps you can appreciate the growing anger amongst people with relatively high pre-enhanced RAC levels. For a single-core system, my RAC was up there. It could've been higher, had I had BOINC running all the time vs. about 86% of the time. YES, I was running optimized. YES, I know I was supposed to see a drop in RAC. YES, I know that some people consider RAC "meaningless" (debatable there, but I'll skip the debate). YES, I am overclocked heavily (my 3700+, which is supposed to be 2.2GHz, is running at 2.75GHz, which puts me between an FX-55 and an FX-57). Oh, and NO, I don't think it's this overclocking which is the problem, since everyone else that got the WU bombed out as well and I don't have a slew of other invalid results...

Now, I'm OK with the amount of credit dropping, but these units that take a long time to give up are killing not only my credit levels, but they are likely taking a toll on participation in the project as a whole. Perhaps not right now, but I'm sure there are people thinking "what's up with this crap"?

Brian

Edit: I see there is one host that "completed" the WU that went 91,570.93 seconds (nearly 25.5 hours) and is asking for a whopping 0.11 credit!

ID: 322943 · Report as offensive
Miklos M.

Joined: 5 May 99
Posts: 955
Credit: 136,115,648
RAC: 73
Hungary
Message 323034 - Posted: 2 Jun 2006, 6:13:10 UTC

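Result ID: 333366660 · Work unit ID: 79952833 · Sent: 28 May 2006 16:11:47 UTC · Reported: 2 Jun 2006 5:32:46 UTC · Server state: Over · Outcome: Success · Client state: Done · CPU time (sec): 2,495.16 · Claimed credit: 3.59 · Granted credit: 0.00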
ID: 323034 · Report as offensive
Profile UBT - AndyJG247

Joined: 26 Nov 02
Posts: 4
Credit: 1,000,341
RAC: 0
United Kingdom
Message 323061 - Posted: 2 Jun 2006, 7:07:49 UTC - in response to Message 322943.  

I just replaced my boinc.exe and dll with Crack3r's modified ones and so far no more errors.
ID: 323061 · Report as offensive
vicel

Joined: 28 Mar 06
Posts: 4
Credit: 819,493
RAC: 0
Ukraine
Message 323083 - Posted: 2 Jun 2006, 7:52:37 UTC

Error for WU: 20mr99aa.15250.21794.4826.3.255
http://setiathome.berkeley.edu/workunit.php?wuid=80321839

All the other clients got error for this workunit.

BOINC 5.4.9

ID: 323083 · Report as offensive
imnomadus

Joined: 11 Jan 06
Posts: 1
Credit: 572
RAC: 0
United States
Message 323114 - Posted: 2 Jun 2006, 8:59:07 UTC

So what is up with this crap?

To quote a person, that is.

I had my computer crunching all through the three-day weekend, from Friday afternoon until Monday night. The only thing it had working on it was SETI, yet when I closed it down Monday night BOINC said it had only crunched 5 hours and 22% of the work.

What the ????

Needless to say my work load is not going to be done by the completion date ... in three hours.

s
ID: 323114 · Report as offensive
arthurm

Joined: 17 May 99
Posts: 2
Credit: 392,066
RAC: 0
United Kingdom
Message 323217 - Posted: 2 Jun 2006, 11:20:58 UTC - in response to Message 322645.  

I have a workunit which has been showing 97.690% complete for the last 8 hours - "CPU Time" is 24 hours, "To completion" is 30 mins and increasing slowly.

Taskmanager shows it to be consuming 100% CPU, its graphical display shows that something's happening - although the rate of display change is VERY slow compared with other active workunits.

Will it finish or is it stuck in a loop? Others with the workunit haven't returned it so I can't use their results as a guide.

arthurm


The fact that "time to completion" is increasing very slowly makes sense because (1.00 - 0.9769) * (24 hours + "To Completion") is also increasing. If 97.690% is not increasing it would make sense that the computation is not progressing. I may be wrong but I would abort the unit. I don't know about the graph tray but if "percent complete" is not increasing I would take that as "progress halted", possibly for good. I've aborted two units like that, and some abort automatically, apparently, after progress is stopped.

Don't abort the result yet!

Try to exit Boinc first, then restart it. See what that does.
Else try to reboot your computer.

Why is everyone micromanaging their computers these days? I run all results without even looking at them.


That fixed it - many thanks.

I don't think I micromanage; I just look at what's running every now and again and spotted this "stuck" workunit. All is well now.

Thanks again
ID: 323217 · Report as offensive
peristalsis

Joined: 23 Jul 99
Posts: 154
Credit: 28,610,163
RAC: 51
United States
Message 323282 - Posted: 2 Jun 2006, 13:04:33 UTC

I have several enhanced units with a projected completion time of 14 hours.
My machine finished one and the error I received was:
6/1/2006 12:13:32 PM|SETI@home|Unrecoverable error for result 27mr99aa.15051.497.286082.3.52_6 (Maximum CPU time exceeded)
Now I look at several others who did the same wu, and I see three with a credit of .11 (as in point 11) and the rest (four) errored out as I did.
I see that my machine wasted 18 hours of time on this turkey. My main concern is that I have several other 14 hour WU's in my queue. Are they also going to be a waste of time and should I just dump them or what? Suggestions and recommendations solicited (g). thanks..p

ID: 323282 · Report as offensive
Miklos M.

Joined: 5 May 99
Posts: 955
Credit: 136,115,648
RAC: 73
Hungary
Message 323486 - Posted: 2 Jun 2006, 16:01:06 UTC
Last modified: 2 Jun 2006, 16:02:03 UTC

6/2/2006 11:26:18 AM|SETI@home|Task 03mr99ab.21236.9360.465900.3.172_4 exited with zero status but no 'finished' file
I am just wondering, is anyone at BOINC reading these and trying to correct the problem?

Nick

ID: 323486 · Report as offensive
Alinator
Volunteer tester

Joined: 19 Apr 05
Posts: 4178
Credit: 4,647,982
RAC: 0
United States
Message 323503 - Posted: 2 Jun 2006, 16:13:34 UTC - in response to Message 323486.  
Last modified: 2 Jun 2006, 16:15:43 UTC

6/2/2006 11:26:18 AM|SETI@home|Task 03mr99ab.21236.9360.465900.3.172_4 exited with zero status but no 'finished' file
I am just wondering, is anyone at BOINC reading these and trying to correct the problem?

Nick



Keep in mind this message can get generated for reasons not related to a "hard" error from the science app.

A no biggie scenario is when the science app fails to get the heartbeat from the BOINC exe for a while and exits like it's designed to do. Then when BOINC sees it went away unexpectedly it thinks the result is done, looks for the output data file and when it doesn't find it, it throws the log message and will restart it from the last checkpoint, just like a regular task switch.
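
The scenario described above can be sketched as a small decision function. This is a simplified model of the behaviour as the post describes it, not BOINC's actual code: the function name, the boolean "finished marker" and the returned labels are all hypothetical, and the non-zero exit branch is an assumption added for completeness.

```python
def client_action(exit_status, finished_marker_present):
    """Model what the client does when a science app process goes away.

    exit_status: process exit code (0 means a clean exit).
    finished_marker_present: True if the app left evidence that it really
    finished (hypothetical stand-in for the file the client looks for).
    """
    if exit_status == 0 and finished_marker_present:
        return "report result"
    if exit_status == 0:
        # Clean exit without a finished marker, e.g. after a missed
        # heartbeat: log the message and resume from the last checkpoint.
        return "restart from checkpoint"
    # Assumption: any non-zero exit is handled as a computation error.
    return "computation error"


# The case behind "exited with zero status but no 'finished' file".
print(client_action(0, False))
```

In this model the log message on its own costs only the work done since the last checkpoint, which is why it is described above as a "no biggie" scenario.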

Alinator
ID: 323503 · Report as offensive
Profile Bymark
Joined: 30 Dec 04
Posts: 29
Credit: 700,896
RAC: 0
Finland
Message 323584 - Posted: 2 Jun 2006, 18:40:59 UTC

- I am pissed off on behalf of Crunch3r, whose good work is not getting the credit it is supposed to.
Next week all my SETI computers (20) go on NO NEW WORK until he gets an APOLOGY from the SETI administrators and from everyone on this forum who pissed him off.
- A second reason is that I got 5 work units yesterday, about 45 hours work/units on my AMD Opteron 165 computer. Only 5 of my 20 computers are optimized. Sucks.
- Until we get back our 4 hour Crunch units!

cya2 : Turku, Finland
Comments to: thomas.bymark@pp.inet.fi

http://www.boincstats.com/ :

SETI@Home Credit/day: 2,833
Position based on Total Credit 1484, and going down next week (seti).

Until we meet again................... ( SETI@home enhanced, sucks, as now)

ID: 323584 · Report as offensive


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.