Can't report or get new tasks

Message boards : Number crunching : Can't report or get new tasks
Message board moderation

To post messages, you must log in.

Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 · Next

AuthorMessage
Dave

Send message
Joined: 29 Mar 02
Posts: 778
Credit: 25,001,396
RAC: 0
United Kingdom
Message 1124671 - Posted: 4 Jul 2011, 18:57:16 UTC
Last modified: 4 Jul 2011, 18:57:26 UTC

I reboot.

It's fine now.

Heh.
ID: 1124671 · Report as offensive
Profile dahls

Send message
Joined: 24 Oct 04
Posts: 135
Credit: 178,942,502
RAC: 217
Norway
Message 1126628 - Posted: 10 Jul 2011, 18:03:24 UTC - in response to Message 1124455.  

And still, the last contact date is June 25th for this machine.

I don't understand this at all.
I am certain that this morning it showed a wu that was uploaded at about 7:00 UTC on 3 July. I even checked the link I did put down in my previous post.
But now the link shows another wu. ???


After being out of the house for a week, the two machines has not contacted SETI according to the "Your computers" status. They are not able to report completed WU's and not able to get new WU's (since they already have too many of them that has not been reported).

The two machines report "500 Internal server error" as the return status from the servers at berkeley.

I give up! (unless some people at the project cam come up with something clever). I don't want to spend more time finding out what is wrong in the communication. All I know is that the linux machines with Fedora Core 12 will not run BOINC properly after some time. All other linux machiens (FC4 and FC10) are working find.

Thanks to all who have replied to this thread.
May the source be with you :)
ID: 1126628 · Report as offensive
Profile dahls

Send message
Joined: 24 Oct 04
Posts: 135
Credit: 178,942,502
RAC: 217
Norway
Message 1133674 - Posted: 29 Jul 2011, 21:07:48 UTC

Latest news: Three more linux machines are not longer able to communicate with BOINC/SETI.

Status: ALL linux servers are not able to crunch data!
Windows machines seem to work OK.


Does this mean that BOINC work on Windows ONLY???? I does not hope so!
ID: 1133674 · Report as offensive
Profile BilBg
Volunteer tester
Avatar

Send message
Joined: 27 May 07
Posts: 3720
Credit: 9,385,827
RAC: 0
Bulgaria
Message 1134155 - Posted: 30 Jul 2011, 15:47:12 UTC - in response to Message 1133674.  


I asked earlier:
http://setiathome.berkeley.edu/forum_thread.php?id=63664&nowrap=true#1124472

Do the SETI server see the same "External IP address" for all your Linux and Windows machines?

If they connect from different External IP addresses they may be affected by the "HE connection problems".

Did you try to set/use HTTP Proxy in BOINC options?
http://setiathome.berkeley.edu/forum_thread.php?id=64691
http://setiathome.berkeley.edu/forum_thread.php?id=64691&nowrap=true#1129151


 


- ALF - "Find out what you don't do well ..... then don't do it!" :)
 
ID: 1134155 · Report as offensive
Profile dahls

Send message
Joined: 24 Oct 04
Posts: 135
Credit: 178,942,502
RAC: 217
Norway
Message 1134608 - Posted: 31 Jul 2011, 20:25:03 UTC - in response to Message 1134155.  


I asked earlier:
http://setiathome.berkeley.edu/forum_thread.php?id=63664&nowrap=true#1124472

Do the SETI server see the same "External IP address" for all your Linux and Windows machines?

If they connect from different External IP addresses they may be affected by the "HE connection problems".

Did you try to set/use HTTP Proxy in BOINC options?
http://setiathome.berkeley.edu/forum_thread.php?id=64691
http://setiathome.berkeley.edu/forum_thread.php?id=64691&nowrap=true#1129151



Yes, all machines are hiding behind the same IP. I have not set up any proxy server.
But I see that windows machines do get new WUs while all linux servers get http error (no WUs).

There MUST be something wrong at the server side.
ID: 1134608 · Report as offensive
S@NL - John van Gorsel
Volunteer tester
Avatar

Send message
Joined: 5 Jul 99
Posts: 193
Credit: 139,673,078
RAC: 0
Netherlands
Message 1134633 - Posted: 31 Jul 2011, 21:51:58 UTC - in response to Message 1134608.  


There MUST be something wrong at the server side.


There are some differences between Linux and Windows. Since 3 days, my Linux pc's are not able to download unless I add the 208.68.240.18 (download server) to the host file. My Windows 7 pc's use the same network, router and modem but do not have any problems downloading.

I'm not saying that your problem is the same but the fact that Linux has problems and Windows does not, does not necessarily mean the problem is at the server side.


Seti@Netherlands website
ID: 1134633 · Report as offensive
Claggy
Volunteer tester

Send message
Joined: 5 Jul 99
Posts: 4654
Credit: 47,537,079
RAC: 4
United Kingdom
Message 1134645 - Posted: 31 Jul 2011, 22:23:13 UTC - in response to Message 1134633.  


There MUST be something wrong at the server side.


There are some differences between Linux and Windows. Since 3 days, my Linux pc's are not able to download unless I add the 208.68.240.18 (download server) to the host file. My Windows 7 pc's use the same network, router and modem but do not have any problems downloading.

I'm not saying that your problem is the same but the fact that Linux has problems and Windows does not, does not necessarily mean the problem is at the server side.

I'd be very tempted to try one of the problem Linux hosts on a different ISP,

and try one of the probem Linux hosts on the Network with all of the other hosts disconnected,

Claggy
ID: 1134645 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22205
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1134756 - Posted: 1 Aug 2011, 11:26:28 UTC

If I read dahls's post correctly he is saying that all his PCs are connected through the same router to the same isp, but some (Windows 7) are getting/reporting WU, but others (Linux) are not. This suggests that it is a problem with the way his Linux PCs are set up.
My first guess is that John is running a firewall on those PCs that is blocking a range of IP addresses, and needs an exception set for the S@H servers. By adding the ip208.68.240.18 address to the hosts list he is allowing communication, so, effectively solving the problem. This may of course have come about from an update to his Linux system, and possibly one he didn't know happened.
For dahls's I can confirm that S@H runs smoothly on Linux under BOINC - just look at my computers for confirmation (I cannot comment on other projects under BOINC because my Linux crunchers are currently only doing S@H).
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1134756 · Report as offensive
Profile tullio
Volunteer tester

Send message
Joined: 9 Apr 04
Posts: 8797
Credit: 2,930,782
RAC: 1
Italy
Message 1134758 - Posted: 1 Aug 2011, 11:49:21 UTC

On my Linux box I am getting downloads from QMC@home, Einstein@home and Test4Theory@home but not from SETI@home where I have 3 stuck downloads. I haven't changed anything recently.
Tullio
ID: 1134758 · Report as offensive
Profile dahls

Send message
Joined: 24 Oct 04
Posts: 135
Credit: 178,942,502
RAC: 217
Norway
Message 1134789 - Posted: 1 Aug 2011, 15:04:22 UTC - in response to Message 1134758.  

Reinstalled boinc today on one of the linux servers. It got a few WU's which was done within a few hours.

But, when looking at the status for the machine, http://setiathome.berkeley.edu/results.php?hostid=5256434&offset=0&show_names=0&state=1&appid= it says that it has 96 WUs pending.

Looking into the BOINC/projects/setiathome.berkeley.edu directory, there is hardly any files at all:
[userx@r2-d2 ~]$ cd BOINC/projects/setiathome.berkeley.edu/
[userx@r2-d2 setiathome.berkeley.edu]$ ls -l
total 7404
-rw-r--r--. 1 userx userx       0 2011-08-01 13:04 17ap11ad.23942.18472.5.10.150
-rw-r--r--. 1 userx userx       0 2011-08-01 08:36 17ap11ad.23942.18472.5.10.155
-rw-r--r--. 1 userx userx       0 2011-08-01 13:04 17ap11ad.23942.18472.5.10.156
-rw-r--r--. 1 userx userx  375332 2011-08-01 08:37 17ap11ad.23942.18472.5.10.161
-rw-r--r--. 1 userx userx       0 2011-08-01 08:36 17ap11ad.23942.18472.5.10.162
-rw-r--r--. 1 userx userx       0 2011-08-01 08:36 17ap11ad.23942.18472.5.10.167
-rw-r--r--. 1 userx userx       0 2011-08-01 08:37 17ap11ad.23942.18472.5.10.173
-rw-r--r--. 1 userx userx  375331 2011-08-01 08:36 17ap11ad.23942.18472.5.10.174
-rw-r--r--. 1 userx userx       0 2011-08-01 08:36 17ap11ad.23942.18472.5.10.191
-rw-r--r--. 1 userx userx       0 2011-08-01 08:37 17ap11ad.23942.18472.5.10.197
-rw-r--r--. 1 userx userx       0 2011-08-01 08:39 17ap11ad.23942.18472.5.10.203
-rw-r--r--. 1 userx userx       0 2011-08-01 08:39 17ap11ad.23942.18472.5.10.209
-rw-r--r--. 1 userx userx   52779 2011-08-01 08:31 arecibo_181.png
-rw-r--r--. 1 userx userx    2536 2011-08-01 08:31 sah_40.png
-rw-r--r--. 1 userx userx   25488 2011-08-01 08:31 sah_banner_290.png
-rw-r--r--. 1 userx userx   35399 2011-08-01 08:32 sah_ss_290.png
-rw-r--r--. 1 userx userx    9068 2011-08-01 08:30 seti_528.jpg
-rw-r--r--. 1 userx userx     751 2011-08-01 08:33 setiathome-5.28_AUTHORS
-rw-r--r--. 1 userx userx   17992 2011-08-01 08:30 setiathome-5.28_COPYING
-rw-r--r--. 1 userx userx    1990 2011-08-01 08:30 setiathome-5.28_COPYRIGHT
-rw-r--r--. 1 userx userx    2060 2011-08-01 08:30 setiathome-5.28_README
-rwxr-xr-x. 1 userx userx 6641240 2011-08-01 08:35 setiathome-5.28.x86_64-pc-linux-gnu
-rw-r--r--. 1 userx userx      71 2011-08-01 11:06 slideshow_setiathome_enhanced_00
-rw-r--r--. 1 userx userx      72 2011-08-01 11:06 slideshow_setiathome_enhanced_01
-rw-r--r--. 1 userx userx      75 2011-08-01 11:06 slideshow_setiathome_enhanced_02
-rw-r--r--. 1 userx userx      67 2011-08-01 11:06 stat_icon


And the debug output file indicate that no files has been uploaded, only a few been downloaded.

I would like to see some kind of explanation about this? I have been trying to run SETI under linux for months, but it seems to be almost impossible.
ID: 1134789 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22205
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1134813 - Posted: 1 Aug 2011, 15:50:04 UTC

Pending = work that has been sent to your cruncher and not reported
96 WU have been sent, but not reported yet, these may be "lost".
There are 4 tasks that have been reported and are awaiting validation (all reported today), and awaiting your wingmen to do their processing.
When you re-installed BOINC & S@H did you do a clean install or did you just re-install the applications?
If you did a clean install the data (tasks to be run and results awaiting return) may have been removed, so "lost" - it might be worth looking in "lost and found" to see if they've been sent there, but it is a very long shot, as you would have to (manually) recreate the xml control files that tell BOINC what tasks are available, and what their status is.
Under Linux I've found it worth letting the cache empty, and forcing a delivery before doing an update as it is just a bit less user tame than windows.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1134813 · Report as offensive
Bernd Noessler

Send message
Joined: 15 Nov 09
Posts: 99
Credit: 52,635,434
RAC: 0
Germany
Message 1134816 - Posted: 1 Aug 2011, 15:51:48 UTC

You didn't reinstall boinc. You installed a 64bit linux client on a machine where
a 32bit linux client was installed. By doing this the 32bit client was deleted.
And boinc deleted all tasks which had to be crunched with this client.
And now all this tasks will reach there deadline. In the moment your are allowed
to load only 16 tasks per day.

It's not impossible to crunch seti with linux. Have a look at my machines.

ID: 1134816 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22205
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1134822 - Posted: 1 Aug 2011, 15:58:34 UTC
Last modified: 1 Aug 2011, 16:00:43 UTC

Dahls - I've just looked at another thread about unable to get downloads, do uploads or report finished tasks. It would appear there is "a problem" with the .13 address, so forcing which forcing the use of the .18 address overcomes.

I think what is happening is that Linux is trying the lower (.13) ip address every time it attempts, not knowing about the higher (.18) address - hence your earlier comment about being able to work properly if you for the host address to the .18 address.
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1134822 · Report as offensive
Pedregulho
Avatar

Send message
Joined: 16 May 03
Posts: 30
Credit: 1,627,248
RAC: 0
Portugal
Message 1134839 - Posted: 1 Aug 2011, 16:23:16 UTC

Seg 01 Ago 2011 17:14:08 WEST SETI@home Computation for task 08ap11an.19796.22153.3.10.214_0 finished
Seg 01 Ago 2011 17:14:08 WEST SETI@home Starting 31oc10ac.3180.1415.16.10.89_1
Seg 01 Ago 2011 17:14:08 WEST SETI@home Starting task 31oc10ac.3180.1415.16.10.89_1 using setiathome_enhanced version 603
Seg 01 Ago 2011 17:14:10 WEST SETI@home Started upload of 08ap11an.19796.22153.3.10.214_0_0
Seg 01 Ago 2011 17:14:16 WEST SETI@home Finished upload of 08ap11an.19796.22153.3.10.214_0_0
Seg 01 Ago 2011 17:14:23 WEST SETI@home Computation for task 08ap11an.19796.22153.3.10.207_0 finished
Seg 01 Ago 2011 17:14:23 WEST SETI@home Starting 31oc10ac.3180.1415.16.10.88_0
Seg 01 Ago 2011 17:14:23 WEST SETI@home Starting task 31oc10ac.3180.1415.16.10.88_0 using setiathome_enhanced version 603
Seg 01 Ago 2011 17:14:25 WEST SETI@home Started upload of 08ap11an.19796.22153.3.10.207_0_0
Seg 01 Ago 2011 17:14:28 WEST SETI@home Finished upload of 08ap11an.19796.22153.3.10.207_0_0
Seg 01 Ago 2011 17:16:42 WEST SETI@home Sending scheduler request: To fetch work.
Seg 01 Ago 2011 17:16:42 WEST SETI@home Reporting 2 completed tasks, requesting new tasks for CPU
Seg 01 Ago 2011 17:16:45 WEST SETI@home Scheduler request completed: got 0 new tasks
Seg 01 Ago 2011 17:16:45 WEST SETI@home Message from server: Project has no tasks available

All normal in Ubuntu but...
Project has no tasks availabl...e

ID: 1134839 · Report as offensive
Paul

Send message
Joined: 17 May 99
Posts: 72
Credit: 42,977,964
RAC: 43
United States
Message 1134865 - Posted: 1 Aug 2011, 16:55:45 UTC - in response to Message 1134822.  

Thanks, Rob. I restarted BOINC and it immediately downloaded at least eight work units. But, then it started to fail downloads and now it thinks the project servers are all down.
ID: 1134865 · Report as offensive
OzzFan Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 9 Apr 02
Posts: 15691
Credit: 84,761,841
RAC: 28
United States
Message 1134878 - Posted: 1 Aug 2011, 17:26:20 UTC - in response to Message 1134865.  

People are grabbing and returning work faster than it is being created at the moment. Because of all this activity, it gives the appearance that the servers are unavailable.

When this happens, it is often referred to as a Distributed Denial of Service (DDoS). The only thing you can do is wait since your BOINC client will keep retrying on it's own.
ID: 1134878 · Report as offensive
Paul

Send message
Joined: 17 May 99
Posts: 72
Credit: 42,977,964
RAC: 43
United States
Message 1134888 - Posted: 1 Aug 2011, 17:42:48 UTC - in response to Message 1134878.  

Okay, thanks. I'll stop obsessing and let the back-off take over.
ID: 1134888 · Report as offensive
rob smith Crowdfunding Project Donor*Special Project $75 donorSpecial Project $250 donor
Volunteer moderator
Volunteer tester

Send message
Joined: 7 Mar 03
Posts: 22205
Credit: 416,307,556
RAC: 380
United Kingdom
Message 1134904 - Posted: 1 Aug 2011, 19:03:12 UTC

One reason for the very low production rate is the very high rate of rejection - if you look on the server status page you will see that the current "tapes" loaded have a fair number of "error" blocks, visually over half the tape of 14 blocks, which is far more than the usual 1 or two per tape. I wander if it takes more effort for the splitters when working with a high error tape over a low error tape?
Bob Smith
Member of Seti PIPPS (Pluto is a Planet Protest Society)
Somewhere in the (un)known Universe?
ID: 1134904 · Report as offensive
Profile Fourmyle of Ceres
Avatar

Send message
Joined: 24 Jun 99
Posts: 9
Credit: 1,826,095
RAC: 0
United States
Message 1134939 - Posted: 1 Aug 2011, 21:03:32 UTC

One of my 3 hosts stopped being able to communicate on Thurs 25-Jul-2011 while at least one of the others is fine.

I'm not using an http proxy and all hosts are behind the same firewall.

Host 1: This one (2-CPU Fedora 14)
http://stats.kwsn.net/host.php?proj=sah&hostid[]=5792705
hasn't had a successful WU download since Thurs 25-Jul-2011 13:08:48,

For days, Boinc Mgr 6.10.58 showed a couple dozen WU's as "downloading". It finally burned through, uploaded, and reported all the WUs I had on hand, then I did a manual project reset. Since the reset, I haven't even been able to download seti_528.jpg for 6+ hours, much less any workunits.

I've rebooted it. At the moment, it has complaints like this:

01-Aug-2011 15:25:35 [SETI@home] Temporarily failed download of setiathome-5.28.x86_64-pc-linux-gnu: HTTP error
01-Aug-2011 15:25:35 [SETI@home] Backing off 2 min 35 sec on download of setiathome-5.28.x86_64-pc-linux-gnu
01-Aug-2011 15:25:35 [SETI@home] Temporarily failed download of seti_528.jpg: HTTP error

etc, etc

Host 2: My other speedy box (Snow Leopard) has been uploading and reporting fine. (It normally crunches only during US EDT business hours, M-F, using a couple CPUs)

Host 3: My slowest box (single-cpu RHEL5) has been grinding through an AP for 125+ hours, so I have no current info.

If Host 1 doesn't get any work units by Thursday or so, I guess I'll reinstall, but I don't see why that would help.
ID: 1134939 · Report as offensive
Profile arkayn
Volunteer tester
Avatar

Send message
Joined: 14 May 99
Posts: 4438
Credit: 55,006,323
RAC: 0
United States
Message 1134979 - Posted: 1 Aug 2011, 22:53:56 UTC - in response to Message 1134939.  

One of my 3 hosts stopped being able to communicate on Thurs 25-Jul-2011 while at least one of the others is fine.

I'm not using an http proxy and all hosts are behind the same firewall.

Host 1: This one (2-CPU Fedora 14)
http://stats.kwsn.net/host.php?proj=sah&hostid[]=5792705
hasn't had a successful WU download since Thurs 25-Jul-2011 13:08:48,

For days, Boinc Mgr 6.10.58 showed a couple dozen WU's as "downloading". It finally burned through, uploaded, and reported all the WUs I had on hand, then I did a manual project reset. Since the reset, I haven't even been able to download seti_528.jpg for 6+ hours, much less any workunits.

I've rebooted it. At the moment, it has complaints like this:

01-Aug-2011 15:25:35 [SETI@home] Temporarily failed download of setiathome-5.28.x86_64-pc-linux-gnu: HTTP error
01-Aug-2011 15:25:35 [SETI@home] Backing off 2 min 35 sec on download of setiathome-5.28.x86_64-pc-linux-gnu
01-Aug-2011 15:25:35 [SETI@home] Temporarily failed download of seti_528.jpg: HTTP error

etc, etc

Host 2: My other speedy box (Snow Leopard) has been uploading and reporting fine. (It normally crunches only during US EDT business hours, M-F, using a couple CPUs)

Host 3: My slowest box (single-cpu RHEL5) has been grinding through an AP for 125+ hours, so I have no current info.

If Host 1 doesn't get any work units by Thursday or so, I guess I'll reinstall, but I don't see why that would help.


For the jpg file, you can copy it from any of your other machines and after restarting BOINC it will be picked up as in place.

ID: 1134979 · Report as offensive
Previous · 1 . . . 4 · 5 · 6 · 7 · 8 · 9 · 10 · Next

Message boards : Number crunching : Can't report or get new tasks


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.