Posts by Hellsheep

1) Message boards : Number crunching : CUDA no longer working w/3.1 update (Message 1021773)
Posted 1 Aug 2010 by Hellsheep
Post:
I'm a noob about this stuff, so apologies for any cluelessness here.

I'm running BOINC on a MacBook Pro (6,2) with an NVIDIA GeForce GT 330M. I had the NVIDIA CUDA 3.0 driver installed, and BOINC Manager identified the GPU as "usable."

I just installed the CUDA 3.1.10 update, and now BOINCManager (6.10.58) says "no usable GPUs found."

Any advice? Wait patiently until there's a new version of BOINC?


Hello, is BOINC installed as a service?
2) Message boards : Number crunching : Closed *SETI/BOINC Milestones [ v2.0 ] - XXI* Closed (Message 1021766)
Posted 1 Aug 2010 by Hellsheep
Post:
Seems I passed 500k :)
3) Message boards : Number crunching : Server Run, July 30 - August 2 2010 (Message 1021765)
Posted 1 Aug 2010 by Hellsheep
Post:
Read my post in the other thread about reporting problems. It seems to be an error with the web server.
4) Message boards : Number crunching : Problem reporting tasks (Message 1021764)
Posted 1 Aug 2010 by Hellsheep
Post:
Hey guys and girls,

Sorry I haven't been around much. Been busy working on some web servers for a company.

Anyway, after having these same issues I did some debugging.

I noticed that BOINC connects to the scheduler URL on port 80 (HTTP), so naturally that means Apache or some other web server is running on that machine.

I tried accessing http://setiboinc.ssl.berkeley.edu and even http://ssl.berkeley.edu a few times over an hour. I got 4 successful requests, which returned the default Fedora test page. The rest were not successful:

"The server at http://setiboinc.ssl.berkeley.edu is taking too long to respond."

This means BOINC cannot communicate with it either. I'm not sure whether it's some sort of network issue, or why it's only affecting some people and not others. But I know one thing: it's certainly an issue on the path to the server running the scheduler, since it's inaccessible a large amount of the time.

I have a feeling there is either an issue with the web server setup or a routing issue somewhere. If they use NFS, maybe something isn't working right in the communication between the scheduler servers and the database servers.

Just thought I should let you know, since although a few fixes have been listed here, none of them /should/ in theory actually fix anything, as the problem lies with the connection to that URL.
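
The repeated-request check described above can be approximated with a short script. This is only a rough sketch, assuming Python 3 and its standard `urllib`; the names `fetch_ok` and `probe` and the 10-second timeout are illustrative choices, not anything BOINC itself uses.

```python
import urllib.error
import urllib.request


def fetch_ok(url, timeout=10.0):
    """Return True if an HTTP GET to url answers with a 2xx status in time."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except (urllib.error.URLError, OSError):
        # Covers DNS failures, refused connections, HTTP errors and timeouts.
        return False


def probe(url, attempts, fetch=fetch_ok):
    """Request url `attempts` times and return (successes, failures)."""
    successes = 0
    for _ in range(attempts):
        if fetch(url):
            successes += 1
    return successes, attempts - successes
```

Calling `probe("http://setiboinc.ssl.berkeley.edu", 20)` would give a rough success count like the "4 successful requests" mentioned above. The `fetch` parameter exists only so the counting logic can be exercised without touching the network.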
5) Message boards : Number crunching : Panic Mode On (36) Server problems (Message 1021124)
Posted 31 Jul 2010 by Hellsheep
Post:
Still something screwy somewhere.
Forums are slow as, and getting lots of Scheduler request faileds, even though network traffic has dropped off significantly. And even when it does connect, still getting the Project has no tasks available even though there are now plenty there.


Yeah, I'm seeing plenty of failed scheduler requests. Might be a network issue again.
6) Message boards : Number crunching : home network question...... (Message 1020093)
Posted 27 Jul 2010 by Hellsheep
Post:
I'm out of ideas, sorry Geek.

Btw, you guys seem to have slow internet; I thought Australia was bad. This is mine:

7) Message boards : Number crunching : home network question...... (Message 1019836)
Posted 26 Jul 2010 by Hellsheep
Post:
Generally... Most DSL Modems are 100 megabit (wired).

What "I would do" is purchase a 1 gigabit switch so that all of your wired machines go into it (if you have the money, pay for Cat 6e wiring). Then you end up with a 1 gigabit link between the machines, 100 megabit from the machines to the DSL modem, 100 megabit from the wireless to the machines, and 150 to the internet.
The DSL wireless I have is supposed to do 150N, the max I have ever seen is 65 megabits (adaptive). I moved my wireless to a 300N Router which connects at 150 megabits.


10Mbit vs 100Mbit vs 1000Mbit... all should choke the same on a 3Mbit pipe (DSL sync speed), and even less upstream. Which is what makes me think it might be a flow control issue, given that the wireless is outperforming the wired.


This is correct. No matter what his network speed is here, it's well within range of handling the 3Mbit downstream connection. Even if you were to account for the line being asymmetric and added the upstream and downstream together, I doubt his downstream is 7Mbit+, and it's certainly not 97Mbit. :)

The issue most likely lies in the way traffic is flowing on the wired lines. I'm assuming you're using Cat5e cabling, you have at least a 100Mbit router/switch, and you're connected at 100Mbit. Geek, can you try something? On the machine(s) you're having trouble with, go to Start > Control Panel > Network Connections, right-click your network connection and click Properties > Configure. Once there, look for "Speed/Duplex Settings" or something along those lines and check what it's set to. Most times it'll be set to Auto or Auto Negotiation. Try setting it to 100Mb Full Duplex, or if you only have a Full Duplex setting that'll work too. Also ensure "Flow Control" is disabled. This should let traffic flow unrestricted.
8) Message boards : Number crunching : Are............................................................................................... (Message 1019830)
Posted 26 Jul 2010 by Hellsheep
Post:
I'm all ready with my nice big cache at 10 days on my somewhat-super-cruncher, which will do about 500 or so work units during those few days (not much compared to the top crunchers, but still enough to warrant a 10 day cache in my opinion), and a 4 day CPU cache on my slow machine.
9) Message boards : Number crunching : New rescheduler (Message 1018888)
Posted 23 Jul 2010 by Hellsheep
Post:
V 1.2

Add: A block on the BOINC client, to run while rescheduling. The BOINC Manager sometimes restarts the client.
Add: A warning in the log, when the BOINC Manager is running.
Add: Rescheduling now uses the CPU and GPU ratio. The ratio is used by SETI to correct runtime estimates.
Changed: Rescheduling on the regular SETI installation is now permitted. (With two different plan classes running).
Fixed: Several small bugs


What happens now that VLARs don't go to GPUs?
10) Message boards : Number crunching : Error Uploadserver (Message 1018585)
Posted 23 Jul 2010 by Hellsheep
Post:
Just to clarify: as far as I am aware, the way web servers and file systems work is that after a crash or a serious error the system reboots in read-only mode. Usually an fsck (file system check) is also run on the server automatically.

It would seem the server encountered an error, and was either rebooted or rebooted itself into read-only mode to prevent any further issues. :)

(Good thing web servers and servers are my specialty.) ;)

As I said in Pappa's thread yesterday, that makes a lot more sense than his off-the-cuff remark about a BOINC database crash. I don't know much about web or general *nix servers, but I do know a bit about databases - and if the early outage was invoked by staff because of database problems, then they would have been the result of the spontaneous reboot, not the original cause. Different symptoms entirely.


Richard, if I understand this right, you saw indications of a short power / Internet access interruption, which may have caused the upload/download servers (and maybe others) to reboot in read-only mode, which then caused the master BOINC database to crash. With ALL that chaos, they shut everything down and did a full restart.

That DOES make a whole lot more sense than just a Database crash.


100% correct. A power outage or surge would cause the servers to reboot in read-only mode, because the system treats it as a possible hardware failure. :)

The database probably did crash, but only as a result of the server being in read-only mode and unable to write anything. :)
11) Message boards : Number crunching : Error Uploadserver (Message 1018370)
Posted 22 Jul 2010 by Hellsheep
Post:
Just to clarify: as far as I am aware, the way web servers and file systems work is that after a crash or a serious error the system reboots in read-only mode. Usually an fsck (file system check) is also run on the server automatically.

It would seem the server encountered an error, and was either rebooted or rebooted itself into read-only mode to prevent any further issues. :)

(Good thing web servers and servers are my specialty.) ;)
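
For anyone who wants to check for the read-only state described above, here is a minimal sketch. It assumes a Linux server that exposes its mount table in the usual `/proc/mounts` format; the function name `read_only_mounts` is made up for illustration, and nothing here is known about how the project's servers are actually set up.

```python
def read_only_mounts(mounts_text):
    """Return the mount points whose option list contains 'ro'.

    mounts_text is expected in /proc/mounts format, one mount per line:
    device mountpoint fstype options dump pass
    """
    result = []
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        mountpoint = fields[1]
        options = fields[3].split(",")
        if "ro" in options:
            result.append(mountpoint)
    return result
```

On a live Linux machine the input would come from reading `/proc/mounts`. A data file system showing up in the result would fit the theory that the server dropped into read-only mode.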
12) Message boards : Number crunching : Pending units (Message 1017903)
Posted 20 Jul 2010 by Hellsheep
Post:
I am aware of that post. The problem is that my client went through the whole weekend last week without reporting anything by itself; I had to report the results manually.

BOINC should never wait more than 24 hours to report completed tasks. Are you sure that Network Activity is enabled? If you click 'update' to report them manually, BOINC temporarily opens the network to comply with your request, but then shuts it off again.


Network activity was on.

Today, after my rig finished crunching all its work, I detached to clear out 400+ ghost work units.

Now that they're gone and I've set it all back up, everything works perfectly. I've gotten another 1300 work units and it's reporting after every task or two.
13) Message boards : Number crunching : Any Ideas ... (Message 1017902)
Posted 20 Jul 2010 by Hellsheep
Post:
The answer is ghost work units.

Almost every host has a large number of ghosts. I have 400 on one host myself.

This means that a lot of results aren't actually being crunched.

Until they are, pendings will probably keep going up.
14) Message boards : Number crunching : New rescheduler (Message 1017543)
Posted 19 Jul 2010 by Hellsheep
Post:
I believe the reason running as a service under Win 7 or Vista doesn't work for GPU crunching is the way the drivers run under those OSes, something to do with WDDM. I think Joe posted a bit about this; I'll have a look and see if I can find the thread.

Also, the VLAR change has only taken effect on Beta as far as I know. If all goes well at Beta, it will be released here soon.
15) Message boards : Number crunching : Pending units (Message 1017542)
Posted 19 Jul 2010 by Hellsheep
Post:
Hellsheep try this in a cc_config file


<cc_config>
  <options>
    <report_results_immediately>1</report_results_immediately>
  </options>
</cc_config>

should help
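
If you would rather generate that file than type it by hand, something like the following should produce the same XML. It is a sketch only, assuming Python 3; the function name `build_cc_config` is invented for illustration, and the element names are taken straight from the snippet above.

```python
import xml.etree.ElementTree as ET


def build_cc_config(report_immediately=True):
    """Build the cc_config XML shown above and return it as a string."""
    root = ET.Element("cc_config")
    options = ET.SubElement(root, "options")
    flag = ET.SubElement(options, "report_results_immediately")
    flag.text = "1" if report_immediately else "0"
    # encoding="unicode" returns a str without an XML declaration.
    return ET.tostring(root, encoding="unicode")
```

The returned string would then be saved as `cc_config.xml` in the BOINC data directory, whose location depends on the platform and install, so check your own setup.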


Thanks, I'll try that.


See Message 993241 by Aurora Borealis:

Boinc uploads the results immediately on completion if the Internet is available. It tries to bundle the actual reporting of completed work to ease the load on the server. Accessing the database to report a group of results only adds a small percentage of overhead more than an individual report. Boinc also ensures that work is not reported late by following a set of reporting rules as previously posted by JM7

BOINC returns work to a project at the first of:

1) 24 hours before deadline.
2) Connect Every X before deadline.
3) 24 hours after completion.
4) 60 seconds after the completion of upload if upload completes after any of 1 to 3.
5) On a work request to that project.
6) With any other reported work for that project.
7) On a trickle up to the project (CPDN only as far as I know).
8) On a trickle down request (no projects use this as far as I know).
9) On a server specified schedule (some project uses this, but not certain which).
10) On a BOINC Account Manager specified connection (only if using a BOINC Account Manager).
11) When the user pushes the update button.


Rules 3, 5 and 6 are the ones that are most commonly seen being used especially if you are attached to more than one project.
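
As a rough illustration of the time-based rules 1 to 3 in the list above, the latest moment a finished result should still be waiting to be reported can be worked out like this. It is a sketch of the rules as written in this post, not BOINC's actual code; the function name `latest_report_time` is hypothetical, and the event-driven rules 4 to 11 are not modelled.

```python
from datetime import datetime, timedelta


def latest_report_time(completed, deadline, connect_every):
    """Earliest of the three time-based reporting triggers (rules 1 to 3)."""
    one_day = timedelta(hours=24)
    candidates = [
        deadline - one_day,        # rule 1: 24 hours before deadline
        deadline - connect_every,  # rule 2: "Connect Every X" before deadline
        completed + one_day,       # rule 3: 24 hours after completion
    ]
    return min(candidates)
```

For a task finished on 19 Jul 2010 with a deadline of 1 Aug 2010 and a 10 day connect interval, rule 3 comes first, so by these rules the result should be reported by 20 Jul.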


Regards,
Gundolf


I am aware of that post. The problem is that my client went through the whole weekend last week without reporting anything by itself; I had to report the results manually.
16) Message boards : Number crunching : Pending units (Message 1017469)
Posted 19 Jul 2010 by Hellsheep
Post:
I should have checked it out further. One of my computers ran dry overnight.

It just took a click of the update button to get reported and re-filled. I do not think I had anything deferred/restricted. Ah well, still about an 80% chance of ID-10-T failure, unless a bunch of others have similar happening.


Mine doesn't report at all, even when it's not deferred or in a backoff. I was about to post about this. I think there is a setting somewhere that you can set in the cc_config, maybe; I'll have to read up about it.

I've been reporting manually for the past week.
17) Message boards : Number crunching : is it possible to have done to much work to report? (Message 1016672)
Posted 17 Jul 2010 by Hellsheep
Post:
I pay $79.95 per month for 24,000/1,024 aka 24Mbit/1Mbit.

Max theoretical down speed of 3MB/s and max theoretical up speed of 113.7KB/s.

<3 it
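
For reference, the raw conversion behind figures like these is just a division by 8 (bits to bytes). The snippet below is a back-of-the-envelope sketch with a made-up helper name. It gives 3 MB/s down and 128 kB/s up before any overhead, so the lower 113.7KB/s quoted above presumably allows for protocol overhead or a different unit convention, which is not modelled here.

```python
def kbit_to_kbyte_per_s(kbit_per_s):
    """Convert a line rate in kbit/s to kB/s (8 bits per byte, no overhead)."""
    return kbit_per_s / 8


down_kb_s = kbit_to_kbyte_per_s(24_000)  # 3000.0 kB/s, i.e. 3 MB/s
up_kb_s = kbit_to_kbyte_per_s(1_024)     # 128.0 kB/s
```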
18) Message boards : Number crunching : more delay today it seems.... (Message 1016516)
Posted 17 Jul 2010 by Hellsheep
Post:
2 of my machines reported...but my i7 timed out.

17 Jul 2010 0:29:46 UTC


I was able to report as well.
19) Message boards : Number crunching : The project down due to a network problem. (Message 1016515)
Posted 17 Jul 2010 by Hellsheep
Post:
I could be wrong but I think they just turned on the data.


I can confirm this, I just reported 94 tasks.
20) Message boards : Number crunching : more delay today it seems.... (Message 1016415)
Posted 16 Jul 2010 by Hellsheep
Post:
Nooooo, wait, they can't get it going, I've still got almost two days worth of work to finish yet!!!! :-)


Noooo! My GPU stopped having work 2 days ago and my CPU only has enough for another day! Noooooo! Oh wait... I set no new tasks so I can detach to remove the ~400 ghosts I have... never mind.



 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.