Panic Mode On (112) Server Problems?

TBar
Volunteer tester

Joined: 22 May 99
Posts: 5204
Credit: 840,779,836
RAC: 2,768
United States
Message 1936494 - Posted: 21 May 2018, 1:57:50 UTC
Last modified: 21 May 2018, 2:02:16 UTC

Back to the REAL Server Problems....

It would appear the 'FIX' for the Windows ATI AstroPulse Fiasco has been Broken.
It's common knowledge the Newer Windows drivers Break the ATI AstroPulse App, resulting in very few Signals being reported. This was supposedly Fixed by NOT assigning a Windows ATI Host as a Tiebreaker, or even matching two Windows ATI GPUs on an AstroPulse task.
Well, they are back to cross validating and giving Hosts producing the CORRECT results FALSE Invalids. Why can't this be solved? It's been like this for well over a couple of Years, long enough to totally pollute the AP Database.
This is the Wrong Result;
Task        Computer  Sent                       Time reported              Status                        Run time  CPU time  Credit  Application
6648837963  7769537   17 May 2018, 23:19:16 UTC  18 May 2018, 3:33:53 UTC   Completed, marked as invalid  1,737.41  222.33    0.00    AstroPulse v7 Anonymous platform (ATI GPU)
6648837964  8119708   17 May 2018, 23:19:16 UTC  18 May 2018, 13:16:51 UTC  Completed and validated       2,481.45  51.31     531.55  AstroPulse v7 v7.09 (opencl_ati_100) windows_intelx86
6650504186  6845948   18 May 2018, 13:17:04 UTC  20 May 2018, 17:32:12 UTC  Completed and validated       1,126.88  153.52    531.55  AstroPulse v7 v7.09 (opencl_ati_100) windows_intelx86

This Host has Numerous Invalid APs, https://setiathome.berkeley.edu/results.php?hostid=8119708&state=5
This Host only Validates when the Signal count is Low, or matched with another Windows ATI Host, https://setiathome.berkeley.edu/results.php?hostid=6845948&appid=20
This Only happens with the recent Windows drivers, somewhere after AMD App 1800. The Linux and Mac ATI Hosts don't have this problem.
ID: 1936494
Speedy
Volunteer tester
Joined: 26 Jun 04
Posts: 1643
Credit: 12,921,799
RAC: 89
New Zealand
Message 1936502 - Posted: 21 May 2018, 4:55:17 UTC
Last modified: 21 May 2018, 4:57:02 UTC

Not that this is anything server related, but I noticed I had a number of tasks dated the 19th, then I looked and realised that the data was recorded only 2 days ago: 19my18aa.23825.885.14.41.98_1. Now that's what I call real time processing.

I have also noticed that this morning I had a good number, probably 5, of BLC tasks that exited after about 22 seconds.
ID: 1936502
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14653
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1936511 - Posted: 21 May 2018, 7:46:25 UTC - in response to Message 1936494.  

Back to the REAL Server Problems....

It would appear the 'FIX' for the Windows ATI AstroPulse Fiasco has been Broken.
It's common knowledge the Newer Windows drivers Break the ATI AstroPulse App....
Last time I spent a weekend working with Raistmer on a suspected driver problem (intel_gpu app on 6th generation HD 630 iGPU), he tried everything he could think of - driver, Intel OpenCL FFT libraries, the whole shooting match.

It turned out to be a compiler optimisation (fused multiply+add) on his build machine - the compiler was selecting a quicker, but less precise, code path.

Somebody needs to revisit that app with a similarly open mind.
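
To illustrate the kind of discrepancy a fused multiply+add can introduce, here is a minimal Python sketch. It is not the AstroPulse code, and it says nothing about which path is the right one for that app. The input values are picked purely for illustration, and the fused result is emulated with exact rational arithmetic (a single rounding at the end) rather than with a hardware FMA instruction.

```python
from fractions import Fraction


def unfused_mul_add(a: float, b: float, c: float) -> float:
    """Multiply, round to double, then add and round again (two roundings)."""
    return a * b + c


def emulated_fma(a: float, b: float, c: float) -> float:
    """Emulate a fused multiply-add: compute a*b + c exactly, round once."""
    exact = Fraction(a) * Fraction(b) + Fraction(c)
    return float(exact)


# Illustrative inputs only: the exact product is 1 - 2**-60, which a
# double cannot hold, so the unfused path rounds it to 1.0 before the add.
a = 1.0 + 2.0**-30
b = 1.0 - 2.0**-30
c = -1.0

two_roundings = unfused_mul_add(a, b, c)
one_rounding = emulated_fma(a, b, c)
print(two_roundings, one_rounding)
```

The same inputs give two different answers depending on whether the multiply and add are fused, which is enough to make two hosts disagree when the difference feeds a threshold test.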
ID: 1936511
Stephen "Heretic" Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1936707 - Posted: 22 May 2018, 22:09:34 UTC - in response to Message 1936701.  

So, what happened with the outage today?
The site came back a long time ago, but the important servers and DB's are still down.


. . Yep, I just got a feeder not running message ...

Stephen

? ?
ID: 1936707
Wiggo
Joined: 24 Jan 00
Posts: 34858
Credit: 261,360,520
RAC: 489
Australia
Message 1936708 - Posted: 22 May 2018, 22:29:56 UTC
Last modified: 22 May 2018, 22:45:36 UTC

All finished work returned, now just waiting for fresh work to arrive.

[edit] And they now begin to arrive.

Cheers.
ID: 1936708
betreger Project Donor
Joined: 29 Jun 99
Posts: 11361
Credit: 29,581,041
RAC: 66
United States
Message 1936716 - Posted: 23 May 2018, 0:44:56 UTC

Downloads are stalled on both boxes.
ID: 1936716
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1936717 - Posted: 23 May 2018, 0:47:01 UTC - in response to Message 1936716.  

Downloads are stalled on both boxes.

I just came here to see if it was just me.

Looks like it isn't.
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1936717
Wiggo
Joined: 24 Jan 00
Posts: 34858
Credit: 261,360,520
RAC: 489
Australia
Message 1936718 - Posted: 23 May 2018, 0:50:49 UTC

Caches are full, but you just have to get them to download.

Cheers.
ID: 1936718
Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1936720 - Posted: 23 May 2018, 1:15:45 UTC - in response to Message 1936718.  

Any special trick used?
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1936720
Stargate (SA)
Volunteer tester
Joined: 4 Mar 10
Posts: 1854
Credit: 2,258,721
RAC: 0
Australia
Message 1936723 - Posted: 23 May 2018, 2:15:14 UTC - in response to Message 1936720.  

Any special trick used?


Crack a can of beer per download!!

Hic---Hic---Hic :))
ID: 1936723
Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1936724 - Posted: 23 May 2018, 2:16:43 UTC - in response to Message 1936723.  

Any special trick used?


Crack a can of beer per download!!

Hic---Hic---Hic :))

Ha Ha . . . I can get with that program LOL. Gin&Tonics for me.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1936724
Stargate (SA)
Volunteer tester
Joined: 4 Mar 10
Posts: 1854
Credit: 2,258,721
RAC: 0
Australia
Message 1936727 - Posted: 23 May 2018, 2:18:38 UTC

;-)P
ID: 1936727
Stephen "Heretic" Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1936730 - Posted: 23 May 2018, 2:38:16 UTC - in response to Message 1936716.  

Downloads are stalled on both boxes.


. . Yep, I've been having stalled and very slow downloads since the outage too, but it seems to be improving.

Stephen

:(
ID: 1936730
Stephen "Heretic" Crowdfunding Project Donor, Special Project $75 donor, Special Project $250 donor
Volunteer tester
Joined: 20 Sep 12
Posts: 5557
Credit: 192,787,363
RAC: 628
Australia
Message 1936731 - Posted: 23 May 2018, 2:39:36 UTC - in response to Message 1936724.  

Any special trick used?


Crack a can of beer per download!!

Hic---Hic---Hic :))

Ha Ha . . . I can get with that program LOL. Gin&Tonics for me.


. . Make mine a Margarita ....

Stephen

:)
ID: 1936731
Stargate (SA)
Volunteer tester
Joined: 4 Mar 10
Posts: 1854
Credit: 2,258,721
RAC: 0
Australia
Message 1936733 - Posted: 23 May 2018, 3:26:52 UTC - in response to Message 1936731.  
Last modified: 23 May 2018, 3:28:20 UTC

Any special trick used?


Crack a can of beer per download!!

Hic---Hic---Hic :))

Ha Ha . . . I can get with that program LOL. Gin&Tonics for me.


. . Make mine a Margarita ....

Stephen

:)


Bugger it, have all 3 types..

;-) End of the day we will think we have found ET..
ID: 1936733
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13746
Credit: 208,696,464
RAC: 304
Australia
Message 1936769 - Posted: 23 May 2018, 9:29:36 UTC
Last modified: 23 May 2018, 9:31:51 UTC

A couple more hours & we'll be out of work.
Since the outage, the splitters just haven't managed to really get going; the best they've been able to do is to meet present demand (which isn't all that high). Most of the time they've been well below the level of demand. The Ready-to-send buffer continues to empty.
Maybe once the backlog of AP & MB WU deletions clears they'll be able to pump out some work (that particular malaise seems to have returned as well).
Grant
Darwin NT
ID: 1936769
Bruce
Volunteer tester

Joined: 15 Mar 02
Posts: 123
Credit: 124,955,234
RAC: 11
United States
Message 1936781 - Posted: 23 May 2018, 12:20:36 UTC

I see that the RTS is almost empty. Splitters never did take off after the outage. I wonder if it has anything to do with the Resend Lost Tasks; the first two tasks that I got were the two Ghosts that I had. Of course, they could just need a size twelve applied to the right spot. The guys will get them going when they come in.
Bruce
ID: 1936781
Unixchick Project Donor
Joined: 5 Mar 12
Posts: 815
Credit: 2,361,516
RAC: 22
United States
Message 1936812 - Posted: 23 May 2018, 15:08:51 UTC

RTS still going down (around 70K at the moment). I'm surprised it has lasted this long. All the numbers look good except the creation rate. Hopefully someone will soon be in to give the machine some percussive maintenance, as was previously suggested.
ID: 1936812
Richard Haselgrove Project Donor
Volunteer tester

Joined: 4 Jul 99
Posts: 14653
Credit: 200,643,578
RAC: 874
United Kingdom
Message 1936829 - Posted: 23 May 2018, 17:28:04 UTC

It's showing 54/sec and 101K available now - I'd say dawn has broken in California.
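
As a rough cross-check on how fast the buffer is building, the two figures quoted in this thread (around 70K at 15:08:51 UTC and 101K at 17:28:04 UTC) can be turned into an average net fill rate. Both readings are approximate, the buffer was still falling at the first one, and the variable names below are made up for illustration, so treat the result as a ballpark only.

```python
from datetime import datetime

# Approximate Ready-to-send readings taken from the two posts in this thread.
t_first = datetime(2018, 5, 23, 15, 8, 51)   # "around 70K at the moment"
t_second = datetime(2018, 5, 23, 17, 28, 4)  # "101K available now"
rts_first = 70_000
rts_second = 101_000

elapsed_s = (t_second - t_first).total_seconds()
net_rate = (rts_second - rts_first) / elapsed_s  # net tasks gained per second

print(f"{elapsed_s:.0f} s elapsed, net gain about {net_rate:.1f} tasks/sec")
```

That net figure is what is left over after demand has been served, so it is much smaller than the 54/sec creation rate.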
ID: 1936829
Keith Myers Special Project $250 donor
Volunteer tester
Joined: 29 Apr 01
Posts: 13164
Credit: 1,160,866,277
RAC: 1,873
United States
Message 1936858 - Posted: 23 May 2018, 22:15:52 UTC - in response to Message 1936829.  

The splitters are still struggling to keep up with demand and having a hard time building the RTS buffer.
Seti@Home classic workunits:20,676 CPU time:74,226 hours

A proud member of the OFA (Old Farts Association)
ID: 1936858


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.