SETI orphans @World Community Grid

Message boards : Number crunching : SETI orphans @World Community Grid
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next

AuthorMessage
Profile ML1
Volunteer moderator
Volunteer tester

Send message
Joined: 25 Nov 01
Posts: 20147
Credit: 7,508,002
RAC: 20
United Kingdom
Message 2099593 - Posted: 15 May 2022, 12:02:28 UTC - in response to Message 2099473.  

It's as if IBM just threw the "ball" over to Krembil, without giving them much support at all, and probably not much of the documentation IBM should have created over the years. There's probably hundreds of undocumented "features" that will come up and bite Krembil when/if they manage to get WCG going again.

Yup - that would be IBM's approach to things. Loads of undocumented "features", inadequate change tracking, but at the same time claiming "everything you need is there"....

Those are the cloud in the sky claims of too many IT clouds...

Sounds like rather than a simple turnkey migration, the group has had to debug and fix the cloud system itself...

Good luck!

Keep searchin',
Martin
See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 2099593 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2099908 - Posted: 21 May 2022, 1:47:22 UTC - in response to Message 2099899.  
Last modified: 21 May 2022, 1:47:59 UTC

Do we have faith in that the 24th, will be the day WCG restarts?

No. That's like asking if the Pope is Catholic!
ID: 2099908 · Report as offensive     Reply Quote
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 2099984 - Posted: 22 May 2022, 22:05:54 UTC

Twitt

QA testing has finally successfully finished and all the bugs have been resolved. Production environment is being tested right now.

Considering the unexpected issues we ran into, we prefer to test it a couple of days more.

We will provide further details in the next few days.

Few days...
ID: 2099984 · Report as offensive     Reply Quote
JohnDK Crowdfunding Project Donor*Special Project $250 donor
Volunteer tester
Avatar

Send message
Joined: 28 May 00
Posts: 1222
Credit: 451,243,443
RAC: 1,127
Denmark
Message 2100326 - Posted: 28 May 2022, 20:25:21 UTC

Well...

We were unable to bring our production environment to the same state as the QA environment this week. As we also have yet to resolve an issue that prevents BOINC clients from downloading workunits, the effort to bring the Grid back online has stretched into next week.
Seems a basic thing.
ID: 2100326 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2100341 - Posted: 28 May 2022, 22:12:03 UTC - in response to Message 2100326.  

I wonder if they understand the difference between the meta-descriptions (workunits, tasks) and the required files? The client doesn't "download" a workunit: that's something that we probably understand better than they do. But without a functioning message board, there's no way to help them out.
ID: 2100341 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2100360 - Posted: 29 May 2022, 2:50:48 UTC - in response to Message 2100343.  
Last modified: 29 May 2022, 2:51:32 UTC

Well, I'd say that the sentence could well be shortened to "I wonder if they understand"

Anyhow, for those with deep knowledge about the inner workings of BOINC, getting in touch with Dr. Igor Jurisica, who is
responsible for the Krembil takeover of WCG, could be pretty easy. Perhaps offer som help, something that IBM obviously
haven't done. I'd say they are stuck, and it's a lot they do not understand.

Contact page for Jurisica: https://www.cs.toronto.edu/~juris/jlab/contact.html

Edit: Or at least tell them to start the forum, so they can get help in that way.


Krembel should contact the Official Google boinc_dev mailing list/forum (if they already have not done so).
It's For people developing, debugging or porting the BOINC software (client, server, and Web).
ID: 2100360 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2100365 - Posted: 29 May 2022, 6:19:06 UTC - in response to Message 2100343.  

for those with deep knowledge about the inner workings of BOINC,
If you're looking at me, I'll have to sit this one out. I'm a long way from home (touring Portugal), and will spend three of the next five days on a succession of trains to get home - up to 12 hours a day. And when I get home, I'm going to sleep....

WCG has always been a bit of an outlier in the BOINC world, so the raw BOINC documentation won't help much (and it hasn't been very well maintained for recent changes in the last few years) - this will have to be approached at a conceptual level, rather than RTFM. I've been invited to a Google meet for BOINC developers about an hour after my last train lands, so I might drop in on that and plant the idea.

If WCG are still stuck, and the trains run on time, that is...
ID: 2100365 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2100387 - Posted: 29 May 2022, 14:38:58 UTC - in response to Message 2100384.  
Last modified: 29 May 2022, 14:39:36 UTC

Thanks. If I was in a position to chat with them reasonably quickly - which I might be on Wednesday and Thursday, when I'm spending long continuous periods of time on well equipped trains with power and wifi - I think I'd suggest that one of the researchers took a machine home (or borrowed their wife's/husband's): tried to join WCG or their QA project; set the http_debug event log flag; and took the resulting log back to the lab the next day for analysis. It's more likely a connection error, not a transfer error per se.

I've sent a copy of the lack-of-progress note to the convenor of the BOINC meeting, and suggested that it might be helpful for someone experienced to act as a "conceptual mentor" - someone to help them find the right chapter and page in the manual, before they start reading the nuts and bolts.
ID: 2100387 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2100457 - Posted: 30 May 2022, 19:08:50 UTC

This is just a wild a$$ guess but... Maybe this is why IBM abandoned WCG. They had more important things to do:

IBM-powered Mayflower robo-ship once again tries to cross Atlantic
ID: 2100457 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2100518 - Posted: 1 Jun 2022, 1:30:40 UTC

Well, it's Tuesday May 31 and there's still a few hours left in the calendar day. In North America.

Krembel still hasn't posted any news on Twitter, Facebook or project website about the fixing of WCG.

Seriously makes me wonder how well they are going to actively handle a living, breathing project of this magnitude?
ID: 2100518 · Report as offensive     Reply Quote
Profile ML1
Volunteer moderator
Volunteer tester

Send message
Joined: 25 Nov 01
Posts: 20147
Credit: 7,508,002
RAC: 20
United Kingdom
Message 2100529 - Posted: 1 Jun 2022, 9:28:53 UTC

Working through the undocumented foibles of a new cloud migration can be an 'adventurous' project in itself.

(Don't believe the cloudy Marketing hype...)


Keep searchin',
Martin
See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 2100529 · Report as offensive     Reply Quote
Profile ML1
Volunteer moderator
Volunteer tester

Send message
Joined: 25 Nov 01
Posts: 20147
Credit: 7,508,002
RAC: 20
United Kingdom
Message 2100604 - Posted: 2 Jun 2022, 12:30:07 UTC - in response to Message 2100529.  
Last modified: 2 Jun 2022, 12:37:55 UTC

Ouch!...


Just had two of my physical servers physically yanked over to a different datacentre elsewhere in London...

No idea if the entire rack was forklifted or if the individual servers were carried individually...

And the aftermath is...

By chance, one of the servers was already shut down for an upgrade so all is benign for that one.

However, the second server was still live and picked up the signals for a graceful shutdown... Except when powered up again, the startup failed safe to not expose the services... There was an uncomfortable long pause to activate the new network routing and then a few hours to discover the one missing feature after the move... Now all running as before with less than a millisecond difference in the connections.

All part of the fun of relying on someone else's infrastructure...


To be fair, that has been the first time in over two years. But then... First times always involve lots of coffee...

Enjoy!
Martin

(Yes... I'd missed the maintenance email warning about the move... Immediately afterwards noticed and realization dawned that one moment too late...)
See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 2100604 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2100665 - Posted: 3 Jun 2022, 17:09:14 UTC

What is another missed deadline since it's now over 3 months since the "migration" started?

Half the day is gone in North America. It looks like Krembil will go without another update today and this week.
ID: 2100665 · Report as offensive     Reply Quote
BetelgeuseFive Project Donor
Volunteer tester

Send message
Joined: 6 Jul 99
Posts: 158
Credit: 17,117,787
RAC: 19
Netherlands
Message 2100704 - Posted: 4 Jun 2022, 8:02:55 UTC - in response to Message 2100703.  

New update on the WCG Website., and it sure doesn't seem as if they are even close to restart the project.
Nothing on Facebook, or Twitter yet though.

WCG June 3rd Update



I'm probably an old cinic, but the way I read it is:

We are having technical problems so we have decided to expand our marketing department.

They probably have to spin a lot of bad news in the upcoming period.
Poor intern ...
ID: 2100704 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2100761 - Posted: 5 Jun 2022, 5:16:42 UTC - in response to Message 2100704.  

... They probably have to spin a lot of bad news in the upcoming period.
Poor intern ...

So, this is delayed start #7. Will lady-luck be with them or will it just be another crap shot?

My bets and odds fall on the latter, rather than the first one.
ID: 2100761 · Report as offensive     Reply Quote
Sirius B Project Donor
Volunteer tester
Avatar

Send message
Joined: 26 Dec 00
Posts: 24877
Credit: 3,081,182
RAC: 7
Ireland
Message 2100832 - Posted: 6 Jun 2022, 14:19:09 UTC - in response to Message 2100820.  

Yep. 9th Nov only to crash a week later on the 18th Birthday challenge.
ID: 2100832 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2101001 - Posted: 8 Jun 2022, 18:00:22 UTC

ID: 2101001 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2101211 - Posted: 12 Jun 2022, 16:11:22 UTC
Last modified: 12 Jun 2022, 16:12:41 UTC

Well it's the start of a new week, will the gerbils at Krembil actually restart WCG even partially?

My bets are on more technical issues being found and little information being presented to us about what broke this time.

I suspect IBM dropped gave away the White Elephant project probably because it wasn't making any money.

Krembil took on the project and is now deep in the muck trying to fix what sounds like proprietary software without free IBM support.
ID: 2101211 · Report as offensive     Reply Quote
Richard Haselgrove Project Donor
Volunteer tester

Send message
Joined: 4 Jul 99
Posts: 14649
Credit: 200,643,578
RAC: 874
United Kingdom
Message 2101213 - Posted: 12 Jun 2022, 17:05:14 UTC - in response to Message 2101211.  

I suspect IBM dropped gave away the White Elephant project probably because it wasn't making any money.
Probably true, but that was by design - it was under IBM's philanthropic wing.
ID: 2101213 · Report as offensive     Reply Quote
Dr Who Fan
Volunteer tester
Avatar

Send message
Joined: 8 Jan 01
Posts: 3193
Credit: 715,342
RAC: 4
United States
Message 2101327 - Posted: 14 Jun 2022, 21:43:42 UTC - in response to Message 2101302.  

Still not a sign of life from WCG, or any update since June 3rd.

The sound cricket chirping is deafening...
As of today, 2022-06-14 @ 1735 EDT (2235 UTC) no updates since 22-06-03. The gerbils must have fallen off the wheel and can't get back up
ID: 2101327 · Report as offensive     Reply Quote
Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next

Message boards : Number crunching : SETI orphans @World Community Grid


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.