An Actual Computer Bug (Apr 17 2007)

Message boards : Technical News : An Actual Computer Bug (Apr 17 2007)

Matt Lebofsky
Volunteer moderator
Project administrator
Project developer
Project scientist
Joined: 1 Mar 99
Posts: 1444
Credit: 957,058
RAC: 0
United States
Message 547966 - Posted: 17 Apr 2007, 22:18:55 UTC

The BOINC web server (isaac) had its root partition fill up this morning. No big deal but the site was down for a bit as Eric cleaned that up.

During the outage we cleaned up the remaining master/replica database discrepancies and finally put sidious on UPS. Yup - it was running without a net for the past however many weeks. Well, not a direct net - we always had a replica database that was on UPS, as well as recent backup dumps. The "reorg" part took much longer than last week - perhaps due to the result/workunit tables being exercised by the new quorum settings.

While sidious was powered down I replaced the keyboard (it was using a flaky USB keyboard salvaged from a first-generation iMac) and removed the case to inspect its RAM (so we have exact specs in the event of an upgrade). I popped open one of the memory banks and found that, at some point, a spider had taken up residence inside. Not really a wise choice on its part. The webs and carcass of the long-deceased critter were removed before putting the memory back.

Once again, db_dump is running at the time of writing, seemingly successfully. There were some MySQL configuration settings we were experimenting with last week. Though it's not obvious why, one of these may have been forcing the long db_dump queries to time out. Anyway, we shall see... it just wrapped up the user table sans hitch.

- Matt

-- BOINC/SETI@home network/web/science/development person
-- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude
ID: 547966
littlegreenmanfrommars
Volunteer tester
Joined: 28 Jan 06
Posts: 1410
Credit: 934,158
RAC: 0
Australia
Message 548012 - Posted: 17 Apr 2007, 23:34:31 UTC

Sounds good Matt!

Fingers crossed here
ID: 548012
KB7RZF
Volunteer tester
Joined: 15 Aug 99
Posts: 9549
Credit: 3,308,926
RAC: 2
United States
Message 548036 - Posted: 17 Apr 2007, 23:48:50 UTC

Thank you again, Matt. Glad things went a bit more smoothly for you guys. I'll be keeping my fingers crossed too.
ID: 548036
Graeme of Boinc UK
Joined: 25 Nov 02
Posts: 114
Credit: 1,250,273
RAC: 0
United Kingdom
Message 548048 - Posted: 17 Apr 2007, 23:58:13 UTC

Well done.

I and many others are looking forward to the statistics coming back.

Graeme.


ID: 548048
speedimic
Volunteer tester
Joined: 28 Sep 02
Posts: 362
Credit: 16,590,653
RAC: 0
Germany
Message 548051 - Posted: 18 Apr 2007, 0:00:09 UTC

[quote]While sidious was powered down I replaced the keyboard (it was using a flaky USB keyboard salvaged from a first-generation iMac) and removed the case to inspect its RAM (so we have exact specs in the event of upgrade).[/quote]

Talking about specs - can we have the specs for the missing CPU?

mic.


ID: 548051
ML1
Volunteer moderator
Volunteer tester
Joined: 25 Nov 01
Posts: 20318
Credit: 7,508,002
RAC: 20
United Kingdom
Message 548145 - Posted: 18 Apr 2007, 1:40:18 UTC - in response to Message 548051.  
Last modified: 18 Apr 2007, 1:40:40 UTC

While sidious was powered down...

talking about specs - can we have the specs for the missing cpu?

Is it actually known to be a CPU problem, or is it a motherboard problem?...

Regards,
Martin

See new freedom: Mageia Linux
Take a look for yourself: Linux Format
The Future is what We all make IT (GPLv3)
ID: 548145
Nicolas
Joined: 30 Mar 05
Posts: 161
Credit: 12,985
RAC: 0
Argentina
Message 548189 - Posted: 18 Apr 2007, 2:39:23 UTC

A spider on a server? Not good >_<

Contribute to the Wiki!
ID: 548189
john galbraith
Joined: 9 Aug 02
Posts: 3
Credit: 49,710
RAC: 0
United States
Message 548244 - Posted: 18 Apr 2007, 4:06:37 UTC

I've been trying to upload results for the last 3 hours and get the message that the project servers may be down. Maybe that bug wasn't dead after all.
ID: 548244
Bluegrass
Joined: 20 May 99
Posts: 11
Credit: 218,479
RAC: 0
United States
Message 548281 - Posted: 18 Apr 2007, 5:22:59 UTC - in response to Message 548244.  

I've been trying to upload results for the last 3 hours and get the message that the project servers may be down. Maybe that bug wasn't dead after all.


I have also been trying to upload for 18 hours with no connection. I get the following:
Access to reference site succeeded. Project servers may be down.

I checked and servers are not down.
What gives????
ID: 548281
sideband@seti.usa
Joined: 19 Jun 99
Posts: 25
Credit: 2,774,864
RAC: 0
United States
Message 548288 - Posted: 18 Apr 2007, 5:45:43 UTC - in response to Message 548281.  

I've been trying to upload results for the last 3 hours and get the message that the project servers may be down. Maybe that bug wasn't dead after all.


I have also been trying to upload for 18 hours with no connection. I get the following:
Access to reference site succeeded. Project servers may be down.

I checked and servers are not down.
What gives????


It's all part of the usual friendly, neighborhood weekly Berking.. Of course, someone will realize that something is wrong in due time, and then plug the cable back in, or start the service that didn't start, or whatever, and things will be back to normal...ish.

According to the site status page, all the assimilators are down.. so that could be a clue.
73 de AI8W, Chris

Abdico Concussio Fidens Servo Libertas Semper!

ID: 548288
Ernst-Friedrich Henrich
Joined: 11 Oct 06
Posts: 2
Credit: 6,853
RAC: 0
Germany
Message 548347 - Posted: 18 Apr 2007, 7:42:06 UTC - in response to Message 548288.  

Nothing will be back to normal. Still the other way round.

ID: 548347
Foldgate
Joined: 29 Sep 01
Posts: 3
Credit: 1,240,167
RAC: 0
Netherlands
Message 548371 - Posted: 18 Apr 2007, 9:13:19 UTC

There is something weird with the uploads and downloads. Some WUs behave as they are supposed to and download flawlessly.
I also have an upload that has been stuck at 2.05% for hours now, and a download that has not even begun and has been stuck for hours.
I have tried to resend them manually several times, but to no avail.
ID: 548371
Daniel S. Gandolfo
Joined: 17 Nov 02
Posts: 56
Credit: 2,430,947
RAC: 0
United Kingdom
Message 548392 - Posted: 18 Apr 2007, 9:35:42 UTC
Last modified: 18 Apr 2007, 9:36:26 UTC

Hello,

I am having the same problem. I cannot upload my completed WUs. I now have three (and counting) WUs stacked up waiting to upload. They download OK, but uploading is the problem.
ID: 548392
John Clark
Volunteer tester
Joined: 29 Sep 99
Posts: 16515
Credit: 4,418,829
RAC: 0
United Kingdom
Message 548399 - Posted: 18 Apr 2007, 9:49:28 UTC

Check with Number Crunching. Two similar threads, including Panic on #4
It's good to be back amongst friends and colleagues



ID: 548399
Ernst-Friedrich Henrich
Joined: 11 Oct 06
Posts: 2
Credit: 6,853
RAC: 0
Germany
Message 548414 - Posted: 18 Apr 2007, 10:12:37 UTC - in response to Message 548347.  

Nothing will be back to normal. Still the other way round.

Now the work stops at this point. I can't download anything and can't upload either.
And the program's constant retrying is hurting my network traffic.
ID: 548414
Dr. C.E.T.I.
Joined: 29 Feb 00
Posts: 16019
Credit: 794,685
RAC: 0
United States
Message 548426 - Posted: 18 Apr 2007, 10:35:57 UTC


Thanks for thE updatE Matt . . . darnEd spidErz eh . . .

06:36 AM EST(US)

fix_missing_results kryten Disabled
sah_assimilator1 kryten Disabled
sah_assimilator2 kryten Disabled
sah_assimilator3 kryten Disabled
sah_assimilator4 kryten Disabled
ID: 548426
Luka Prijic
Joined: 12 Sep 04
Posts: 7
Credit: 938,335
RAC: 0
Serbia
Message 548465 - Posted: 18 Apr 2007, 12:11:47 UTC

kryten is disabled, probably for maintenance, and the staff disabled it, so they probably know what they are doing (cleaning spiders' nests :) )
ID: 548465
Wander Saito
Volunteer tester
Joined: 7 Jul 03
Posts: 555
Credit: 2,136,061
RAC: 0
Brazil
Message 548478 - Posted: 18 Apr 2007, 12:39:03 UTC

With all these UL/DL problems, I'm wondering: maybe the spider corpse or its web were connecting some vital circuit in the motherboard or the memory chip. :)

Kidding aside, I once saw a TV news piece showing that an entire ant colony had taken over a computer as its new home, and the machine still worked!

Regards,
Wander

ID: 548478
Arthur L. Smith
Joined: 17 Apr 02
Posts: 28
Credit: 244,050,922
RAC: 9
United States
Message 548480 - Posted: 18 Apr 2007, 12:45:06 UTC

We are having upload problems with all of our machines this morning also!!!
ID: 548480
Rob.B
Joined: 23 Jul 99
Posts: 157
Credit: 1,439,682
RAC: 0
United Kingdom
Message 548494 - Posted: 18 Apr 2007, 13:31:08 UTC

My machines are stalled also, both Wintel and Linux, with a spread of client versions. I'm running on cached units and that ain't massive. Look out Einstein, here I come!
ID: 548494


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.