News From Rom! 12/10/04

Profile Captain Avatar
Volunteer tester
Joined: 17 May 99
Posts: 15133
Credit: 529,088
RAC: 0
United States
Message 52002 - Posted: 7 Dec 2004, 14:36:48 UTC - in response to Message 51934.  
Last modified: 7 Dec 2004, 16:05:26 UTC

> UPDATE: Here is a new rogue computer, 12/06/2004
>
> Boinc@hellas
> http://setiweb.ssl.berkeley.edu/show_host_detail.php?hostid=390922

Boinc@hellas
EDIT: http://setiweb.ssl.berkeley.edu/hosts_user.php?userid=7800916
>
> mikey2345 is at it again!
> http://setiweb.ssl.berkeley.edu/hosts_user.php?userid=701952
>
>
Bump
ID: 52002 · Report as offensive
Profile Captain Avatar
Volunteer tester
Joined: 17 May 99
Posts: 15133
Credit: 529,088
RAC: 0
United States
Message 52010 - Posted: 7 Dec 2004, 16:00:36 UTC - in response to Message 52002.  
Last modified: 7 Dec 2004, 16:04:25 UTC

Boinc@hellas
http://setiweb.ssl.berkeley.edu/hosts_user.php?userid=7800916

Mikey2345 is still ALIVE!

ID: 52010 · Report as offensive
wrzwaldo
Joined: 16 Jul 00
Posts: 113
Credit: 1,073,284
RAC: 0
United States
Message 52013 - Posted: 7 Dec 2004, 16:23:13 UTC

http://setiweb.ssl.berkeley.edu/show_host_detail.php?hostid=391793

Has there been an explanation of how this happens?
ID: 52013 · Report as offensive
Profile littleBouncer
Volunteer tester
Joined: 28 May 99
Posts: 151
Credit: 666,283
RAC: 0
Switzerland
Message 52015 - Posted: 7 Dec 2004, 16:31:56 UTC - in response to Message 50937.  
Last modified: 7 Dec 2004, 17:01:43 UTC

> @Timmy
>
>
>
======
Take a look here:

user: Drweaser
result:
http://setiweb.ssl.berkeley.edu/workunit.php?wuid=5421777 , see host 53673
Computers? Merged?
http://setiweb.ssl.berkeley.edu/hosts_user.php?userid=5800

Great, over 100 hosts (detach/attach)!!! (meant ironically)

ID: 52015 · Report as offensive
JAF
Joined: 9 Aug 00
Posts: 289
Credit: 168,721
RAC: 0
United States
Message 52028 - Posted: 7 Dec 2004, 18:00:32 UTC - in response to Message 51913.  

> Sorry to bring this back up
> But this needs looking at
>
> http://setiweb.ssl.berkeley.edu/show_host_detail.php?hostid=390922
>
> I stopped looking after 15 pages...
> (edit)
> 40 pages with 2 dec Wu's so far WTF.....
>
>
>
I just looked at the results for this host and he or she has at least 31,200 unprocessed work units! The very few listed at the end all have zero CPU time.

Ah, maybe someone did something about this problem - I just tried to look again before posting this and got the "couldn't find computer" message.
ID: 52028 · Report as offensive
Profile THESPEEKER
Joined: 3 Apr 99
Posts: 168
Credit: 48,990
RAC: 0
United Kingdom
Message 52037 - Posted: 7 Dec 2004, 18:56:04 UTC - in response to Message 52028.  

> > Sorry to bring this back up
> > But this needs looking at
> >
> > http://setiweb.ssl.berkeley.edu/show_host_detail.php?hostid=390922
> >
> > I stopped looking after 15 pages...
> > (edit)
> > 40 pages with 2 dec Wu's so far WTF.....
> >
> >
> >
> I just looked at the results for this host and he or she has at least 31,200
> unprocessed work units! The very few listed at the end all have zero CPU
> time.
>
> Ah, maybe someone did something about this problem - I just tried to look again
> before posting this and got the "couldn't find computer" message.

Thanks, JAF (thumbs up). Maybe the devs have sorted it
and will reissue all the WUs ASAP rather than wait for
the deadline....

ID: 52037 · Report as offensive
Profile THESPEEKER
Joined: 3 Apr 99
Posts: 168
Credit: 48,990
RAC: 0
United Kingdom
Message 52043 - Posted: 7 Dec 2004, 19:21:46 UTC - in response to Message 52037.  

> > > Sorry to bring this back up
> > > But this needs looking at
> > >
> > > http://setiweb.ssl.berkeley.edu/show_host_detail.php?hostid=390922
> > >
> > > I stopped looking after 15 pages...
> > > (edit)
> > > 40 pages with 2 dec Wu's so far WTF.....
> > >
> > >
> > >
> > I just looked at the results for this host and he or she has at least 31,200
> > unprocessed work units! The very few listed at the end all have zero CPU
> > time.
> >
> > Ah, maybe someone did something about this problem - I just tried to look again
> > before posting this and got the "couldn't find computer" message.
>
> Thanks Jaf(thumb up) Maybe the Dev's have sorted it
> and will Re-Issue all the Wu's ASAP and not wait for
> the Deadline....
>
Now whoever it is is under the Host
http://setiweb.ssl.berkeley.edu/show_host_detail.php?hostid=391793
but it has risen to 1584 from 1460....
ID: 52043 · Report as offensive
Profile Stephen Balch
Joined: 20 Apr 00
Posts: 141
Credit: 13,912
RAC: 0
United States
Message 52059 - Posted: 7 Dec 2004, 21:20:07 UTC

As of today, 7 Dec 2004, at about 21:00:06 UTC:

A quick look at mikey2345's computers shows a new computer created today (07 Dec 2004), so he's back up to four computers. A quick count of mikey2345's results (those still on the server, at least) shows the following (WUs downloaded at various times on these dates):
Download date    Results
 5 Nov 2004         36   (results past deadline)
29 Nov 2004        101
30 Nov 2004          9
 1 Dec 2004        198
                  ----
Current total      344

Drweaser only has 90 computers currently listed: one Wintel box, one Linux box, 26 PowerBooks and 62 PowerMacs of various persuasions. The PowerMacs do look a bit suspicious. It looks to me like he only has two PowerBooks, but I'm not even going to estimate the PowerMacs.

http://setiweb.ssl.berkeley.edu/show_host_detail.php?hostid=391793 is Boinc@hellas. A quick count of Boinc@hellas's results (those still on the server, at least) shows the following (WUs downloaded at various times on these dates):
Download date    Results
29 Nov 2004          9   (results past deadline)
 1 Dec 2004         50
 2 Dec 2004        620
 3 Dec 2004        413   (approximate)
 6 Dec 2004         88
 7 Dec 2004        124
                  ----
Current total     1304

The interesting part for Boinc@hellas' results is that the Host 391793 ( http://setiweb.ssl.berkeley.edu/show_host_detail.php?hostid=391793 ) seems to have been created today; possibly merged computers?

Cheers,

Stephen

"I want to go dancing on the moon, I want to frolic in zero gravity!....", and now, I might be able to go someday! Thanks, SpaceShipOne and crew!
ID: 52059 · Report as offensive
Profile Rom Walton (BOINC)
Volunteer tester
Joined: 28 Apr 00
Posts: 579
Credit: 130,733
RAC: 0
United States
Message 52064 - Posted: 7 Dec 2004, 22:09:32 UTC

The issue causing the host to be recreated with every RPC has been identified and a fix has been checked in.

Basically, the problem was that BOINC couldn't write to the state file, so every time BOINC was restarted on that machine it would create a new host.

This mostly happened in schools where the administrator or teacher set up the software and then the students, using accounts with fewer permissions, started the software but couldn't modify the state files or store the workunits.

For the short term we are just going to have BOINC shut itself down when it detects this condition. Eventually we'll have a setup program smart enough to let the person setting up the software configure BOINC to run as a service.

Mikey2345 is currently removing the BOINC software from the school's computers until the bug fix is deployed. The bug will be fixed in the next public release of BOINC.
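
For readers wondering what "shut itself down when it detects this condition" could look like, here is a rough sketch in Python. It is not the actual BOINC code; the function names and the idea of probing the data directory with a temporary file are assumptions made purely for illustration.

```python
import sys
import tempfile


def state_dir_is_writable(data_dir: str) -> bool:
    """Return True if a file can actually be created in data_dir.

    Hypothetical helper: it probes by creating and removing a temp file,
    which reflects the permissions of the account running the client.
    """
    try:
        with tempfile.NamedTemporaryFile(dir=data_dir):
            pass
        return True
    except OSError:
        # Covers permission errors as well as a missing directory.
        return False


def startup_check(data_dir: str) -> int:
    """Return an exit code: 0 to keep running, 1 to shut down.

    Refusing to start is the point: a client that cannot save its state
    would otherwise look like a brand-new host on every restart.
    """
    if not state_dir_is_writable(data_dir):
        print(
            f"Cannot write state files in {data_dir}; shutting down "
            "rather than registering as a new host.",
            file=sys.stderr,
        )
        return 1
    return 0
```

A client would call something like `startup_check()` once at launch, before contacting the scheduler, and exit if it returns a non-zero code.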

----- Rom
BOINC Development Team, U.C. Berkeley
My Blog
ID: 52064 · Report as offensive
Profile Captain Avatar
Volunteer tester
Joined: 17 May 99
Posts: 15133
Credit: 529,088
RAC: 0
United States
Message 52066 - Posted: 7 Dec 2004, 22:18:50 UTC - in response to Message 52064.  

> The issue causing the host to be recreated with every RPC has been identified
> and a fix has been checked in.

Thanks Rom!!!

Timmy
ID: 52066 · Report as offensive
Profile Paul D. Buck
Volunteer tester

Joined: 19 Jul 00
Posts: 3898
Credit: 1,158,042
RAC: 0
United States
Message 52466 - Posted: 9 Dec 2004, 19:36:19 UTC - in response to Message 51022.  

> Yikes.
>
> A blood hound is a tool for tracking down suspected perpetrators of
> wrong-doing. Multiple SETI crunchers have taken a look at the mikey2345
> account and found it puzzling and alarming. They would like someone inside
> SETI to take a look. They don't seem to be saying, "Let's hang the bloke." But
> a look-see appears in order.
>
> I frankly support the sniffing around by some of our computer-savvy colleagues
> here in cruncherville. Beats the hell out of some of the alternatives.

Robert,

Well said ... for both problems ...


ID: 52466 · Report as offensive
.
Volunteer tester

Joined: 3 Apr 99
Posts: 410
Credit: 16,559
RAC: 0
Message 52558 - Posted: 10 Dec 2004, 4:48:47 UTC - in response to Message 52064.  

> The issue causing the host to be recreated with every RPC has been identified
> and a fix has been checked in.
>
> Basically, the problem was that BOINC couldn't write to the state file, so
> every time BOINC was restarted on that machine it would create a new host.
>
> This mostly happened in schools where the administrator or teacher set up the
> software and then the students, using accounts with fewer permissions, started
> the software but couldn't modify the state files or store the workunits.
>
> For the short term we are just going to have BOINC shut itself down when it
> detects this condition. Eventually we'll have a setup program smart enough to
> let the person setting up the software configure BOINC to run as a service.
>
> Mikey2345 is currently removing the BOINC software from the school's computers
> until the bug fix is deployed. The bug will be fixed in the next public
> release of BOINC.
>
>
Hi Rom,

Mikey2345 is not the only one who causes this problem! When I look at all my pending WUs, many of them are waiting for results from so-called New's, and when I look at some of those, it seems there are a lot of Mikeys out there!

Sincerely Lena
ID: 52558 · Report as offensive
Profile Captain Avatar
Volunteer tester
Joined: 17 May 99
Posts: 15133
Credit: 529,088
RAC: 0
United States
Message 52568 - Posted: 10 Dec 2004, 5:37:15 UTC - in response to Message 52558.  
Last modified: 10 Dec 2004, 5:37:36 UTC


Hi Lena,

Rom stated that there is a bug in the software. I wonder if it's system-wide?

Timmy


"Mikey2345 is currently removing the BOINC software from the school's computers until the bug fix is deployed. The bug will be fixed in the next public release of BOINC.

----- Rom"

ID: 52568 · Report as offensive
.
Volunteer tester

Joined: 3 Apr 99
Posts: 410
Credit: 16,559
RAC: 0
Message 52608 - Posted: 10 Dec 2004, 12:00:03 UTC - in response to Message 52568.  

>
> Hi Lena,
>
> Rom stated that there is a bug in the software. I wonder if it's system-wide?
>
> Timmy
>
>
> "Mikey2345 is currently removing the BOINC software from the school's computers
> until the bug fix is deployed. The bug will be fixed in the next public
> release of BOINC.
>
> ----- Rom"
>
>
Hi Timmy,

Look at this
http://setiweb.ssl.berkeley.edu/workunit.php?wuid=4842014

This is just one I randomly came across, as I have other things to do than check all my pending results. In Rom's message I read that Mikey2345 is the only one they have "caught". But, as you can see, there are others. This is not OK :-(

So I have accepted the consequences and shut my BOINC client down for now, until this has been fixed! I have other things to spend my money on than electricity bills!

Lena
ID: 52608 · Report as offensive
Profile Captain Avatar
Volunteer tester
Joined: 17 May 99
Posts: 15133
Credit: 529,088
RAC: 0
United States
Message 52625 - Posted: 10 Dec 2004, 13:58:16 UTC - in response to Message 50937.  
Last modified: 10 Dec 2004, 22:18:52 UTC

.
ID: 52625 · Report as offensive
Profile Captain Avatar
Volunteer tester
Joined: 17 May 99
Posts: 15133
Credit: 529,088
RAC: 0
United States
Message 52682 - Posted: 10 Dec 2004, 22:19:09 UTC - in response to Message 52625.  
Last modified: 10 Dec 2004, 22:19:58 UTC

From Thread: http://setiweb.ssl.berkeley.edu/sah/forum_thread.php?id=7231


>Right now this really only delays the awarding of credit, unless all the other hosts are suffering from this bug. That would mean the same workunit would have to be sent to four different hosts suffering from the same bug. Numerically speaking, it isn't very likely.
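
To put a rough number on "it isn't very likely", here is a small sketch. It assumes each copy of a workunit goes to an independently chosen host and that some fraction of hosts has the bug; the fractions tried below are made-up illustration values, not measured figures.

```python
def chance_all_copies_hit_bug(p_buggy_host: float, copies: int = 4) -> float:
    """Probability that every copy of a workunit lands on a buggy host.

    Assumes hosts are picked independently, so the chance is p ** copies.
    The default of four copies follows the post above; p is hypothetical.
    """
    if not 0.0 <= p_buggy_host <= 1.0:
        raise ValueError("p_buggy_host must be between 0 and 1")
    return p_buggy_host ** copies


# Try a few hypothetical fractions of affected hosts.
for p in (0.01, 0.05, 0.10):
    print(f"{p:.0%} of hosts buggy -> "
          f"{chance_all_copies_hit_bug(p):.6%} chance all four copies stall")
```

Even a pessimistic guess that one host in ten is affected gives a one in ten thousand chance that all four copies stall, which is consistent with the point that the bug mostly delays credit rather than losing results.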

I'm not trying to make light of this bug, but please understand, the dev team would have to roll back the Alpha group from testing the 4.5x branch to take this fix for the public release right now. That in turn could delay the public launch of E@H by a few weeks.

We are a few weeks away from the E@H launch and the 4.5x/4.6x rollout. The 4.5x/4.6x branch has a fix for this bug, plus the graphical UI for Mac, Linux, and Solaris, plus graphical science application support for those platforms. This will bring all platforms into feature parity with each other. It's a big release.

On the Windows side, there is a new installer which can be configured for mass deployments, configuring Windows to run BOINC as a service, and a few other nifty features.

Trying to fix this now would be costly for all the projects.

I hope you all understand, we are trying to get this done as quickly as possible.

----- Rom

BOINC Development Team, U.C. Berkeley

ID: 52682 · Report as offensive
Drweaser
Joined: 25 Jun 99
Posts: 4
Credit: 680,584
RAC: 0
Message 52711 - Posted: 11 Dec 2004, 2:21:35 UTC - in response to Message 52015.  


> Look once here:
>
> user: Drweaser
> result:
> http://setiweb.ssl.berkeley.edu/workunit.php?wuid=5421777 , look Host: 53673
> computers ? merge ?
> http://setiweb.ssl.berkeley.edu/hosts_user.php?userid=5800
>
> great over 100 hosts (detach/attach)!!! (ironical)
>
>

Actually I have approximately 75 machines that should be live (but in reality, only about 55 are active enough during the day - this is a high school). However, thanks for reminding me I needed to clean up the host list! If you had looked at my production and RAC you would know all this.....
ID: 52711 · Report as offensive
Profile Stephen Balch
Joined: 20 Apr 00
Posts: 141
Credit: 13,912
RAC: 0
United States
Message 52786 - Posted: 11 Dec 2004, 8:36:55 UTC - in response to Message 52711.  

@Drweaser,

Thanks for responding to the thread. I am not blaming anyone, just trying to point out a problem with some users/computers.

It is of concern that some users download many WUs but never send the results back. This delays the arrival of completed science results, and credit for results processed by other users, well past the expected deadline. Unfortunately, this seems to be a common problem. Mostly, it seems to be the result of people resetting their projects as a "cure-all" solution for any problem they experience, without regard to the problems it creates for others. Yet there are other reasons for some problems.

We just recently pointed out a problem which, Rom has told us, was caused by file-access permissions on some machines, notably on groups of machines with OSes where file permissions are set differently for the installer and the user. In those situations, possibly similar to yours, the installer had permission to create and write a file, but the logged-in user did not have permission to write to it, so multiple instances of the machine were created in BOINC. In that case, at another school, the user (Mikey2345) is kindly removing the software from all those problem machines until a newer version that properly handles the problem can be deployed by the BOINC development team.

>
> Actually I have approximately 75 machines that should be live (but in reality,
> only about 55 are active enough during the day - this is a high school).
> However, thanks for reminding me I needed to clean up the host list! If you
> had looked at my production and RAC you would know all this.....
>

I don't want to prevent anyone from running BOINC and as many of the projects as they can, on as many machines as they can. Preventing people from running the projects is not, and never has been, my intent. Pointing out possible problems to the BOINC and project development teams is what I am trying to do. BOINC and SETI have been running pretty well for a while, but there are still some small, annoying problems with the projects.

@Mikey2345,

I haven't had the opportunity yet to thank you for taking the time, effort and steps necessary to prevent further problems with your machines. Please, as soon as the version that corrects the file permissions problem comes out, install it and start running the projects again. I want to see you back cranking out those results. (GRIN)

Cheers,

Stephen
"I want to go dancing on the moon, I want to frolic in zero gravity!....", and now, I might be able to go someday! Thanks, SpaceShipOne and crew!
ID: 52786 · Report as offensive
HachPi
Joined: 2 Aug 99
Posts: 481
Credit: 21,807,425
RAC: 21
Belgium
Message 52795 - Posted: 11 Dec 2004, 9:03:51 UTC - in response to Message 52786.  
Last modified: 11 Dec 2004, 9:06:14 UTC

@ Stephen Balch

Well said...

@ Drweaser

The only thing people here are concerned with is keeping SETI running smoothly and ironing out as many bugs as possible. Don't feel offended by this one. Many in this community use this board for the improvement and support of all of us.
If a bug on one, several, or all of my computers were causing severe delays or problems in delivering credit to other users, I would find it perfectly normal for them to draw my attention to it. I surely would not be angry about it, but would rather say thanks...

Greetings from Belgium ;-))

ID: 52795 · Report as offensive
Grant (SSSF)
Volunteer tester

Joined: 19 Aug 99
Posts: 13840
Credit: 208,696,464
RAC: 304
Australia
Message 52806 - Posted: 11 Dec 2004, 10:27:18 UTC - in response to Message 52608.  



> So I have taken the consequences and shut my BOINCclient down for now until
> this has been fixed! I have other things to spend my money on than electricity
> bills!

?
Your reason for doing so is????

You will still get credit for the units you do, even if the other copies went to phantom machines. It will just take longer to get the credit, because you'll have to wait for the return-by deadline to pass before they are re-issued and processed.
So I don't see what your problem is.
*shrug*
Grant
Darwin NT
ID: 52806 · Report as offensive