machine ditched all wu's

Message boards : Number crunching : machine ditched all wu's

Profile Zombu2
Volunteer tester

Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1728707 - Posted: 25 Sep 2015, 3:18:58 UTC
Last modified: 25 Sep 2015, 3:21:36 UTC

one of my machines just ditched all of its wu's for no reason

nobody touched the machine

anyone have a clue wth happened? 300 wu's trashed

EDIT:
Just checked the machine details: no sign of 300 abandoned there, yet when i check the "All" list they are there.... am slightly confused
I came down with a bad case of i don't give a crap
ID: 1728707
Profile Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1728733 - Posted: 25 Sep 2015, 6:20:20 UTC - in response to Message 1728707.  

one of my machines just ditched all of its wu's for no reason

nobody touched the machine

anyone have a clue wth happened? 300 wu's trashed

EDIT:
Just checked the machine details: no sign of 300 abandoned there, yet when i check the "All" list they are there.... am slightly confused

First, tell us which of your 20+ computers you suspect has issues. Based on your Computer pages on the SETI@home website, I don't see any of your machines having a large number of errors or invalids.

Second, what did you see, and where (BOINC Manager or website page) that led you to believe so many Tasks had been trashed?

The more specific info you can give when asking a question like that, the easier it is to suggest a probable cause or solution.
Donald
Infernal Optimist / Submariner, retired
ID: 1728733
Profile Zombu2
Volunteer tester

Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1728778 - Posted: 25 Sep 2015, 11:25:23 UTC

http://setiathome.berkeley.edu/workunit.php?wuid=1908960649

This is one of the 299 abandoned wu's that got ditched yesterday on this machine

http://setiathome.berkeley.edu/show_host_detail.php?hostid=7772586

looking at the machine itself, i only see 31 cpu tasks that got removed, since the machine only runs gpu tasks
I came down with a bad case of i don't give a crap
ID: 1728778
Profile Donald L. Johnson
Joined: 5 Aug 02
Posts: 8240
Credit: 14,654,533
RAC: 20
United States
Message 1728810 - Posted: 25 Sep 2015, 14:52:38 UTC - in response to Message 1728778.  

http://setiathome.berkeley.edu/workunit.php?wuid=1908960649

This is one of the 299 abandoned wu's that got ditched yesterday on this machine

http://setiathome.berkeley.edu/show_host_detail.php?hostid=7772586

looking at the machine itself i only see 31 cpu tasks that got removed since the machine only runs gpu tasks

So something happened on 24 Sep 2015 01:14:24 UTC on that machine. Looks like most of the Tasks have been resent, completed and validated, and are clearing off the machine's Tasks list. Does the BOINC Manager Event Log or the machine's own Event Log show any significant event at that time?
Donald
Infernal Optimist / Submariner, retired
ID: 1728810
Profile Zombu2
Volunteer tester

Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1728923 - Posted: 25 Sep 2015, 22:17:58 UTC - in response to Message 1728810.  

i will look at that tomorrow when i get a chance to go to the server room. the machine is in there, so nobody but me could possibly have touched it: i'm the only one who has a key to it, and i wasn't there
I came down with a bad case of i don't give a crap
ID: 1728923
Cosmic_Ocean
Joined: 23 Dec 00
Posts: 3027
Credit: 13,516,867
RAC: 13
United States
Message 1728939 - Posted: 25 Sep 2015, 23:43:48 UTC

It's not really a question of "what happened on/to that machine around that time," as much as "what do the logs say around that time?"

A while back, I helped to nudge troubleshooting the abandonment issue into the direction of an out-of-sequence scheduler request. It could still be that issue. That's why the logs are useful.
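
For anyone wanting to pull just the relevant stretch out of a saved copy of the client log, here is a minimal sketch in Python. It assumes each log line starts with a timestamp in the form "24-Sep-2015 01:14:24" followed by the message; that layout, the function name lines_near, and the sample lines are assumptions made up for illustration, not real output from the machine in this thread. Also keep in mind that the website shows UTC, while the client log is, as far as I know, written in the machine's local time, so the moment you search for may need shifting.

```python
from datetime import datetime, timedelta

# Assumed line layout: "24-Sep-2015 01:14:24 [SETI@home] message text"
STAMP_FORMAT = "%d-%b-%Y %H:%M:%S"
STAMP_LENGTH = 20  # number of characters in "24-Sep-2015 01:14:24"


def lines_near(lines, moment, window_minutes=10):
    """Return the log lines stamped within window_minutes of moment."""
    window = timedelta(minutes=window_minutes)
    selected = []
    for line in lines:
        try:
            stamp = datetime.strptime(line[:STAMP_LENGTH], STAMP_FORMAT)
        except ValueError:
            continue  # skip lines that do not start with a timestamp
        if abs(stamp - moment) <= window:
            selected.append(line.rstrip("\n"))
    return selected


# Invented sample lines, only to show the filter working.
sample = [
    "24-Sep-2015 01:05:00 [SETI@home] Sending scheduler request: To fetch work.",
    "24-Sep-2015 01:14:24 [SETI@home] Scheduler request completed",
    "24-Sep-2015 03:00:00 [SETI@home] Computation for task finished",
    "not a log line",
]

nearby = lines_near(sample, datetime(2015, 9, 24, 1, 14, 24))
for entry in nearby:
    print(entry)
```

With a real log you would pass the lines of the saved file instead of the sample list, and widen or narrow window_minutes until the scheduler requests on either side of the abandonment are visible.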
Linux laptop:
record uptime: 1511d 20h 19m (ended due to the power brick giving-up)
ID: 1728939
Profile Zombu2
Volunteer tester

Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1728945 - Posted: 26 Sep 2015, 0:23:14 UTC - in response to Message 1728939.  

It's not really a question of "what happened on/to that machine around that time," as much as "what do the logs say around that time?"

A while back, I helped to nudge troubleshooting the abandonment issue into the direction of an out-of-sequence scheduler request. It could still be that issue. That's why the logs are useful.


I'll get 'em tomorrow
I came down with a bad case of i don't give a crap
ID: 1728945
Profile James Sotherden
Joined: 16 May 99
Posts: 10436
Credit: 110,373,059
RAC: 54
United States
Message 1729083 - Posted: 26 Sep 2015, 6:44:12 UTC

My Vista machine had a load of WUs abandoned at or about the same time. Some were APs :( It happens and it sucks.

Old James
ID: 1729083
Profile Zombu2
Volunteer tester

Joined: 24 Feb 01
Posts: 1615
Credit: 49,315,423
RAC: 0
United States
Message 1729084 - Posted: 26 Sep 2015, 6:45:19 UTC

yeah that was a bummer and i probably got penalized for it too haha
I came down with a bad case of i don't give a crap
ID: 1729084



 