Getting rid of "stale" work units?

Message boards : Number crunching : Getting rid of "stale" work units?
Message board moderation

To post messages, you must log in.

AuthorMessage
Kurt Schmucker

Send message
Joined: 11 Jan 00
Posts: 72
Credit: 130,823,400
RAC: 207
United States
Message 36935 - Posted: 15 Oct 2004, 16:57:28 UTC

I had to turn off BOINC SETI on one of my servers for a few days, so now several of the WUs on that machine are past their report deadline. Running these WUs is a waste of time, right? So, how do I get rid of them? I tried updating the project, installing the latest BOINC, etc. but these stale units are still there.
Any suggestions?
ID: 36935 · Report as offensive
Pascal, K G
Volunteer tester
Avatar

Send message
Joined: 3 Apr 99
Posts: 2343
Credit: 150,491
RAC: 0
United States
Message 36936 - Posted: 15 Oct 2004, 16:58:31 UTC

reset
Semper Eadem
So long Paul, it has been a hell of a ride.

Park your ego's, fire up the computers, Science YES, Credits No.
ID: 36936 · Report as offensive
Profile THESPEEKER
Avatar

Send message
Joined: 3 Apr 99
Posts: 168
Credit: 48,990
RAC: 0
United Kingdom
Message 36941 - Posted: 15 Oct 2004, 17:21:26 UTC

As Pascal has said in his message of no meaning.

RESET should work.

but still has my approval...


It is 10 oclock, do you know what your WUWUs are doing tonight...

No but i know what i am... and the missus does too...

And it aint crunching...

ID: 36941 · Report as offensive
Kurt Schmucker

Send message
Joined: 11 Jan 00
Posts: 72
Credit: 130,823,400
RAC: 207
United States
Message 36955 - Posted: 15 Oct 2004, 18:37:34 UTC

Reset worked just fine. Thanks for the speedy replies.
ID: 36955 · Report as offensive
Profile Paul D. Buck
Volunteer tester

Send message
Joined: 19 Jul 00
Posts: 3898
Credit: 1,158,042
RAC: 0
United States
Message 36957 - Posted: 15 Oct 2004, 18:42:45 UTC - in response to Message 36955.  

> Reset worked just fine. Thanks for the speedy replies.

So, what about our SLOW replies?

:)


ID: 36957 · Report as offensive
Ned Slider

Send message
Joined: 12 Oct 01
Posts: 668
Credit: 4,375,315
RAC: 0
United Kingdom
Message 37012 - Posted: 15 Oct 2004, 23:06:09 UTC

There's got to be a better way than simply resetting - which as far as I can tell simply deletes the whole project directory and downloads a fresh seti client and new workunits. By doing this, not only are we dumping the expired units, but all the other units too which then just further delays the validation system as these must wait to expire and then be reassigned.

Has anyone simply tried manually deleting the units that have/will expire before you can process them and leaving the rest. Just wonder if this would work?

Ned


*** My Guide to Compiling Optimised BOINC and SETI Clients ***
*** Download Optimised BOINC and SETI Clients for Linux Here ***
ID: 37012 · Report as offensive
Ken Phillips m0mcw
Volunteer tester
Avatar

Send message
Joined: 2 Feb 00
Posts: 267
Credit: 415,678
RAC: 0
United Kingdom
Message 37024 - Posted: 16 Oct 2004, 0:13:55 UTC - in response to Message 37012.  
Last modified: 16 Oct 2004, 0:14:26 UTC

> There's got to be a better way than simply resetting - which as far as I can
> tell simply deletes the whole project directory and downloads a fresh seti
> client and new workunits. By doing this, not only are we dumping the expired
> units, but all the other units too which then just further delays the
> validation system as these must wait to expire and then be reassigned.
>
> Has anyone simply tried manually deleting the units that have/will expire
> before you can process them and leaving the rest. Just wonder if this would
> work?
>
> Ned
>
Ned,

I've tried the abort feature of boincview, and it works fine, you select the offending WU, pick abort, the WU almost instantly goes to 100%, then 'technically fails' while leaving everything else absolutely intact. I used this to ditch the infamous 14mar04aa WU's that were giving grief before boinc 4.12 got released.

TTFN

Ken Phillips

BOINC question? Look here



"The beginning is the most important part of the work." - Plato
ID: 37024 · Report as offensive
Bart Barenbrug

Send message
Joined: 7 Jul 04
Posts: 52
Credit: 337,401
RAC: 0
Netherlands
Message 37642 - Posted: 17 Oct 2004, 17:41:16 UTC - in response to Message 37012.  
Last modified: 17 Oct 2004, 17:43:04 UTC

Ned asked:

> Has anyone simply tried manually deleting the units that have/will expire
> before you can process them and leaving the rest. Just wonder if this would
> work?

Not quite, but after installing a newer boinc version to a new directory instead of the one where my old boinc client was located, I did notice that I lost my cache worth of WUs due to that. So the new client started by downloading a whole new batch of WUs. Wasting the WUs in the old cache seemed a bit of a shame (leaving others waiting for credit a long time), and since they were for the same version of the seti client, it seemed to me that I should be able to merge the two caches. This was accomplished by moving the contents of old project's directory to the new one, along with the `slots' directory with the work units that were being processed by the old boinc client (renaming the subdirectories to higher slot numbers not used by the new clients and adjusting the entries in the old client_state.xml accordingly), and subsequently merging the client_state.xml files by hand. A lot of work (I'll be sure to install future boinc clients to the same directory as the old one), with the potential that making a mistake would trash the newly cached WUs as well (so I made a copy of the whole new boinc directory before beginning), so only do this if your really know what you're doing.

In short: it does seem possible to edit the client_state.xml file (when boinc is not running of course) and get away with it, so I guess deleting WUs from there is an option too (though be sure to keep the file_info, workunit and results in correspondence: delete all of them for the same work unit or none). Maybe just deleting the WU file from the project subdirectory will also do the trick, as the seti application will have no choice other than reporting an error when it gets told to start that WU (since it won't have a WU file to load anymore), so boinc can move on quickly. This is probably what boincview is doing.
ID: 37642 · Report as offensive
Pepo
Volunteer tester
Avatar

Send message
Joined: 5 Aug 99
Posts: 308
Credit: 418,019
RAC: 0
Slovakia
Message 38192 - Posted: 19 Oct 2004, 11:09:18 UTC - in response to Message 37642.  

> Ned asked:
>
> > Has anyone simply tried manually deleting the units that have/will expire
> > before you can process them and leaving the rest. Just wonder if this
> > would work?
>
> [...]
>
> In short: it does seem possible to edit the client_state.xml file (when boinc
> is not running of course) and get away with it, so I guess deleting WUs from
> there is an option too (though be sure to keep the file_info, workunit and
> results in correspondence: delete all of them for the same work unit or none).
> Maybe just deleting the WU file from the project subdirectory will also do the
> trick, as the seti application will have no choice other than reporting an
> error when it gets told to start that WU (since it won't have a WU file to
> load anymore), so boinc can move on quickly. This is probably what boincview
> is doing.
>
Yes, it IS possible to manually delete the expired WUs. If you delete only the outdated WU files, they will be automatically downloaded again upon attempt to crunch them (see http://setiweb.ssl.berkeley.edu/forum_thread.php?id=4823#30541).
But you will off course penalize the others to wait longer for theit credit.
Until the proposed WU abort functionality will be available directly in the BOINC client, possibly letting the servers know about the aborted WUs.

Peter
ID: 38192 · Report as offensive
Thunder

Send message
Joined: 3 May 03
Posts: 65
Credit: 993,581
RAC: 0
United States
Message 38238 - Posted: 19 Oct 2004, 15:44:53 UTC

For what it's worth, I've had 5-6 WU's expire as a result of the massive increase on processing times and the fact that the benchmarks are now completely out to lunch. :P

It's so bad now that on some of my machines, setting it to connect ~every 2 days (upper limit is 4... I know), I was having WU's expire (after 14 days) with the resource share I had it set to. I've since changed the connect to every 1 day, but still find it vaguely disturbing that the project can't manage to pull down an appropriate amount of work. (sigh)

In any case, the only GOOD thing that I found was that I still got credit for those WU's even though they were returned after the 'expiration' date. I'm not sure if I just got lucky that they hadn't already gotten 3 good results on those or if mine had been a 4th, that I still would have been credited.

I sure like the fact that they fixed the results table, so I can see these things, but also think it's sure a shame that the 4.05 WU's are so screwed on their processing times vs benchmarks that they're hardly worth crunching anymore. :P
ID: 38238 · Report as offensive
John McLeod VII
Volunteer developer
Volunteer tester
Avatar

Send message
Joined: 15 Jul 99
Posts: 24806
Credit: 790,712
RAC: 0
United States
Message 38398 - Posted: 20 Oct 2004, 1:07:49 UTC - in response to Message 38238.  

>
> I sure like the fact that they fixed the results table, so I can see these
> things, but also think it's sure a shame that the 4.05 WU's are so screwed on
> their processing times vs benchmarks that they're hardly worth crunching
> anymore. :P
>
Actually it is the other way around. Previous to BOINC 4.13, the benchmarks were not completing their work as the optimizer removed some critical code. Therefore the old versions of BOINC were granting way too much credit.


BOINC WIKI
ID: 38398 · Report as offensive
Thunder

Send message
Joined: 3 May 03
Posts: 65
Credit: 993,581
RAC: 0
United States
Message 38563 - Posted: 20 Oct 2004, 15:50:51 UTC - in response to Message 38398.  

> Actually it is the other way around. Previous to BOINC 4.13, the benchmarks
> were not completing their work as the optimizer removed some critical code.
> Therefore the old versions of BOINC were granting way too much credit.

What I meant was that I'm now having problems with work expiring because the WU's now take far longer than the benchmarks indicate they should. For me at least, turning the 'cache' up to anything more than 1 day means some WU's will expire after 2 weeks.... that's a LITTLE off, dontcha think? ;)
ID: 38563 · Report as offensive

Message boards : Number crunching : Getting rid of "stale" work units?


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.