Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /disks/centurion/b/carolyn/b/home/boincadm/projects/beta/html/inc/boinc_db.inc on line 147
Tests of new scheduler features.

Tests of new scheduler features.

Message boards : News : Tests of new scheduler features.
Message board moderation

To post messages, you must log in.

Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 17 · Next

AuthorMessage
Richard Haselgrove
Volunteer tester

Send message
Joined: 3 Jan 07
Posts: 1451
Credit: 3,272,268
RAC: 0
United Kingdom
Message 45825 - Posted: 13 May 2013, 20:09:58 UTC - in response to Message 45824.  

I found the problem. There are apparently two different methods for computing speed... One is based on the predicted speed of the GPU, and it is what is used to determine which version is faster. When the random factor is added into that you're most likely to get the version that has computed the fewest results so far.

This is contrary to the behaviour David has said the scheduler should have and so I will fix it.

Mind you, the cuda42 we got first was the version which 63280 has completed most of.
ID: 45825 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 45827 - Posted: 13 May 2013, 21:21:59 UTC - in response to Message 45825.  

The random factor can go both ways, but the deviations from the average are larger for apps than have done fewer results.
ID: 45827 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45829 - Posted: 13 May 2013, 21:29:42 UTC
Last modified: 13 May 2013, 21:39:13 UTC

Enabled work fetch for v7. It's remote host though.. will try to speedup work fetch there a little...

EDIT: that host out of game for today... there was some power loss so all hosts rebooted. And GPU in lowered freq after reboot (local logon required to enable fan and full speed). So estimation times rised to 1-2 hours instead of ~10-20 mins...
it will ask new work not very soon
ID: 45829 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45830 - Posted: 13 May 2013, 21:55:33 UTC
Last modified: 13 May 2013, 21:59:36 UTC

Look for this host instead: http://setiweb.ssl.berkeley.edu/beta/show_host_detail.php?hostid=39394
there are MB and MB HD5, MB has bigger APR. Difference not so great as with cuda22 & other, but both app version have >200 eligible results already.
It just got new 20 tasks few mins ago. Could you post log for that fetch ?

EDIT: and there were SETI@home v7 v7.03 (opencl_ati5_sah), slower ones %)

SETI@home v7 7.03 windows_intelx86 (opencl_ati_sah)
Number of tasks completed 306
Max tasks per day 342
Number of tasks today 74
Consecutive valid tasks 309
Average processing rate 90.332877853703
Среднее оборотное время 1.48 days
SETI@home v7 7.03 windows_intelx86 (opencl_ati5_sah)
Number of tasks completed 275
Max tasks per day 309
Number of tasks today 113
Consecutive valid tasks 276
Average processing rate 82.306080532354
Среднее оборотное время 1.18 days


Was it random or bug ? ;)
ID: 45830 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45831 - Posted: 13 May 2013, 22:02:34 UTC - in response to Message 45824.  

When the random factor is added into that you're most likely to get the version that has computed the fewest results so far.


And the slowest app most probably will have fewer results ... un-optimization :)
ID: 45831 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 45832 - Posted: 13 May 2013, 22:29:28 UTC - in response to Message 45831.  

OK, I think I've fixed it. I want to confirm before I check the changes in. Let me know if you see more funny business.
ID: 45832 · Report as offensive
Urs Echternacht
Volunteer tester
Avatar

Send message
Joined: 18 Jan 06
Posts: 1038
Credit: 18,734,730
RAC: 0
Germany
Message 45833 - Posted: 13 May 2013, 22:33:32 UTC - in response to Message 45830.  

Look for this host instead: http://setiweb.ssl.berkeley.edu/beta/show_host_detail.php?hostid=39394
there are MB and MB HD5, MB has bigger APR. Difference not so great as with cuda22 & other, but both app version have >200 eligible results already.
It just got new 20 tasks few mins ago. Could you post log for that fetch ?

EDIT: and there were SETI@home v7 v7.03 (opencl_ati5_sah), slower ones %)

SETI@home v7 7.03 windows_intelx86 (opencl_ati_sah)
Number of tasks completed 306
Max tasks per day 342
Number of tasks today 74
Consecutive valid tasks 309
Average processing rate 90.332877853703
Среднее оборотное время 1.48 days
SETI@home v7 7.03 windows_intelx86 (opencl_ati5_sah)
Number of tasks completed 275
Max tasks per day 309
Number of tasks today 113
Consecutive valid tasks 276
Average processing rate 82.306080532354
Среднее оборотное время 1.18 days


Was it random or bug ? ;)
Guess you have switched apps in your post :
opencl_ati_sah == HD5
opencl_ati5_sah == non-hd5
_\|/_
U r s
ID: 45833 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45834 - Posted: 13 May 2013, 22:43:07 UTC - in response to Message 45833.  
Last modified: 13 May 2013, 22:45:20 UTC

why the hell such names was chosen?!!! There are enough tru issues to not to add such name substitudes >:[
(EDIT: yeah, "5" in stock name means non-HD5 app. "Funny" :/ )
ID: 45834 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45835 - Posted: 13 May 2013, 22:46:52 UTC - in response to Message 45832.  
Last modified: 13 May 2013, 22:47:19 UTC

OK, I think I've fixed it. I want to confirm before I check the changes in. Let me know if you see more funny business.


With posted log it would be more obvious why host got 20 slower app tasks (maybe I messed HD5 with non-HD5 one, but host recived task for slower app indeed).
ID: 45835 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45836 - Posted: 13 May 2013, 22:51:56 UTC
Last modified: 13 May 2013, 22:53:33 UTC

Did one more work fetch request - now got 20 tasks for fastest app.
Will see if it's consistent behavior or just random luck....

EDIT: btw, for this testing it would be worth to reduce number of simultaneously allocated tasks from 20 let say to 5 or maybe less. Not ?
It's not main, we do tasks here only for testing and fetch 20 similar tasks for current testing is just resources waste.
ID: 45836 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45837 - Posted: 13 May 2013, 23:04:45 UTC

2 more separate fetches - 3 and 2 tasks recived, all for fastest app.
Looks good so far.
ID: 45837 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45838 - Posted: 13 May 2013, 23:43:50 UTC
Last modified: 13 May 2013, 23:44:30 UTC

Started another NV GPU host (GSO9600).
Not too good beginning - 60 tasks already recived. All cuda22 ones. And this host supports cuda23 and cuda32 too and cuda23 expected to be fastest. But queue filled with slowest tasks.
http://setiweb.ssl.berkeley.edu/beta/show_host_detail.php?hostid=63368
ID: 45838 · Report as offensive
Richard Haselgrove
Volunteer tester

Send message
Joined: 3 Jan 07
Posts: 1451
Credit: 3,272,268
RAC: 0
United Kingdom
Message 45839 - Posted: 14 May 2013, 0:01:39 UTC

Meanwhile, on the other guinea-pig host, the first three fetches after Eric's 22:29 fix all resulted in cuda50 work - which is what would be desired, given the host's established APR records.
ID: 45839 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45840 - Posted: 14 May 2013, 0:09:59 UTC

And another fetch on ATi host: 13 tasks, all for fastest app.
So, with already established APR all Ok now. task distribution for newly added host could be better though.
ID: 45840 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 45841 - Posted: 14 May 2013, 0:18:24 UTC - in response to Message 45840.  

Yes, I wish it didn't pick a single app to distribute to on each RPC, but recalculated best app for each result. I dont see an easy way to fix that.
ID: 45841 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45842 - Posted: 14 May 2013, 0:20:20 UTC
Last modified: 14 May 2013, 0:24:46 UTC

SETI@home v7 7.00 windows_intelx86 (cuda22)
Number of tasks completed 85
Max tasks per day 119
Number of tasks today 0
Consecutive valid tasks 86
Average processing rate 57.869216272821
Среднее оборотное время 1.28 days
SETI@home v7 7.00 windows_intelx86 (cuda23)
Number of tasks completed 89
Max tasks per day 122
Number of tasks today 0
Consecutive valid tasks 89
Average processing rate 121.86273824761
Среднее оборотное время 1.53 days
SETI@home v7 7.00 windows_intelx86 (cuda32)
Number of tasks completed 214
Max tasks per day 248
Number of tasks today 0
Consecutive valid tasks 215
Average processing rate 124.01397894416
Среднее оборотное время 1.88 days


Because of GPU downclock APR for cuda23 now biased. APR for cuda32 became bigger than for cuda23 though cuda23 is the fastest for this host.
Interesting, will BOINC be able to correct APR when I will do local logon and restore normal GPU speed or host will recive only suboptimal tasks now?...

EDIT: and here we will be able to check how good random factor is. In particular, if SD of 1 is really enough. Host worked with full speed GPU many days already. It will work with downclocked GPU 1-2 days. If random factor does its work such deviation should be healed on subsequent work fetches.
If random factor too low host will stick in suboptimal state.
Interesting experiment ;)
ID: 45842 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 45843 - Posted: 14 May 2013, 2:27:45 UTC - in response to Message 45842.  

The rates are so close together that you should have a good chance of getting some cuda23. Otherwise you'd have to wait for the processing rate to get back down to normal on cuda32.
ID: 45843 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45844 - Posted: 14 May 2013, 7:01:49 UTC - in response to Message 45843.  

The rates are so close together that you should have a good chance of getting some cuda23. Otherwise you'd have to wait for the processing rate to get back down to normal on cuda32.


it's normal on cuda32. And if I return GPU to full speed before some downloaded cuda32 tasks will be processed, it will remain normal. While cuda23 APR was reduced.
Will see in what it results.
ID: 45844 · Report as offensive
Profile Raistmer
Volunteer tester
Avatar

Send message
Joined: 18 Aug 05
Posts: 2423
Credit: 15,878,738
RAC: 0
Russia
Message 45846 - Posted: 14 May 2013, 9:33:18 UTC

And another fetch request on GSO9600 - again cuda22...
Quite strange random, 3 in row with 3 possible variants.

Eric, could you check this host please http://setiweb.ssl.berkeley.edu/beta/show_host_detail.php?hostid=63368 - does server consider other plan classes besides cuda22 at all ?
ID: 45846 · Report as offensive
Profile Eric J Korpela
Volunteer moderator
Project administrator
Project developer
Project scientist
Avatar

Send message
Joined: 15 Mar 05
Posts: 1547
Credit: 27,183,456
RAC: 0
United States
Message 45854 - Posted: 14 May 2013, 21:16:12 UTC - in response to Message 45846.  
Last modified: 14 May 2013, 21:18:12 UTC

Sorry for not warning you. I needed to reset stats and cancel previous results. Your versions should be random again until you get back to 10 results per.

Checking on your hosts now.
ID: 45854 · Report as offensive
Previous · 1 · 2 · 3 · 4 · 5 · 6 . . . 17 · Next

Message boards : News : Tests of new scheduler features.


 
©2025 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.