Message boards :
News :
Tests of new scheduler features.
Message board moderation
Author | Message |
---|---|
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
I've added some new scheduler features to hopefully ensure that you (eventually) get the fastest application version for your processor/GPU. But in order to make sure I did it right, I'm going to need to reset the app_version and per host app version statistics. It'll take a week or so for things to get back to normal. You'll initially see some non-optimal application choices (like CUDA3.2 on your CUDA5 card or CAL on your ATI card /w OpenCL.) This is deliberate. Please let them run. It's the only way I can see that the slow versions stop being sent. Once I'm sure things work, I'll set them back the way they were. ![]() |
Send message Joined: 3 Jan 07 Posts: 1451 Credit: 3,272,268 RAC: 0 ![]() |
What would be the most helpful thing for anonymous platform users to do? Like our friend CElliott? |
Send message Joined: 29 May 06 Posts: 1037 Credit: 8,440,339 RAC: 0 ![]() |
What would be the most helpful thing for anonymous platform users to do? Like our friend CElliott? And myself, since i'm running the AMD/ATI r1817 MB app that works correctly with Cat 12.10/APP runtime 1016.4 and Cat 13.2/13.3/13.4/ APP runtime 1124.2, it would now only need to be not sent to Cat 13.1/APP runtime 1084.4 hosts, Note the AMD/ATI Astropulse apps have now been fixed for Cat 13.2/13.3/13.4/ APP runtime 1124.2 too. http://lunatics.kwsn.net/12-gpu-crunching/update-for-ati-mb-gpu-app.0.html Claggy |
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
What would be the most helpful thing for anonymous platform users to do? Like our friend CElliott? Just keep running as normal. ![]() |
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
I hope to be distributing a new ATI app next week, FYI. ![]() |
![]() Send message Joined: 18 Jan 06 Posts: 1038 Credit: 18,734,730 RAC: 0 ![]() |
I hope to be distributing a new ATI app next week, FYI. Hopefully it will be not only one this time, there are four new ATI apps at least. _\|/_ U r s |
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
OK. Be sure to hold me to that. ![]() |
![]() Send message Joined: 10 Mar 12 Posts: 1700 Credit: 13,216,373 RAC: 0 ![]() |
I've added some new scheduler features to hopefully ensure that you (eventually) get the fastest application version for your processor/GPU. But in order to make sure I did it right, I'm going to need to reset the app_version and per host app version statistics. Oh well, my NVIDIA 315M got a CUDA50 WU, and even though it was considerably slower than CUDA23 and 32, it did crunch it without any protest :-) http://setiweb.ssl.berkeley.edu/beta/result.php?resultid=13671187 WARNING!! "THIS IS A SIGNATURE", of the "IT MAY CHANGE AT ANY MOMENT" type. It may, or may not be considered insulting, all depending upon HOW SENSITIVE THE VIEWER IS, to certain inputs to/from the nervous system. |
![]() Send message Joined: 14 Feb 13 Posts: 606 Credit: 588,843 RAC: 0 |
I hope this is transient: host 62763 cuda 2.2 <flops>9739745.861848</flops> estimated CPU time remaining: 7261720.044040 [that's ~2000 hours - boinc's wording. It's actually the initial estimate] saturated 717141815.35 busy 717141815.35 [that's the next 22 years] this task was the one to send the several magnitudes too small flops. NB the task will take aprox. 10k sec and hasn't started yet A person who won't read has no advantage over one who can't read. (Mark Twain) |
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
Hopefully the bad FLOPs estimates will fix themselves. If not, well, that's why we're doing this test. ![]() |
![]() Send message Joined: 14 Feb 13 Posts: 606 Credit: 588,843 RAC: 0 |
I forgot to say that the card proclaims to have 91 GFLOPS peak :) I've no idea where those flops are coming from? Since new flops are received on work allocation only, I'll probably have to abort tasks to get new work with fresh flops allocated. We probably want to see numbers before 11 tasks validate and APR kicks in (and even that will take a few days, with that host's expected turnover). A person who won't read has no advantage over one who can't read. (Mark Twain) |
Send message Joined: 3 Jan 07 Posts: 1451 Credit: 3,272,268 RAC: 0 ![]() |
OK, tried the equivalent test with William's other host (62652). That has been doing CPU work only up till now, to generate reference results. GPU has a peak flops rating of 336 Gfl. Cuda50 was the allocated application (probably not optimal for a 9800GT, but we'll wait on that). Server calculated <flops> at 23,578,084,425 - that sounds very much like 10x the <p_fpops> benchmark, which is 2,350,462,673 [and nothing like the long-established APR of 157 the same machine has on the main project]. Edit - first task returned: run time 48 minutes, against an initial estimate of 70 minutes. That's actually very good. |
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
It turns out that the host and app version stats will be messed up until all of the GPU results sent out before the server change are returned. That could take a while. I'll keep resetting the stats in the meantime to speed up the process. ![]() |
![]() Send message Joined: 14 Feb 13 Posts: 606 Credit: 588,843 RAC: 0 |
OK, tried the equivalent test with William's other host (62652). That has been doing CPU work only up till now, to generate reference results. That value sounds familiar... Tasks first came in with an estimate in the 3k region, which translates into the same 23.5e9 give or take. So I'm wondering where that initial flops estimate derives from and where the far too small one originated. Oh, runtime estimates are linked into CreditNew, aren't they? Nuff' said... A person who won't read has no advantage over one who can't read. (Mark Twain) |
Send message Joined: 3 Jan 07 Posts: 1451 Credit: 3,272,268 RAC: 0 ![]() |
This is going to get a bit weird. Host 62652 has now been allocated test work from all five cuda apps. Only the first one (cuda50) has any completed tasks and hence an APR, but not yet the 11 completions that would trigger usage of that APR in runtime estimation. But the flops values are all over the place: <flops>23520469976.920357</flops> <plan_class>cuda22</plan_class> <flops>141034284.159673</flops> <plan_class>cuda23</plan_class> <flops>77735925.456133</flops> <plan_class>cuda32</plan_class> <flops>100125333.859577</flops> <plan_class>cuda42</plan_class> <flops>23578084425.701229</flops> <plan_class>cuda50</plan_class> cuda22 is estimating 50 minutes, cuda32 is estimating 252 hours - same WU batch. |
Send message Joined: 26 Jun 10 Posts: 22 Credit: 601,692 RAC: 0 ![]() |
Did something change in the credit system as well? All tasks that validated after the change in the scheduler did receive very low credits (0.5 or less for a task that normally yields around 50 credits) I checked a few wingmen, including Eric, and everyone seems to receive low credits ![]() |
Send message Joined: 3 Jan 07 Posts: 1451 Credit: 3,272,268 RAC: 0 ![]() |
First tasks are in for all versions now. I think all WUs are comparable. cuda22 2,780 seconds cuda23 1,254 seconds cuda32 1,122 seconds cuda42 1,429 seconds cuda50 2,897 seconds (I've taken the first task in each case, to allow for fair comparison with wisdom generation) A useful reminder that with older hardware, biggest is not necessarily best. |
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
The credit system didn't change, but it appears the huge run time estimates for the results are causing tiny credit grants when CUDA result validates against CUDA result. I'll fix those later. Hopefully credits will converge towards normal values as time goes on. ![]() |
![]() Send message Joined: 12 Dec 08 Posts: 7 Credit: 3,116,266 RAC: 0 ![]() |
Meanwhile it is every second Workingunit with low credits. For example: http://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=5209949 Given Credit of 0.23, validating Ati-OpenCl against Ati-OpenCL. Gimme the whole credit, please! ;-) |
![]() Send message Joined: 10 Mar 12 Posts: 1700 Credit: 13,216,373 RAC: 0 ![]() |
Meanwhile it is every second Workingunit with low credits. We don't need no steenking credit. WARNING!! "THIS IS A SIGNATURE", of the "IT MAY CHANGE AT ANY MOMENT" type. It may, or may not be considered insulting, all depending upon HOW SENSITIVE THE VIEWER IS, to certain inputs to/from the nervous system. |
©2025 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.