Message boards :
News :
SETI@home v8 beta to begin on Tuesday
Message board moderation
Previous · 1 . . . 28 · 29 · 30 · 31 · 32 · 33 · 34 . . . 99 · Next
Author | Message |
---|---|
![]() ![]() Send message Joined: 21 Apr 13 Posts: 23 Credit: 2,253,909 RAC: 0 ![]() |
I noticed that my folder updates are now instantaneous. Sure enough, all the OpenCL WUs have been replaced with CUDA, but not short run times (almost an hour @ 0.403 core.) |
Send message Joined: 21 Nov 12 Posts: 1015 Credit: 5,459,295 RAC: 0 ![]() |
Guys, it looks as if I'm going to be "somewhat random" over the next few days - I've just had a long conversation with work and I've got to head off to cover fr someone who has just suffered "a bit of a domestic crisis" (not life threatening, but not good either) I'll have to put this trial on ice until the storm settles and I can devote a few hours to it later in the week. (Now frantically throwing things in a bag, ready for the off in about 8 hours and one sleep's time...) |
![]() ![]() Send message Joined: 18 Aug 05 Posts: 2423 Credit: 15,878,738 RAC: 0 ![]() |
Could you next time you see such freeze in Explorer try to look what task performed on GPU connected to monitor. VLAR or not and what AR if not? try to troubleshoot this via -period_iterations_num N -sbs N parameters change until acceptable Explorer refreshing will be found. Perhaps defaults should be changed for NB build cause it's apparent NV cards can't deal with PulseFind as good as ATi's one do. News about SETI opt app releases: https://twitter.com/Raistmer |
![]() Send message Joined: 10 Mar 12 Posts: 1700 Credit: 13,216,373 RAC: 0 ![]() |
How can these two results have two results in one stderr? I've seen the same in many WU's run with (opencl_atiapu_sah) And the two results in the same stderr, are different too, in the same stderr... Scroll down in the stderrs, and you'll see two different results in the same stderr: https://setiweb.ssl.berkeley.edu/beta/result.php?resultid=22039477 https://setiweb.ssl.berkeley.edu/beta/result.php?resultid=22039478 |
![]() ![]() Send message Joined: 18 Aug 05 Posts: 2423 Credit: 15,878,738 RAC: 0 ![]() |
How can these two results have two results in one stderr? I've seen the same in many WU's run with (opencl_atiapu_sah) Overflow was found in triplet logging then GPU was queried for spikes results and spikes were reported too. It's pecularity of SoG build. News about SETI opt app releases: https://twitter.com/Raistmer |
![]() Send message Joined: 10 Mar 12 Posts: 1700 Credit: 13,216,373 RAC: 0 ![]() |
How can these two results have two results in one stderr? I've seen the same in many WU's run with (opencl_atiapu_sah) Heh, thanks. Yeah peculiarity it was indeed :-) My CUDA was kicked out as invalid though, when it reported just: Spike count: 30 Autocorr count: 0 Pulse count: 0 Triplet count: 0 Gaussian count: 0 . |
![]() ![]() Send message Joined: 18 Aug 05 Posts: 2423 Credit: 15,878,738 RAC: 0 ![]() |
Both apps most probably reported valid subset of whole signals that that task contains. They just picked them in different orders hence different subsets. Next SoG build will have more "traditional" signals ordering. Though this issue can be properly handled only on validator level. News about SETI opt app releases: https://twitter.com/Raistmer |
![]() Send message Joined: 10 Mar 12 Posts: 1700 Credit: 13,216,373 RAC: 0 ![]() |
And on this one https://setiweb.ssl.berkeley.edu/beta/workunit.php?wuid=7985228 , my CUDA did not report anything else than: Find triplets Cuda kernel encountered too many triplets, or bins above threshold, reprocessing this PoT on CPU... cudaAcc_free() called... cudaAcc_free() running... cudaAcc_free() PulseFind freed... cudaAcc_free() Gaussfit freed... cudaAcc_free() AutoCorrelation freed... cudaAcc_free() DONE. Cuda sync'd & freed. Preemptively acknowledging a safe Exit. -> SETI@Home Informational message -9 result_overflow NOTE: The number of results detected equals the storage space allocated. Flopcounter: 206561637.989259 Spike count: 0 Autocorr count: 0 Pulse count: 0 Triplet count: 30 Gaussian count: 0 While the other two reported: Spike count: 0 Autocorr count: 0 Pulse count: 16 Triplet count: 14 Gaussian count: 0 Seems as if CUDA v8, isn't really ready for main.... |
Send message Joined: 29 Nov 14 Posts: 12 Credit: 63,078 RAC: 0 ![]() |
Well the 266.58 whql(cuda 32) on a dirty Zotac(EVGA firmware) GTX580 Infinity @ 700MHz is validating and someone said something about needing 350+ Nvidia driver under Windows 7 Pro sp1 x64. My results as of this moment in time: (614) · In progress (1) · Validation pending (5) · Validation inconclusive (2) · Valid (606) · Invalid (0) · Error (0) |
![]() Send message Joined: 14 Feb 13 Posts: 606 Credit: 588,843 RAC: 0 |
Seems as if CUDA v8, isn't really ready for main.... nothing to do with v8. the problem has been a longstanding one and results from vastly different processing orders on CPU and GPU. Raistmer has tried to get his app to report the same subset as the CPU would find, so you find those apps agree. usually the sets match enough to get credit, you were unlucky t have sucha disjunct set, that the validator threw it out. To make a graphic approximation of the problem: Imagine you and a collegue are sent ou to check the roads for potholes and to report the exact location of each. if there are only a few potholes, you'll come back with the same numbers (give or take). now imagine you have a pothole riddled road, but only space for 30 holes on your sheet (and no spares). now if you and you collegue start on different ends of the road you may find you've run out of space before you even reached the bit your mate was checking. The boss doesn't know who reported the right holes, so he sends a third guy. and that guy starts where your mate was. so boss tells you 'sorry yours can't be right, the other two guys are giving me completely different numbers.' bad luck, and something that will need addressing [is on the list for being addressed], but not when we are working like crazy to get a version update released. A person who won't read has no advantage over one who can't read. (Mark Twain) |
Send message Joined: 3 Jan 07 Posts: 1451 Credit: 3,272,268 RAC: 0 ![]() |
Please note that this problem is (almost exclusively) prevalent when there are a huge number of signals at the very start of processing. In the example WU 7985228 Tutankhamon discussed, nobody spent more than 30 seconds on the task - the first user (using CPU only) would have spent over four hours working on it if the WU hadn't suffered RFI. |
![]() Send message Joined: 10 Mar 12 Posts: 1700 Credit: 13,216,373 RAC: 0 ![]() |
Thanks William, and Richard for the explanation. I'll just lean back and relax then....:-) |
![]() ![]() Send message Joined: 18 Aug 05 Posts: 2423 Credit: 15,878,738 RAC: 0 ![]() |
My GTX260 host: SETI@home v8 8.00 windows_intelx86 (cuda23) Number of tasks completed 61 Max tasks per day 95 Number of tasks today 5 Consecutive valid tasks 62 Average processing rate 94.51 GFLOPS Average turnaround time 0.37 days SETI@home v8 8.00 windows_intelx86 (cuda32) Number of tasks completed 90 Max tasks per day 124 Number of tasks today 1 Consecutive valid tasks 91 Average processing rate 85.52 GFLOPS Average turnaround time 0.61 days SETI@home v8 8.00 windows_intelx86 (cuda42) Number of tasks completed 33 Max tasks per day 68 Number of tasks today 1 Consecutive valid tasks 35 Average processing rate 70.06 GFLOPS Average turnaround time 0.63 days SETI@home v8 8.00 windows_intelx86 (cuda50) Number of tasks completed 42 Max tasks per day 77 Number of tasks today 0 Consecutive valid tasks 44 Average processing rate 79.46 GFLOPS Average turnaround time 0.65 days So, APR correctly shows CUDA23 as fastest for this GPU (it was for v7 in all offline benchmarks before). But full mix of tasks still on host. It will be interesting to observe how and when BOINC will be able to select best one. At least it's on rihgt way now. For MB v7 it wasn't able to select right build. News about SETI opt app releases: https://twitter.com/Raistmer |
Send message Joined: 18 Jun 08 Posts: 76 Credit: 113,089 RAC: 0 ![]() |
Eric, About the result_overflow not always turning into runtime_outlier. The validator checks for overflow only when it's still searching for the canonical result. The weakly similar tests are missing the checks. Calls to check_overflow_result() need to go inside the loop that starts at line 263 and in to check_pair() around line 418. Also, the v6/v7 cheater check is misplaced in check_pair(), it should go after the if (retval) block, not inside it. The check is also completely missing from the weakly similar test loop that starts at line 263. |
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
Thanks! Glad other eyes are looking at this. ![]() |
![]() Send message Joined: 14 Feb 13 Posts: 606 Credit: 588,843 RAC: 0 |
Which file does that go in Mike and is it all one line or two? please elaborate on 'make sure there is a space in front of the first char' do you mean - start the entry with a ' ' - make sure there are ' ' between different '-' entries - something else anything that's not straightforward needs to go into the documentation we are currently finalising. A person who won't read has no advantage over one who can't read. (Mark Twain) |
Send message Joined: 3 Jan 07 Posts: 1451 Credit: 3,272,268 RAC: 0 ![]() |
I was just testing that. With intel_gpu, -v 0 (with no space at the beginning of the line) was recognised and acted upon correctly. William is right, there needs to be a space between each separate switch (and if the switch needs a number or numbers as well, like -v, they should always be present). |
Send message Joined: 12 Nov 10 Posts: 1149 Credit: 32,460,657 RAC: 1 ![]() |
Today I've had a flurry of tasks (12 so far over 5 hosts (4 Windows, 1 Android), both CPU and GPU), that have reported Completed, marked as invalid, despite them completing and validating for the two other wingmen. Anyone else seen anything similar? Not sure if this link will work http://setiweb.ssl.berkeley.edu/beta/results.php?userid=14462&offset=0&show_names=0&state=5&appid= |
![]() Send message Joined: 15 Mar 05 Posts: 1547 Credit: 27,183,456 RAC: 0 ![]() |
That was due to the validator problem Juha found. I've triggered the validator to run again on all the invalid result that came in in the past few days. It should mark any that caught the bug as valid. ![]() |
Send message Joined: 18 Jun 08 Posts: 76 Credit: 113,089 RAC: 0 ![]() |
That was due to the validator problem Juha found. I've triggered the validator to run again on all the invalid result that came in in the past few days. It should mark any that caught the bug as valid. Umm, no, sorry, not really like that. The bug was there already but now that you moved the v6/v7/v8 cheater check in check_pair() the bug is really exposed. The invalids I checked are all overflow results and the third result to come in. You forgot to add the call to check_overflow_result() into check_pair() . It needs to go before the cheater check since the cheater check needs to know if the result was an overflow one. |
©2025 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.