Message boards :
Number crunching :
Lunatics_x41g_linux64_cuda32.7z
Message board moderation
Previous · 1 · 2 · 3 · 4 · 5 · 6 · Next
Author | Message |
---|---|
Andrew Send message Joined: 28 Mar 15 Posts: 47 Credit: 1,053,596 RAC: 0 |
Time clock on that one is incorrect as it was one of the tasks that kept getting tested on the various GPU enabled executables until it started working and probably had some CPU time at the very beginning.. theres probably a couple of these tasks in there. Normally it would take me 4-6 hours to crunch a 150-180GFLOP wu, but the GPU is doing them now in 15-20 mins. Of course the GPU is not registering any GFLOPS so I have no idea how that is going to affect the credits..? Of course now boinc for some reason has a completely screwed up sense of completion time.. when running on CPU only it was fairly accurate, now its trying to work its way down from 44 hrs/wu even after re-running the CPU Benchmark.. |
Claggy Send message Joined: 5 Jul 99 Posts: 4654 Credit: 47,537,079 RAC: 4 |
Do you know what include the boinc_temporary_exit() function is in? I had that problem when I tried compiling the Stock seti_boinc app on my Raspberry Pi, seemed to be a linking problem with the compiler, switching to a later Pi image fixed it for me: http://setiathome.berkeley.edu/forum_thread.php?id=76903&postid=1652926#1652926 Claggy |
Raistmer Send message Joined: 16 Jun 01 Posts: 6325 Credit: 106,370,077 RAC: 121 |
I would say it fails before linker has chance to be called. Perhaps, outdated BOINC sources. Try to update boinc dir from Berkeley's SVN. |
OTS Send message Joined: 6 Jan 08 Posts: 371 Credit: 20,533,537 RAC: 0 |
@dsh At the moment the GPU is processing work (Thanks Jason) so I think I will see what happens before trying to use slackpkg or slapt-get. My big concern at the moment is how the GPU is processing work. So far the GPU has processed eight WUs but they are all pending so I do not know if the results are valid or not. The times for each seem to be three to six minutes which is so much better than CPU times that I have my doubts on their being valid. As a result I have currently stopped downloads with about 20 WUs downloaded until I know. Time will tell. |
Andrew Send message Joined: 28 Mar 15 Posts: 47 Credit: 1,053,596 RAC: 0 |
I would say it fails before linker has chance to be called. The sources I am using are 2882 from SVN. There was a few issues as I tried to compile with correcting library/include paths and adding a couple #includes so that it would compile against Cuda toolkit 5.5.22. For the missing boinc_temporary_exit, i commented them out and it finished compiling. I checked out everything under https://setisvn.ssl.berkeley.edu/svn/branches/ and the function's definition didn't exist anywhere but there was some comments in one of the files that Jason had done some changes that affected boinc_temporary_exit. Edit: It seems to be working though as no wu's have failed validation yet. One that had been inconclusive did pass but the other cuda32 from the x41zc failed? http://setiathome.berkeley.edu/workunit.php?wuid=1751056465 Edit: The user seems to be generating a lot of invalid and errors .. http://setiathome.berkeley.edu/show_host_detail.php?hostid=7336550 |
Claggy Send message Joined: 5 Jul 99 Posts: 4654 Credit: 47,537,079 RAC: 4 |
I would say it fails before linker has chance to be called. Boinc doesn't use SVN any longer, it uses GIT: http://boinc.berkeley.edu/trac/wiki/SourceCodeGit Cloning the remote Git repository You'll then need to build the libraries and api: http://boinc.berkeley.edu/trac/wiki/CompileAppLinux Claggy |
jason_gee Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 |
At the moment the GPU is processing work (Thanks Jason) so I think I will see what happens before trying to use slackpkg or slapt-get. @dsh Great to see it's running the Cuda 3.2 build, though looks like excessive pulses detected. That looks like the weird problem that was spotted on Windows With Maxwell and Cuda 3.2 (Then stopped sending to those) In that case, back to the plan of updating BoincAPI and spitting out a Later Cuda version. That'll probably happen later tonight, when I'll probably bring the newer Centos install development tools, drivers and sources up to scratch. [Edit:] and as I typed first invalid ticked over, so moving on. Making an updated Linux build fits with my current development plans anyhow, so a bit of juggling to do tonight. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
OTS Send message Joined: 6 Jan 08 Posts: 371 Credit: 20,533,537 RAC: 0 |
At the moment the GPU is processing work (Thanks Jason) so I think I will see what happens before trying to use slackpkg or slapt-get. After I spotted the first invalid I tried one more WU with boinc 6.10.58 and the run time was about the same as the others with the same SETI@Home Informational message -9 result_overflow message so I suspect that will be an invalid as well.
Thank you very much for your continued effort. |
jason_gee Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 |
Progress here as I stumbled across a Cuda 6 build I made, which *should* be nearly identical to that last Cuda32 one, which runs/works on anything but Maxwell. Cuda 6 should be fine, though I'll witrhold until I prove it here, and at the moment the project is giving me 'no tasks available', so the waiting game it is, lol [Edit:] Lol, got 7 vlars for CPU... well not what I wanted exactly, but a start [Edit2:] And looks like it's off and running on the 680 at least. Will let it accumulate some pendings and see if anything looks strage. If it looks OK will add it to the same download page, keeping the Cuda32 one for older GPUs. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
OTS Send message Joined: 6 Jan 08 Posts: 371 Credit: 20,533,537 RAC: 0 |
Progress here as I stumbled across a Cuda 6 build I made, which *should* be nearly identical to that last Cuda32 one, which runs/works on anything but Maxwell. The pending WUs look promising - no overflows. It is after 1 AM here and I had to wait quite a while for a confirmation on mine so I will check back later in the morning - much later zzzzz :-). |
jason_gee Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 |
Seem to be getting valids, and seems reasonably fast, so added the Cuda60 build to bottom of Downloads page at: http://jgopt.org/download.html Assuming works on Maxwell, next would be nice to know which Linux distributions &/or Boinc versions it has problems on (and some idea of why) That would involve looking into slot directories for task stderr.txt, on problematic systems. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
Andrew Send message Joined: 28 Mar 15 Posts: 47 Credit: 1,053,596 RAC: 0 |
Progress here as I stumbled across a Cuda 6 build I made, which *should* be nearly identical to that last Cuda32 one, which runs/works on anything but Maxwell. I've hadd success with the compile on the cuda 5.5 here. Over a dozen have vailidated not long ago and its been running smoothly on the old card. http://setiathome.berkeley.edu/results.php?hostid=7533120 If you want a list of what I did to get it to compile or a diff of the changes I made I can post it or send it over. |
jason_gee Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 |
I've hadd success with the compile on the cuda 5.5 here. Over a dozen have vailidated not long ago and its been running smoothly on the old card. I'd like that list via PM please, which I will compare to what I have setup on the Ubuntu dev machine where the Cuda6 build is running, and choose the best simplifying set of updates adding in Cuda7 for the Centos install as a fresh environment. Ultimately, as briefly mentioned before, I'll be going with the Gradle build system, though it makes sense to keep the traditional gnutools setup operational, since it seems to play ball with effort. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
OTS Send message Joined: 6 Jan 08 Posts: 371 Credit: 20,533,537 RAC: 0 |
Seem to be getting valids, and seems reasonably fast, so added the Cuda60 build to bottom of Downloads page at: Looking good. None of the new WUs have validated yet but I am not seeing any overflows which is a good sign. |
jason_gee Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 |
Looking good. None of the new WUs have validated yet but I am not seeing any overflows which is a good sign. Seeing some [Cuda 6] valids there now, so Cuda 3.2 no good on Maxwell confirmed for Linux. (for this application anyway, as on Windows) "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
petri33 Send message Joined: 6 Jun 02 Posts: 1668 Credit: 623,086,772 RAC: 156 |
Looking good. None of the new WUs have validated yet but I am not seeing any overflows which is a good sign. I have done some research in/on linux and a hand full of GTX780's (1- max 4). What would be the best way to deliver the code to the community To Help Us All to do more science before we get the Jason's x42? (Jason?) I have done some kernel optimizations (780 specific, not tested anywhere else. just on my computer) and some stream-inclined/induced/oriented changes to the already Well and Good optimized Jason's and his precursors code. What would be the most neutral way to publish? (I can not host any piece of a code for three-four years) ?? I'll drop the source to Your mail box or whatever. I'm running two MB at at time with 3 GPUs and in between an AP on GPU or CPU if available. See for yourselves (on top hosts) .. (remember to divide the time by 2 for any MB). To overcome Heisenbergs: "You can't always get what you want / but if you try sometimes you just might find / you get what you need." -- Rolling Stones |
arkayn Send message Joined: 14 May 99 Posts: 4438 Credit: 55,006,323 RAC: 0 |
Looking good. None of the new WUs have validated yet but I am not seeing any overflows which is a good sign. You can post it on my site if you want. |
jason_gee Send message Joined: 24 Nov 06 Posts: 7489 Credit: 91,093,184 RAC: 0 |
Yeah current process is pretty lengthy/involved, and I've been thinking about that for some time (actually since your submissions, and my major redesigns for x42). I was finding options that I've used before too restrictive (systemically slow) for myself, or getting others' code in to test quickly & easily, and then melded into the general Berkeley sources after that (where sufficiently generalised and field proven etc) The rough conclusion that I've come to is that despite a painful transition (for me) getting more familiar with Github and the workflows for that would be best, then you (or anyone) could fork there and make pull requests for additions back to the optimised master. Periodically we'd collectively decide (with Eric and others input when we think some major revision is good enough to be called stock), and rolled into Berkeley's svn. Still nutting out the details, And working out the finer points of github, but seems like it might be more workable. "Living by the wisdom of computer science doesn't sound so bad after all. And unlike most advice, it's backed up by proofs." -- Algorithms to live by: The computer science of human decisions. |
OTS Send message Joined: 6 Jan 08 Posts: 371 Credit: 20,533,537 RAC: 0 |
Looking good. None of the new WUs have validated yet but I am not seeing any overflows which is a good sign. It would appear that with 24 Nvidia results validated with no new invalids that I finally, thanks to your had work, have the GPU happily crunching MB work units. Thank you very much. I do have one parting question in this thread. When I run top, I see the following. 13672 root 39 19 53328 41m 2352 R 100 0.5 9:44.66 MBv7_7.05r2549_ 13675 root 39 19 51132 39m 2416 R 100 0.5 9:47.39 MBv7_7.05r2549_ 13666 root 39 19 50052 30m 2372 R 100 0.4 9:47.56 MBv7_7.05r2549_ 13667 root 39 19 51132 39m 2400 R 100 0.5 9:48.68 MBv7_7.05r2549_ 13668 root 39 19 49984 30m 2360 R 99 0.4 9:49.51 MBv7_7.05r2549_ 13676 root 39 19 53324 41m 2364 R 99 0.5 9:48.11 MBv7_7.05r2549_ 13665 root 39 19 49984 30m 2396 R 98 0.4 9:46.26 MBv7_7.05r2549_ 13670 root 39 19 53328 41m 2372 R 95 0.5 9:48.00 MBv7_7.05r2549_ 13664 root 30 10 28.3g 198m 68m S 6 2.5 0:35.44 setiathome_x41z Is it safe to assume, based on the nice value of 19 for the CPU processes and 10 for the GPU processes, that the OS will allocate sufficient CPU resources to the GPU app so that it is not slowed by the CPU? That would be my take on it, and I will certainly monitor the results page to see if the GPU times change much over the just using the GPU, but thought I would ask to see if more knowledgeable people might have an opinion. Now on to see if I can get the GPU to play nice with APs as well. That is of course if there are any. I am beginning to forget what they look like :-) |
Andrew Send message Joined: 28 Mar 15 Posts: 47 Credit: 1,053,596 RAC: 0 |
Looking good. None of the new WUs have validated yet but I am not seeing any overflows which is a good sign. Depends on how much else you have going on on the machine and the amount of processing power available. If the machine is sitting idle and only BOINC is running, then it shouldn't matter.. but if you are browsing the web, playing movies, etc, it will affect how much processor time is allocated to BOINC. A nice value of 19 as per 'man renice' sets the priority so that the process will only run when nothing else on the system wants to. -19 is the exact opposite. you can set it to 0 if you want to give it fair share amongst all other processes or set it to say -5 to give it a priority other processes. Setting it any more than -5 shouldn't be necessary for any reason that I know of unless you are going for realtime processing... |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.