CUDA apps continue to run on CPU in suspend mode |
![]() |
| log in |
Questions and Answers : GPU applications : CUDA apps continue to run on CPU in suspend mode
| Author | Message |
|---|---|
|
My CUDA apps will not stop running. They normally run on the GPU until I or a setting puts Boinc into suspend, then the CUDA app just runs on the CPU and ignores and violates my CPU % usage rules as well. This also happens if windows puts the monitor to sleep. | |
| ID: 1225758 · | |
This also happens if windows puts the monitor to sleep. Why "also", it have to happen only when "windows puts the monitor to sleep" due to bug in NVIDIA driver 296.10 http://setiathome.berkeley.edu/forum_thread.php?id=67844&nowrap=true#1223973 ____________ - ALF - "Find out what you don't do well ..... then don't do it!" :) | |
| ID: 1225898 · | |
|
Because that is not the only time it happens. | |
| ID: 1225906 · | |
|
| |
| ID: 1225942 · | |
|
Thank you for your response, but you have overlooked the real problem. | |
| ID: 1226211 · | |
when Boinc is in "suspend" mode (whether I put Boinc into suspend or a rule puts it into suspend, it does not matter). So you either: - from icon menu choose Snooze or Snooze GPU http://boinc.berkeley.edu/wiki/The_BOINC_Manager#The_BOINC_Manager_icon_and_menu - from Activity Menu choose Suspend or Suspend GPU http://boinc.berkeley.edu/wiki/Advanced_view#BOINC_Manager_Menus - use cc_config.xml with <exclusive_gpu_app>important.exe</exclusive_gpu_app> http://boinc.berkeley.edu/wiki/Client_configuration http://www.boinc-wiki.info/Cc_config.xml http://boincfaq.mundayweb.com/index.php?language=1&view=91 - have preference to "Suspend GPU work while computer is in use" or other similar Instead of stopping completely, CUDA will instead quit running on the GPU and begin running on the CPU. Strange behavior. So you see in Windows Task Manager or Process Explorer that the CUDA app .exe starts using 100% of a core (~50% CPU in your case)? If this happens again - look in stderr.txt (in slots directory) to see are there messages similar to these: setiathome_CUDA: No CUDA devices found setiathome_CUDA: Found 0 CUDA device(s): setiathome_CUDA: CUDA Device 1 specified, checking... Device cannot be used SETI@home NOT using CUDA, falling back on host CPU processing 2) When CUDA applications are running on the CPU during times that they are not supposed to be running at all, they ignore the CPU settings of "max 20% CPU", and they use up to 100% on all cores. Yes, this have to be true as BOINC applies the setting of "max 20% CPU" only to CPU tasks/apps. We may consider this as "lack of feature" or bug in BOINC but I don't know is it possible for BOINC to detect this. BOINC see that CUDA app is running and don't know it have trouble to use CUDA and is "falling back on host CPU processing". There are in fact projects with not very efficient GPU apps that use "normally" a big deal of CPU along with the computing on GPU (so "GPU app" have high CPU usage at all "normal" times) So far there has been no damage because I have good cooling and I was near by and I heard the fans running at full speed and I intervened (I do not know what the temp was). To "know what the temp is" use one or several of these Temperature Monitoring Programs: http://setiathome.berkeley.edu/forum_thread.php?id=59292 Did you clean the dust/fur in the computer/fan/heatsink lately? Other may not have the same luck. By "Other" you mean "Other people"? Most people use 100% CPU all the time (for years) especially if they "have good cooling". If this is not laptop and the CPU can't run at 100% for a long time you in fact may have problem with cooling (dust, dried thermal compound, fan lubrication, ...) if the Temperature of the CPU go near to TJMax (e.g. 5-10°C to TJMax). ____________ - ALF - "Find out what you don't do well ..... then don't do it!" :) | |
| ID: 1226271 · | |
|
His first 3 CUDA units all fell back to CPU. stderr reported "emulation device, unusable". I've seen that one before, and IIRC, it's the sleep bug in 296.10. Interestingly, the last two CUDA's he reported both completed normally, though the 9400GT took awhile, and restarted several times (his suspension settings, no doubt). Then, he aborted the rest, so there's no way to know how they might have run. | |
| ID: 1226444 · | |
|
| |
| ID: 1226460 · | |
From what I hear, 301.24 is reasonably bug-free, at least for Fermi-class cards, plus, it's the default for Kepler cards. ____________ | |
| ID: 1226462 · | |
|
I kept the 285.62 driver in-case I had issues with 296.10 which it seems now like I am having if I am reading you guys correctly. | |
| ID: 1227004 · | |
Questions and Answers : GPU applications : CUDA apps continue to run on CPU in suspend mode
| Copyright © 2013 University of California |