CUDA WU errs with -12 and then loops with -9s until rebooted

Questions and Answers : GPU applications : CUDA WU errs with -12 and then loops with -9s until rebooted
Message board moderation

To post messages, you must log in.

AuthorMessage
Profile RandyC
Avatar

Send message
Joined: 20 Oct 99
Posts: 714
Credit: 1,704,345
RAC: 0
United States
Message 872482 - Posted: 5 Mar 2009, 15:30:15 UTC

This just started happening on my two CUDA systems. One yesterday and then again today on the other system.

This WU got -12 error and from that point forward, all CUDA WUs get -9 overflow or -12 errors. I don't have access to logs at the moment, but scanning my tasks list this appears to be the earliest one that errored.

XP Pro-32b w/SP3, Driver is 181.20 (I believe...don't have access to system at the moment) AMD X2 4000 cpu.

Had my wife shut the system down as it's just dumping WUs like crazy for now.

My other CUDA system, after rebooting yesterday, seems to be running fine again. Cycling BOINC did NOT clear the problem as the gpu seemed to be fubar'd.
ID: 872482 · Report as offensive
Profile Jord
Volunteer tester
Avatar

Send message
Joined: 9 Jun 99
Posts: 15184
Credit: 4,362,181
RAC: 3
Netherlands
Message 872487 - Posted: 5 Mar 2009, 15:49:51 UTC - in response to Message 872482.  

By the looks of it, there's something stuck in memory. I have forwarded it to the developers, nonetheless.
ID: 872487 · Report as offensive
Profile RandyC
Avatar

Send message
Joined: 20 Oct 99
Posts: 714
Credit: 1,704,345
RAC: 0
United States
Message 872498 - Posted: 5 Mar 2009, 16:07:45 UTC - in response to Message 872487.  

By the looks of it, there's something stuck in memory. I have forwarded it to the developers, nonetheless.


Thanks. As I said, only a reboot clear the loop, so the gpu has been screwed up somehow.
ID: 872498 · Report as offensive
Profile RandyC
Avatar

Send message
Joined: 20 Oct 99
Posts: 714
Credit: 1,704,345
RAC: 0
United States
Message 872647 - Posted: 5 Mar 2009, 22:20:18 UTC

Additional info now that I'm home to collect it.

This WU got error messages:

05-Mar-2009 08:49:51 [SETI@home] Task 12ja09ae.28485.172479.3.8.180_0 exited with zero status but no 'finished' file
05-Mar-2009 08:49:51 [SETI@home] If this happens repeatedly you may need to reset the project.
05-Mar-2009 08:49:51 [SETI@home] Restarting task 12ja09ae.28485.172479.3.8.180_0 using setiathome_enhanced version 608
05-Mar-2009 08:50:17 [SETI@home] Computation for task 12ja09ae.28485.172479.3.8.180_0 finished

After the restart it finished with -9 overflow. All WUs that followed got -9 overflow or -12 error until system was shutdown.

I have restarted the system and there does not seem to be further problems with it.
ID: 872647 · Report as offensive

Questions and Answers : GPU applications : CUDA WU errs with -12 and then loops with -9s until rebooted


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.