No schedulers responded... (Only on Linux-SMP machnie)

Message boards : Number crunching : No schedulers responded... (Only on Linux-SMP machnie)
Message board moderation

To post messages, you must log in.

AuthorMessage
[IsraAliens] Gilboa Davara

Send message
Joined: 6 Jun 99
Posts: 32
Credit: 23,800,328
RAC: 0
Israel
Message 106395 - Posted: 2 May 2005, 8:45:53 UTC
Last modified: 2 May 2005, 8:48:00 UTC

Hello all,

I've reported this problem before and found no solution.
All my dual workstation, be that at work and at home, after a two-three weeks of processing go idle, spewing "No scheduler responded" errors.
If I delete the client, and start over, the boinc client downloads WU without a problem.

Help?

2005-05-02 11:39:19 [---] Starting BOINC client version 4.19 for i686-pc-linux-gnu
2005-05-02 11:39:19 [SETI@home] Project prefs: no separate prefs for home; using your defaults
2005-05-02 11:39:19 [SETI@home] Host ID is 782428
2005-05-02 11:39:19 [---] General prefs: from SETI@home (last modified 2004-07-19 09:51:18)
2005-05-02 11:39:19 [---] General prefs: no separate prefs for home; using your defaults
2005-05-02 11:39:19 [---] Insufficient work; requesting more
2005-05-02 11:39:19 [SETI@home] Requesting 1727789 seconds of work
2005-05-02 11:39:19 [SETI@home] Sending request to scheduler: http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
2005-05-02 11:39:25 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2005-05-02 11:39:25 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2005-05-02 11:39:25 [SETI@home] No schedulers responded
2005-05-02 11:39:25 [SETI@home] No schedulers responded
2005-05-02 11:39:25 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2005-05-02 11:39:25 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2005-05-02 11:40:26 [---] Insufficient work; requesting more
2005-05-02 11:40:26 [SETI@home] Requesting 1725572 seconds of work
2005-05-02 11:40:26 [SETI@home] Sending request to scheduler: http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
2005-05-02 11:40:37 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2005-05-02 11:40:37 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2005-05-02 11:40:37 [SETI@home] No schedulers responded
2005-05-02 11:40:37 [SETI@home] No schedulers responded
2005-05-02 11:40:37 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2005-05-02 11:40:37 [SETI@home] Deferring communication with project for 1 minutes and 0 seconds
2005-05-02 11:41:38 [---] Insufficient work; requesting more
2005-05-02 11:41:38 [SETI@home] Requesting 1725572 seconds of work
2005-05-02 11:41:38 [SETI@home] Sending request to scheduler: http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi
2005-05-02 11:41:50 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2005-05-02 11:41:50 [SETI@home] Scheduler RPC to http://setiboinc.ssl.berkeley.edu/sah_cgi/cgi failed
2005-05-02 11:41:50 [SETI@home] No schedulers responded
2005-05-02 11:41:50 [SETI@home] No schedulers responded
2005-05-02 11:41:50 [SETI@home] Deferring communication with project for 1 minutes and 52 seconds
2005-05-02 11:41:50 [SETI@home] Deferring communication with project for 1 minutes and 52 seconds


ID: 106395 · Report as offensive
[IsraAliens] Gilboa Davara

Send message
Joined: 6 Jun 99
Posts: 32
Credit: 23,800,328
RAC: 0
Israel
Message 106396 - Posted: 2 May 2005, 8:47:42 UTC - in response to Message 106395.  
Last modified: 2 May 2005, 8:49:14 UTC

I tried stracing the client, and it connects to the target server without a problem... Help anyone?
(Plus, it happens at home, even if I open the firewall for it.)

One last thing, I always happens on dual machines. I never saw it on single CPU machines.

How can I report bugs (AKA bugzilla) for boinc?

Cheers,
ID: 106396 · Report as offensive
Profile Paul D. Buck
Volunteer tester

Send message
Joined: 19 Jul 00
Posts: 3898
Credit: 1,158,042
RAC: 0
United States
Message 106456 - Posted: 2 May 2005, 14:03:46 UTC

I don't understand this. Why are you ggetting duplicated messages? Are you running two instances of BOINC on the machine?

BOINC runs well on all of my systems though I don't run the Linux versions. I do have two systems that are dual processor, and one of them has Xeons so it looks like 4 processors and it has been running like a top since I plugged it in ...
ID: 106456 · Report as offensive
Profile Neil Walker
Volunteer tester
Avatar

Send message
Joined: 23 May 99
Posts: 288
Credit: 18,101,056
RAC: 0
United Kingdom
Message 106500 - Posted: 2 May 2005, 18:48:23 UTC - in response to Message 106396.  

> One last thing, I always happens on dual machines. I never saw it on single
> CPU machines.

I recently retired my dual processor machines but they ran faultlessly for months on Linux and Windows 2000.

> How can I report bugs (AKA bugzilla) for boinc?

As you are the only person, AFAIK, to experience this problem, it seems the bug is between the operator's chair and the keyboard. Reporting a bug that doesn't appear to exist elsewhere would not help anyone.


Be lucky

Neil



ID: 106500 · Report as offensive
Profile spacemeat
Avatar

Send message
Joined: 4 Oct 99
Posts: 239
Credit: 8,425,288
RAC: 0
United States
Message 106507 - Posted: 2 May 2005, 19:41:02 UTC - in response to Message 106456.  

> I don't understand this. Why are you ggetting duplicated messages? Are you
> running two instances of BOINC on the machine?
>

My linux clients did that for a while as well, but there was no functional impact, just double messages. i dont think they do it anymore, but i have recompiled boinc and seti several times with more recent source since then.

my dual xeon doesnt have the connection problem, but it is also not running linux. i HAVE had such a connection problem like this before, and it turned out to be a bad router. my router had stopped passing http traffic, so i could never get to a scheduler or any websites even though i could ping, tracert, and ftp every test address i tried.
ID: 106507 · Report as offensive
Profile Mike Special Project $75 donor
Volunteer tester
Avatar

Send message
Joined: 17 Feb 01
Posts: 34258
Credit: 79,922,639
RAC: 80
Germany
Message 106874 - Posted: 3 May 2005, 12:21:03 UTC

Hi

I get this message too on my windows maschine today since 10 o´clock UTC.
Uploaded quickly but no shedulers responded.

greetz Mike



With each crime and every kindness we birth our future.
ID: 106874 · Report as offensive
[IsraAliens] Gilboa Davara

Send message
Joined: 6 Jun 99
Posts: 32
Credit: 23,800,328
RAC: 0
Israel
Message 111347 - Posted: 15 May 2005, 13:47:22 UTC - in response to Message 106500.  

<blockquote>> One last thing, I always happens on dual machines. I never saw it on single
> CPU machines.

I recently retired my dual processor machines but they ran faultlessly for months on Linux and Windows 2000.

> How can I report bugs (AKA bugzilla) for boinc?

As you are the only person, AFAIK, to experience this problem, it seems the bug is between the operator's chair and the keyboard. Reporting a bug that doesn't appear to exist elsewhere would not help anyone.

</blockquote>

I run a *single* client from a normal text terminal.
The double messages are there by default.
I doubt that it has anything to do with me doing something wrong as:
A. All I do is run a binary; There's not much else I can do.
B. If I delete the project's files and start over, the problem disappears.
C. I've yet to see the same problem manifest itself on a single CPU machine.

ID: 111347 · Report as offensive

Message boards : Number crunching : No schedulers responded... (Only on Linux-SMP machnie)


 
©2024 University of California
 
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.