Panic Mode On (106) Server Problems?
Author | Message |
---|---|
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13840 Credit: 208,696,464 RAC: 304 |
It would seem that the staff in the "centre" are not quite up to speed because, like Grant, my caches were COMPLETELY EMPTY. Zero tasks of any kind. Not for GPU nor CPU. Zilch. Every request met with "No tasks available". Whatever the cause it was NOT a shorty storm. They were talking about the return times for completed work after they got the servers going again. Grant Darwin NT |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Generally I just run the rescheduler in place and let it do its thing of stopping BOINC and continuing on. It's only one computer that doesn't like that if it has an Einstein task running. That will almost certainly cause a TDR fault in the video driver. That also means I lose the overclock on the video card driving the monitor. I've learned to fully exit BOINC before running the rescheduler on that machine if it has an Einstein task running. I never got around to running the Windows Task Scheduler or creating a PowerShell script to fully automate the process. I've been hands on so far and it has worked out for me. I can't say that I can correlate fully exiting BOINC with that computer starting to get tasks again after any prolonged period of receiving "no tasks available" messages. The preference flip has almost always worked for me. In fact, I think today was the first time I've experienced any ineffectiveness with that method. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . OK, my bad ... . . Put it down to frustration ......... :( Stephen :( |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13840 Credit: 208,696,464 RAC: 304 |
OK, the web site is back, but I'm still getting "Project has no tasks available" on my work requests. That's when it doesn't result in a Scheduler error. EDIT- changed application preferences & down the work came. Since installing the AP application this had been a very minor issue & didn't occur too often, or very severely; now it's becoming as bad as it was before installing AP. Grant Darwin NT |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13840 Credit: 208,696,464 RAC: 304 |
"Couldn't connect to server" on one system, "HTTP service unavailable" then "Couldn't connect to server" on the other for the last 2 Scheduler requests. Third time lucky? EDIT- Third time lucky. Grant Darwin NT |
kittyman Send message Joined: 9 Jul 00 Posts: 51477 Credit: 1,018,363,574 RAC: 1,004 |
The kittyman is alive and 'well' in Kittyland, Wisconsin, US of A. Going through a few changes and just kinda layin' low and chillin' with the kitties for a bit. Thanks to those who noticed. If you are inclined, you may read a little more in my kittyman thread in the cafe. Meowfornow. Meow. "Time is simply the mechanism that keeps everything from happening all at once." |
kittyman Send message Joined: 9 Jul 00 Posts: 51477 Credit: 1,018,363,574 RAC: 1,004 |
Anybody else notice that the Seti server timebase is almost 2 minutes ahead of UTC? Not the one used for the timestamps on these forum posts, but when I look at 'all tasks' and the time sent.............almost 2 minutes fast. "Time is simply the mechanism that keeps everything from happening all at once." |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
. . Well meow and brrrp! Stephen .. |
EdwardPF Send message Joined: 26 Jul 99 Posts: 389 Credit: 236,772,605 RAC: 374 |
I also note there are four BLC splitter jobs that haven't progressed in several hours, blc02_2bit_guppi_57835_15340_HIP48113_0051 52.39 GB (66) blc02_2bit_guppi_57835_15675_HIP49197_0052 52.39 GB (40) blc02_2bit_guppi_57835_16015_HIP48183_0053 52.39 GB (20) blc02_2bit_guppi_57835_16355_HIP49197_0054 52.39 GB (3) bump |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13840 Credit: 208,696,464 RAC: 304 |
I also note there are four BLC splitter jobs that haven't progressed in several hours, I think it might be a case of processing order for the splitters. Those files that have been sitting there for a while now, partly processed, got dropped when new files were loaded; for some reason the splitters decided the newer files needed processing first. And for whatever reason all the later files since then have been the ones chosen to split, hence the partially split files sitting up the top by themselves. Grant Darwin NT |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13840 Credit: 208,696,464 RAC: 304 |
Well, that was interesting. The Seti web site went AWOL, so did the data servers. BOINC went AWOL, as did the Berkeley IST pages & web site, as did Berkeley.edu itself. Grant Darwin NT |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Yea, we need a "Berkeley is Down" café when The Seti is Down Café is unreachable, LOL |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Yea, we need a "Berkeley is Down" café when The Seti is Down Café is unreachable, . . There is a "SETI is down Cafe"? You learn something every day :) Stephen :) [edit] I had a look and I don't think I will visit there very much, the current topic is rather sad. |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
When seti is down it becomes active - usually. |
Grant (SSSF) Send message Joined: 19 Aug 99 Posts: 13840 Credit: 208,696,464 RAC: 304 |
When seti is down it becomes active - usually. ...as usually it's only Seti that is down, not everything like this last (luckily short) outage. Grant Darwin NT |
Wiggo Send message Joined: 24 Jan 00 Posts: 36468 Credit: 261,360,520 RAC: 489 |
Well, I wonder if that server MIA put any more completed w/u's into pending waiting mode. I did a check of my pendings on 1 rig and found 8 batches (of varying sizes) in that state, though I'll have near on 100 of them clearing in the next 18hrs. Cheers. |
Gary Charpentier Send message Joined: 25 Dec 00 Posts: 30953 Credit: 53,134,872 RAC: 32 |
Well, that was interesting. I do see that the Earl Warren Data Center load balancers are due for some work. Someone might have been poking around making sure that they have a written config to fall back on and, while in, froze a box by accident. Anyway, on Thursday if it all crashes we have had a warning. http://systemstatus.berkeley.edu/ |
Brent Norman Send message Joined: 1 Dec 99 Posts: 2786 Credit: 685,657,289 RAC: 835 |
Time to see if I have enough tasks to make it this week - see you in 13 hours. LOL |
Stephen "Heretic" Send message Joined: 20 Sep 12 Posts: 5557 Credit: 192,787,363 RAC: 628 |
Time to see if I have enough tasks to make it this week - see you in 13 hours. . . . And ... we're back!! . . Less than 12 hours this week, I am amazed :) Stephen :) |
Keith Myers Send message Joined: 29 Apr 01 Posts: 13164 Credit: 1,160,866,277 RAC: 1,873 |
Yes, amazed too. Caught me off guard. It looks like I finally banked enough tasks to get through the outage. And didn't make any ghosts. I barely squeaked through with the Ryzen system for CPU tasks. Probably wouldn't have made it through the typical 13-hour outages of late. Seti@Home classic workunits:20,676 CPU time:74,226 hours A proud member of the OFA (Old Farts Association) |
©2024 University of California
SETI@home and Astropulse are funded by grants from the National Science Foundation, NASA, and donations from SETI@home volunteers. AstroPulse is funded in part by the NSF through grant AST-0307956.