So where's the science..? |
![]() |
| log in |
Message boards : SETI@home Staff Blog : So where's the science..?
1 · 2 · 3 · 4 · Next
| Author | Message |
|---|---|
|
As time goes on more SETI@home pariticipants are frustrated with the perceived lack of scientific progress. So what's the deal? | |
| ID: 508104 · | |
|
Nice post. | |
| ID: 508125 · | |
|
reading all that,and seeing that ,at the beginning, seti was not really perfect ,speaking about ways of data's treatment, needing now to do one main database, checking it for redundancies, cleaning it, having to do reobservations... it's an obvious fact that you need rather IBM's BluGenes or NASA's Columbia or Los Alamos supercomputers... than even all our computers in the world. Finally why not, some are available... | |
| ID: 508145 · | |
Should there be a seti2 client thats purpose it to analyze the science database much the way we look at the raw data looking for important signals? A related question is: what kind of problem is distributed computing good at solving? The first pass of SETI@home data reduction is, obviously, great for distributed computing. Each workunit is a tiny chunk of sky/frequency that needs to be whittled down to its most important features - to do so requires no knowledge of any of sky/frequency chunks, so each computer can work separately and efficiently. However, to find persistent signals, you need to have access to the *entire* database. Think of it as a game of concentration. You need the whole deck to play. I don't think people would like us sending them workunits that are a terabyte in size. Which leads to the question: Why not divide the sky up into a finite set of "pixels" and then just send the signals that fall into each pixel to other computers to process. Well.. that's actually the problem we're working on now (hence the concurrent work with skymaps). In order to make these pixels useful (i.e. based on the size of our receiver's "beam") we end up with about 15 million pixels. I think that's the right number, and if it isn't it's close enough. Well, that means on average we'll have 66 signals per pixel. Not very many. So it would be easier/faster to do this processing ourselves rather than bundle it up, send it out for analysis, and then validating the results. In fact, once the analysis engine is revved up, there's no reason we couldn't do this in real time, which is the plan. Okay I might have the math messed up in this last paragraph, but you get the idea. Basically, for final analysis, we don't need CPU power - we need human brain power for programming/analysis. - Matt ____________ -- BOINC/SETI@home network/web/science/development person -- "Any idiot can have a good idea. What is hard is to do it." - Jeanne-Claude | |
| ID: 508154 · | |
|
Matt | |
| ID: 508216 · | |
|
Hi Matt, | |
| ID: 508221 · | |
|
Thank you Matt for explaining this to us. | |
| ID: 508251 · | |
|
What I suppose is we might likely got into 10 or 50 or 5000 years long project even more longer. | |
| ID: 508254 · | |
|
Hmm... | |
| ID: 508271 · | |
|
Many many years from now, the man in charge of the SETI project will adress the crowd during the celebration after the first extraterrestrial signal was found, confirmed in its source and planet of origin, and answered. | |
| ID: 509540 · | |
Hmm... Hello ShvrDavid: You mirror a few thoughts that have been bouncing around in my head for a little while. A program does what it is told to, to work any differently requires a change in mindset, firstly by the programmer and then possibly also the hardware {depending on how radical you want to go}. The possible ebb and flow of longduration cyclic anomalies is certainly possible for this type of project.... if you have a signal that is actually too weak to break through the background "noise" and be identified as a standout spike, then looking for another form of interaction pattern often helps. eg if a signal is too weak to be detected directly it may still be strong enough to cause swelling and rolling in the background disturbance patterns... if you follow my drift... sort of a cross pattern of smaller waves on the sea top carried along with the normal waves.. so to speak gn's -genetic apps tend to be more of a adaptation and fittest survival mode program.. although i certainly am not ruling them out. nn's -neural networks can be trained to look directly for the types of additional noise riding within a noise signal. sort of like thinking about amplified noise on a noisey background, noise interferring with noise. its more a pattern recognition problem than a straight high amplitude spike detection. sort of tied in with some stuff here. and Diego... you are right... time is sometimes not on our side... but then you never know...! cheers | |
| ID: 520666 · | |
|
The signal processing is well known by now and has been for decades --it is not realistic to think that a hueristic can spring from a program to find true data amongst the noise. The data processing or "crunching" is to look for signals that have a power spectrum and distribution that are different from random noise which has a well defined spectrum. Purposive signals would stand out assuming that they have sufficient signal power so that correlation techniques could lift them out of the noise. | |
| ID: 520679 · | |
The signal processing is well known by now and has been for decades --it is not realistic to think that a hueristic can spring from a program to find true data amongst the noise. The data processing or "crunching" is to look for signals that have a power spectrum and distribution that are different from random noise which has a well defined spectrum. Purposive signals would stand out assuming that they have sufficient signal power so that correlation techniques could lift them out of the noise. Hello William: Yes, the processing surely is working hard on identifying a signal that is itself powerful enough to be recognised as a signal. I am merely musing on the possibilities of signals that are not powerful enough to be seen as such by any straight detection scheme/program and at an acceptable power level...and im not criticising the project in the least, or else i would not be here. Think for a second on the problem a few years ago of detecting submarines... if you have enough microphones in place you can hear their props beating the water... and then electro-hydrodynamic flow changed the rules.. no props, just water jets. if they are silent can you detect them... yes... by looking for disturbance patterns in the waves, or impulse/pressure waves with much longer wave lengths than the original, generated by their movement. Your correct in saying that there will not be "true data" amongst the noise, its far far too small a signal to extract anything meaningful. I am musing on the interference pattern that may be present at longrange, an anomalous oscillation of normal background noise may/may-not indicate the presence of a relatively sub harmonic signal being scattered and absorbed by the gas clouds etc. | |
| ID: 520726 · | |
Yes, the processing surely is working hard on identifying a signal that is itself powerful enough to be recognised as a signal. I am merely musing on the possibilities of signals that are not powerful enough to be seen as such by any straight detection scheme/program and at an acceptable power level... s@h already state that we are processing the data to look for signals that purposefully are intended to be found. The present search scheme is based on that assumption and looks for whatever is most clearly "artificial" that stands out against the natural background of the cosmos. Unfortunately, we're a very long way away from having the capability to start doing anything more advanced such as "spread spectrum" decoding or sniffing for even rudimentary modulated signals. We're looking for beacons only, and that's hard enough! If you're actually talking about "sensitivity", then s@h are already doing clever tricks to integrate over various time periods to pull any "non-randomness" out of the noise. The best trade-off has been made in s@h-enhanced to boost the sensitivity versus time to crunch through the data. Slightly more sensitivity could be gained but at the expense of far greater processing time that just gives ever diminishing returns. The new ALFA data is the way to go with (x6? forgot the number) sensitivity in the first place. Keep searchin', Martin ____________ Mandriva Linux A user friendly OS! See new freedom Mageia2 The Future is what We make IT (GPLv3) | |
| ID: 520917 · | |
Yes, the processing surely is working hard on identifying a signal that is itself powerful enough to be recognised as a signal. I am merely musing on the possibilities of signals that are not powerful enough to be seen as such by any straight detection scheme/program and at an acceptable power level... Good morning Martin Just to clear the air, I am very happy with what is being done here at s@h, in these days of a declining interest in science it is hard to get anything of real interest done. I was just musing on the possibilities that exist. You are correct in that the best track to take is to look for an obvious signal, even that process is hard enough. I suppose what set me off was a few other comments on the boards here and the inter-relationship they had to some processes that I am currently looking at. I think both of us suffer from a common problem of either a lack of hardware, or the time available to us exclusively to use it, to make some serious headway. now theres a thought!!!... excuse my obvoius ignorance of your operational status, but you folk are linked into an educational institute of some sort??..arent you? ... if so with some string pulling you may be able to effectively do what I do. Each "holiday" or semester break or what ever you folk call it, you "may" be able to use all the unused/spare computers and bind them into a single parallel processing cluster. There are various bootable "cluster" cdroms available {one called BCCD comes to mind} that dont touch/use the harddrive of the box you boot them on at all, they just allow you to grab its cpu and ram and networking capabilities. You can control the cluster from a master machine, then just reboot the pc when finished and it runs as normal. the cdrom does everything for you, networking, dhcp, dynamic cluster joining, automatic job distribution, load balancing... gaining a few hundred cpu's makes a lot of difference... | |
| ID: 521236 · | |
According to BOINCstats.com, there are 1,342,994 hosts currently crunching SETI. So, adding "a few hundred" would be something like a 0.05% boost (5 hundredths of one percent) -- assuming that they're "average" machines, whatever that means. ____________ | |
| ID: 521305 · | |
According to BOINCstats.com, there are 1,342,994 hosts currently crunching SETI. No; that’s the number of hosts that have ever been attached to the project. At the moment their figure for currently active hosts (those that have been granted credit in the last month) is 320,062. Likewise, of 605,982 total participants only 180,955 are considered active. ____________ | |
| ID: 521336 · | |
Hello folks A possible cross connection of thoughts here, sorry folks to those who didnt catch up on the drift of the discussioin.. so ..I was NOT referring to adding a few hundred computers to the S@H project as such. What I meant was if anyone wanted to do some specialised work as per ML's text, and do a search with different parameter sets, - Aim: To analyse longer term data patterns, or other patterns.. {not just go over the same ground again}.. - Requirement: They have access to the S@H database "or had it on a dvd's or whatever". - Hardware: They "could" have a have a high speed, high throughput, high performance system to do the analysis on, buy utilising a universities or who-evers computers over the holidays {while they are "not used"}, and join them into a "processing cluster" that effectively makes them one single parallel processing system. What I was simply musing on was the possibility to look at completly different parameters in short burst runs on a high performance system...just to have a look and see what pops out... so you see its a different beast from the one presently operational. s@h cant do it at the moment because of standard issues (eg A system that could do it.. you need high throughput to do this stuff, Expense.. you cant buy this sort of processing power in one box anyway... unless you can afford a supercomputer... and who can?.. so the next best thing is creating a supercomputer level of processing by using mass parallel distribution. There was no slight intended to S@H who do a wonderful job of looking for signals Now tell me you guys and girls at seti, you scientists and technicians... if you had the chance to get your hands on a cluster, even for short bursts... would you do it.? | |
| ID: 521423 · | |
|
A host does not always mean cpu in s@h. That over million host count is actually some several hundred thousand cpu's. | |
| ID: 521602 · | |
Where can we actually see active working seti cpu's stats? Here? I disagree about the definition of 'host'. The box I'm working on has 2 CPUs with 8 cores between them, but it's only one host (number 2901600, as it happens). | |
| ID: 521609 · | |
Message boards : SETI@home Staff Blog : So where's the science..?
| Copyright © 2013 University of California |