View Single Post
  #14 (permalink)  
Old 12-28-2007, 01:36 PM
RichAtVNS's Avatar
RichAtVNS RichAtVNS is offline
WebProWorld Pro
 

Join Date: Jul 2003
Location: New York, USA
Posts: 108
RichAtVNS RepRank 1
Lightbulb Re: Difference in rankings when searching Google datacenters

ok, a little programming information, from someone who does systems design and development for many years on large scale systems (see VNSInc.com for my Profile of Richard Jay Silverman Cofounder/CEO resume lol)

Profile of Richard Jay Silverman Cofounder/CEO

1) Someone said that the local googlebots load ther spidering information to the local datacenters. This is a collection process done in almost all large distributed redundent multiprocessing systems.

2) In all collection systems, there is no real time analysis and storage going on which causes latency. This is called "offline processing" and "propagation lag"

3) all information collected is sent to a central location and then finally modified for use in the final system in this case google.com. This is what is called a simple "store and forward process"

4) I saw the results of your query and all but google they were identical
that meant that the information had been populated back to the other datacenters. Guess what this has a term also "bridgeback" which also has a processing lag.

5) Google had 1 site put ahead for the number one position. This was obvious a paid override or an algorithm to decide the expert in a highly competative field. And a core process on its main delivery system.

6) Google.com does resolve to different ip addresses all the time for me. I use multiple browsers and machines and routes to it. So Dynamic routers are used to decide which datacenters are delivering information when you go to google.com. This is to allow "redundent systems" and "distrbuted processing".

Now for a little hypothosis on my part.

This means if any datacenter is out of whack being worked on, it is possible for google to not have it show up in the dynamic routing,
So for more accurate results for your client run your queries off of google.com.

This way your assured to be accurate are not dealing with a datacenter which may be off line or a propogation lag.

---- one final term ... since we do not know the exact algorithms google or other search engines use to decide on their ranking systems we are using heuristic approaches to doing this. This is what Marvin Minsky and his chief rival Melvin Klerer (who I studied under) used to argue about in the artiifical intelligence debates of the late 60s, 70s and early 80s!

That's why I find this forum so enjoyable at times. It's the lack of understanding for simple programming paradigns....

Of course it still means I can't spell correctly or use correct grammer! LOL

Last edited by RichAtVNS : 12-28-2007 at 01:47 PM. Reason: spelling
Reply With Quote