iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Google Discussion Forum Google Discussion forum is for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 10-16-2005, 03:00 AM
WebProWorld MVP
WebProWorld MVP
 
Join Date: Jul 2004
Location: Omaha
Posts: 2,714
brian.mark RepRank 3brian.mark RepRank 3
Default Google much clearer now.

We bought Google. Ok, so it was just a mini, but it's cleared up my understanding of Google a bit.

Upon setting it up, nothing really great came to mind. However, after doing some tweaking (it's powering the search on toolbarn.com right now), it has become clear that the more I learn about this box and it's capabilities the more I understand Google.

For starters, I ended up having to cloak some pages to our mini to get our results to come out right. A search for makita drills gave me results of milwaukee drills as well because of our breadcrumb navigation and the cross-linking. Every page on our site was returned for power tools because it's in the main navigation. Some searches return poor results, such as makita 5000, which several people have searched for. I have a temp solution in place for that.

So, after playing with it and then sitting back to think about how it works / serves results, I figured something out that may end up being priceless.

Searches done on our site that the Google Mini return 0 results for need site changes.

I'm logging how many results the mini returns for every search that is done on our site. What I'm seeing is patterns in the way people search that yield no results. Well guess what... people search for those same phrases at Google and many times get 0 relevant results there as well. Sure, they'll get results, but the relevancy isn't there.

For example, the search for makita 5000 could be a GV5000 or an HR5000, accessories for either of those, or perhaps something else. The results at Google.com are 5000 RPM, 5000 staples per pack on a stapler page, or 5000 Watts for a generator. Why wouldn't I do some work to make my site come up #1 for makita 5000 since I've seen quite a few searches on our site for it (I'm sure I'll see more after hitting submit for this post) and the results are poor in the SERPs.

Now, that's not the only thing I've learned. Google's mini, while having some technical differences due to only being concerned with a small sampling of the web, gives me a sense of what optimizes better between 2 pages. For example, I can create a test result set and have 1 page using identical link text to point to 2 pages, then have their algo decide which is better optimized. Any SEO that just read that should be getting out their credit cards. How useful is that? I've seen some results from those experiments already within the Google SERPs. Oh, and I can supress those pages from being served in the results, allow them for a few minutes to do my test, then hide them again. Very cool.

It also makes sense now why there is a delay between crawling and showing up in the SERPs.

There is a 3 step process that the mini uses.

1) Crawl.
2) Build Index.
3) Launch / Replicate Index.

While they've undoubtedly got more processing power and storage than thousands of these little guys for their primary engine (Dual PIII with 2GB of RAM in that little blue box), indexing our site takes it over 4 hours. By default, it tries to keep no more than 4 connections open at a time to any domain. Given how many pages our site is comprised of, 4 pages at a time makes for a very long crawl time.

Once everything is crawled, the index building takes it almost 30 minutes for our sites. That's just 25,000 pages that we index out of the billions that they index. We're limiting which pages the mini crawls and assigning it a cookie so it doesn't see 100,000 different checkout page URL's to evaluate. Talk about some major processing power to build an index on the data they gather - mind blowing. When this machine takes that long for 25,000 pages it's got to take a while for their index and that's got to take more processing power than I've ever considered building. =)

Then, after everything is crawled and an index is built, it replicates the index. It copies the old index to a new location, sets the copy active, then replaces the primary index with the new build, followed by a switch to the new index after testing for our required results. After considering the safeguards that it gives by having some test searches with required results, I'm sure they've got a ton of required results to make an index active in their web search. For example, searching for microsoft better give you microsoft.com somewhere in the top so many pages of the SERPs or you've got issues.

I've got more, but I'm still pondering what useful information I can garner from the insight. Really, for under $4000 (we bought the extra year of upgrades and hardware replacement) it's probably going to be a worthwhile investment just for increasing our SERPs, let alone the search results it gives our customers.

Brian.
__________________
ToolBarn.com, an Internet Retailer Top 500 and Inc. 500 Company | Tool Parts | Pet Supplies
Reply With Quote
  #2 (permalink)  
Old 10-16-2005, 09:01 PM
WebProWorld Veteran
 
Join Date: Oct 2005
Posts: 528
aaron2005 RepRank 0
Default

I didn't even know what a "mini" was until I read this post, great stuff and intersting idea. You study it to better understand how google works? This is very cool, but I could never afford to buy a mini...so you must blog about it, what's your URL? ;)
__________________
SEO Blog
Reply With Quote
  #3 (permalink)  
Old 10-17-2005, 10:45 AM
WebProWorld MVP
WebProWorld MVP
 
Join Date: Jul 2004
Location: Omaha
Posts: 2,714
brian.mark RepRank 3brian.mark RepRank 3
Default I'll try to start that up soon..

Quote:
Originally Posted by aaron2005
I didn't even know what a "mini" was until I read this post, great stuff and intersting idea. You study it to better understand how google works? This is very cool, but I could never afford to buy a mini...so you must blog about it, what's your URL? ;)
What's this blog thing you speak of? LOL.

I'll probably have to do more blogging now... that's probably a great way to make friends and influence people (to steal a famous book title).

I'll see what I can start to post here shortly. I need to learn how my blog works a little better first. So far, I don't have any categories and it's mostly just posts about products that I found cool or how some sessions at SES went. Nothing too important. But you're probably right... I could make it my mini blog and that may get read a bit more. I'll let you know if / when I get that fired up.

Brian.
__________________
ToolBarn.com, an Internet Retailer Top 500 and Inc. 500 Company | Tool Parts | Pet Supplies
Reply With Quote
  #4 (permalink)  
Old 10-18-2005, 01:17 AM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default

Brain it seems just the 0 results information (new keywords to target and optimze for) is the worth the $4000 alone.
Reply With Quote
  #5 (permalink)  
Old 10-18-2005, 01:46 AM
WebProWorld MVP
WebProWorld MVP
 
Join Date: Jul 2004
Location: Omaha
Posts: 2,714
brian.mark RepRank 3brian.mark RepRank 3
Default

Quote:
Originally Posted by incrediblehelp
Brain it seems just the 0 results information (new keywords to target and optimze for) is the worth the $4000 alone.
Absolutely. Some of it, because of the way our internal search worked before, was previously handled by our own search. However, much of it is a bit quirky due to the way that the Google server handles it, but it's the same way for Google.com, so we have many, many new phrases to work on.

The end result should be very positive. I'm excited about it.

Oh, and there will be some details coming soon in my blog about what tags hold more weight in the algo and what other SEO tidbits I can mine from toying (combos of tags, etc.) Of course, the results may be a little different since it's considered a "trusted data source" (why would you spam yourself, right?), so some basic analysis will be interesting but not entirely applicable in non-ideal situations.

Brian.
__________________
ToolBarn.com, an Internet Retailer Top 500 and Inc. 500 Company | Tool Parts | Pet Supplies
Reply With Quote
  #6 (permalink)  
Old 10-18-2005, 01:54 AM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default

What is your blog link?
Reply With Quote
Reply

  WebProWorld > Search Engines > Google Discussion Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



All times are GMT -4. The time now is 10:34 PM.



Search Engine Optimization by vBSEO 3.3.0