iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Google Discussion Forum Google Discussion forum is for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 05-27-2008, 06:20 PM
WebProWorld New Member
 
Join Date: Sep 2004
Posts: 13
Global777 RepRank 0
Default What content does Google index?

When a GoogleBot visits a site, what code is actually being viewed and indexed? Is it the raw html, for example, or the html that is presented through a browser?

More to the point, are the bots indexing the css code, if the stylesheet is external, or what I would see if I did a View Source?

Thanks in advance...
Reply With Quote
  #2 (permalink)  
Old 05-27-2008, 06:55 PM
NateDesmond's Avatar
WebProWorld Member
 
Join Date: May 2008
Posts: 70
NateDesmond RepRank 0
Default Re: What content does Google index?

There is a tool called seo browser that will show you what the search engines see when they crawl your site. You can use it at Free SEO Software Tool & Text Browser, Search Engine Optimization Tools - SEO Browser.
__________________
Nate Desmond
Visit My Ecommerce Blog!
Reply With Quote
  #3 (permalink)  
Old 05-27-2008, 08:56 PM
WebProWorld New Member
 
Join Date: Sep 2004
Posts: 13
Global777 RepRank 0
Default Re: What content does Google index?

Thanks Nate. That helps.
Reply With Quote
  #4 (permalink)  
Old 05-27-2008, 10:25 PM
erikko's Avatar
WebProWorld Pro
 
Join Date: Aug 2007
Posts: 182
erikko RepRank 0
Default Re: What content does Google index?

your tool Nate is amazing, i tested it and i love it, now i will now if a site used raw html
__________________
want to create unique philippine web designs?
check our featured Malaysia Property
Reply With Quote
  #5 (permalink)  
Old 05-28-2008, 01:17 AM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default Re: What content does Google index?

Another good one:

Rex Swain's HTTP Viewer
Reply With Quote
  #6 (permalink)  
Old 05-28-2008, 03:34 AM
Janna122003's Avatar
WebProWorld Veteran
 
Join Date: Dec 2006
Posts: 344
Janna122003 RepRank 2
Default Re: What content does Google index?

Nice tool buddy. thanks for sharing it.
Reply With Quote
  #7 (permalink)  
Old 05-28-2008, 03:45 AM
jabo's Avatar
WebProWorld Pro
 
Join Date: Feb 2008
Location: car parks and under the bridge
Posts: 299
jabo RepRank 3jabo RepRank 3jabo RepRank 3
Default Re: What content does Google index?

Quote:
Originally Posted by NateDesmond View Post
There is a tool called seo browser that will show you what the search engines see when they crawl your site. You can use it at Free SEO Software Tool & Text Browser, Search Engine Optimization Tools - SEO Browser.


Great tool nate. Thanks for sharing that. I also found other useful tools in that site so it helped me alot..
Reply With Quote
  #8 (permalink)  
Old 05-28-2008, 05:15 AM
mit mit is offline
WebProWorld Pro
 
Join Date: Apr 2006
Location: INDIA
Posts: 125
mit RepRank 1
Smile Re: What content does Google index?

Quote:
Originally Posted by incrediblehelp View Post
Another good one:

Rex Swain's HTTP Viewer
Thanks for sharing the tool buddy...
Reply With Quote
  #9 (permalink)  
Old 05-29-2008, 06:47 PM
WebProWorld Veteran
 
Join Date: Apr 2004
Posts: 349
imvain2 RepRank 1
Default Re: What content does Google index?

Google also suggests using the linx browser to see what the search engines see.
Reply With Quote
  #10 (permalink)  
Old 05-29-2008, 06:56 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9
Default Re: What content does Google index?

It should probably be noted that the search engines do look beyond what the spider simulators indicate. These tools will remove the javascript, client side scripting and style sheets, which is helpful for estimating whether or not your content and navigation "work". However, the search engines do periodically check files such as your CSS stylesheets and the on page scripting to look for possible spamming.

Font sizes, the visibility of elements, color, and other factors are occasionally reviewed by the spiders, and while they may not directly impact the indexing of your site, they can cause sections of content to be ignored and can impact the ranking of a page. Most tools do not emulate this behavior, because no one outside the search engine engineers know exactly how these factors are determined or weighted. In fact, I have not seen any emulators that attempt to factor these elements into their analysis.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #11 (permalink)  
Old 05-29-2008, 09:24 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,167
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: What content does Google index?

I do not know any better tool than this one: Browser Simulator/Emulator Tool, Web Page Tester, URL Source/Headers Viewer
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #12 (permalink)  
Old 05-30-2008, 12:54 AM
Peter (IMC)'s Avatar
WebProWorld MVP
WebProWorld MVP
 
Join Date: Dec 2003
Posts: 1,485
Peter (IMC) RepRank 4Peter (IMC) RepRank 4Peter (IMC) RepRank 4Peter (IMC) RepRank 4
Default Re: What content does Google index?

Quote:
Originally Posted by Global777 View Post
When a GoogleBot visits a site, what code is actually being viewed and indexed? Is it the raw html, for example, or the html that is presented through a browser?

More to the point, are the bots indexing the css code, if the stylesheet is external, or what I would see if I did a View Source?

Thanks in advance...
Basically they index the html code and then process that data. For example they will extract all the text from the document, like those tools show. They will also look at what tags the various texts are in and the content of these various tags will be weighted differently. For example the content in the title tag has more weight.

But they´re smarter than that. They will also compare the content of various tags. For example, the page will be considered less useful (and thus rank lower) if the content of the title is unique and none of the words in it are found in the rest of the page.


It would be strange if it made a difference if the css is in the page or in a seperate file. If a simple browser can figure out where the css data is, a search engine should have no difficulty finding it too.

One thing that's important is that they do not execute code like a browser does.

And also realize that a bot is not the search engine it self. It doesn't do anything but grabbing data.
__________________
FREE SEO ! Really? YES! All you have to do is implement it!
Follow me on Twitter PeterIMC
Reply With Quote
  #13 (permalink)  
Old 05-30-2008, 01:55 AM
WebProWorld Member
 
Join Date: Oct 2004
Location: California
Posts: 42
benc007 RepRank 0
Default Re: What content does Google index?

Great tools ... thank you for sharing.
Reply With Quote
  #14 (permalink)  
Old 05-30-2008, 02:16 AM
WebProWorld New Member
 
Join Date: Sep 2004
Posts: 13
Global777 RepRank 0
Default Re: What content does Google index?

Thanks to each of you for the information. I appreciate!
Reply With Quote
  #15 (permalink)  
Old 09-09-2008, 08:21 AM
2fk 2fk is offline
WebProWorld Member
 
Join Date: Jun 2008
Posts: 70
2fk RepRank 1
Default Re: What content does Google index?

Quote:
Originally Posted by Peter (IMC) View Post
Basically they index the html code and then process that data. For example they will extract all the text from the document, like those tools show. They will also look at what tags the various texts are in and the content of these various tags will be weighted differently. For example the content in the title tag has more weight.

But they´re smarter than that. They will also compare the content of various tags. For example, the page will be considered less useful (and thus rank lower) if the content of the title is unique and none of the words in it are found in the rest of the page.


It would be strange if it made a difference if the css is in the page or in a seperate file. If a simple browser can figure out where the css data is, a search engine should have no difficulty finding it too.

One thing that's important is that they do not execute code like a browser does.

And also realize that a bot is not the search engine it self. It doesn't do anything but grabbing data.
Very understandable..very well explained... much better on the tools - learning the concept.. thanks peter
__________________
Search Engine Optimization Tutorials: Seo Query
Reply With Quote
  #16 (permalink)  
Old 10-02-2008, 01:16 PM
WebProWorld New Member
 
Join Date: Oct 2008
Posts: 11
JenKumar RepRank 0
Default Re: What content does Google index?

You can get your site spidered at SeoChat.com - they have many tools.
Reply With Quote
  #17 (permalink)  
Old 10-08-2008, 04:14 PM
sjk sjk is offline
WebProWorld Pro
 
Join Date: Oct 2004
Location: Detroit, MI
Posts: 109
sjk RepRank 0
Default Re: What content does Google index?

I use Web Sniffer quite often.
Reply With Quote
Reply

  WebProWorld > Search Engines > Google Discussion Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Does iFrame content index at all? nashville Search Engine Optimization Forum 3 06-07-2007 11:28 AM
index.(html | php | etc...) Bad? Duplicate Content? fourfivetwofour Search Engine Optimization Forum 1 03-10-2007 01:33 PM
Duplicate Content Issues Between domain.com and index.php... stretch dog Google Discussion Forum 5 06-15-2006 08:48 PM
Google Joins NASDAQ 100 Stock Index; Visualize the Index wi WPW_Feedbot Search Engine Optimization Forum 0 12-13-2005 04:00 PM
How often does Google Re-index? Haltos Google Discussion Forum 2 05-01-2004 06:01 PM


All times are GMT -4. The time now is 07:13 AM.



Search Engine Optimization by vBSEO 3.3.0