|
|
||||||
|
||||||
| Index Link To US Private Messages Archive FAQ RSS | ||||||
| Google Discussion Forum Google Discussion forum is for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects. |
Share Thread: & Tags
|
||||
|
![]() |
|
|
LinkBack | Thread Tools | Display Modes |
|
||||
|
Over the past week, I have noticed a disturbing trend with my pages in Google. It started with the major (highest PR) pages: they remained in Google, but they were not showing a title or description. As a few days passed, the same started to occur and now most of my pages in the index are showing this way. They are still in Google, and some are holding rankings (due to anchor text, I assume).
It is alomost as if there were "noindex" meta on all of the pages, but there are none. This has happened for multiple domains, all on the same hosting account. I use .htaccess to redirect some domains to sub-folders, a technique that had worked fine for many months with out issue. www.stanthecaddy.com&hl=en&lr=&ie=UTF-8&start=40&sa=N]Here is an example from my Seinfeld site[/url]. Most of the pages are showing no title and description. Thoughts? Advice? |
|
|||
|
I am not familiar with XHTML and have basic knowledge of XML. Anyway - i went to your site and here is how your <html> tag looks:
<html xmlns="http://www.w3.org/1999/xhtml"> Up to my understanding of this, this tag declares an XHTML page and not HTML, which by itself is actually an XML document. I bet Google, although able to scan this, may not follow links from this type of documents the regular way. I could not find some info on Google about indexing this type of documents, so if someone can share a light on this, please post to this topic. The usual html tag everyone uses is something like: <html lang="en">. Also there is a considerable amount of blank lines at the beginning of the page. While this by itself should not cause trouble with googlebot, i'd rather remove them. The third thing is the page content itself. for example by looking at http://www.stanthecaddy.com/judge-vandelay-discuss.html . There is no actual Description tag. The page contains lots of Javascript and forms, but the actual textual content is really low. If i had a page of such low content, unless this page had a lot of external links pointing to it, it would not be very strange for me to see it disappearing from the index. Out of these, the XHTML issue seems the most significant. The pages not showing title and description in Google, looks to me like a confirmation of Google treating differently than html. As to the DNS thing, i strongly doubt there's a problem on that. |
|
||||
|
Thanks for the reply, emils.
First, I am not an expert with XHTML stuff either. The format of that tag is the default for pages created by Movable Type, a CMS that powers hundreds of thousands (millions?) of pages on web. I have had no probel with that for ~14 months. However, I did remove a <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> declaration from the top of the default MT templates (because it was causing a problem with external stylesheets in Mozilla). With Google experimenting with XML/RSS indexing, perhaps a recent Google spidering change could be treating the files as XML? Yes, some pages have little content, like the short discussion thread you cited. But this is a global problem (20,000+ pages), affecting pages of all content sizes. The disturbing thing is that Google hasn't requested any of these page in at least 5 days... |
|
|||
|
Quote:
Quote:
Quote:
|
|
||||
|
Thanks, emils.
I have tried the opposite: I did not put back the DOCTYPE, but I changed it to plain <html> tags instead. I did this for one large section of one of the sites, as a test. The "home page" of the section has many links pointing to it, so hopefully GB will stop by soon... |
|
|||
|
It might be google is just getting confused about what it is spidering... Your pages are definitely not valid HTML:
http://validator.w3.org/check?uri=ww...a-discuss.html |
|
||||
|
Quote:
You're also fogetting that Google is not even requesting these pages -- or at least my server is not receiving and responding to any such requests. |
![]() |
|
| Thread Tools | |
| Display Modes | |
|
|
|
WebProWorld |
Advertise |
Contact Us |
About |
Forum Rules |
MVP's |
Archive |
Newsletter Archive |
Top |
WebProNews
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved Privacy Policy and Legal iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509 |