Submit Your Article Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Search Engine Optimization Forum SEO is much easier with help from peers and experts! The WebProWorld SEO forum is for the discussion and exploration of various search engine optimization topics. Any non (engine) specific SEO or SEM topics should go here.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #51 (permalink)  
Old 03-03-2009, 12:27 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

I assume I know now how the whole thing works.

First you need to clean up those urls with mod_rewrite and then you should write a script that automatically sets a unique link relationship for every single page.

If that solution is easier to implement than just writing an appropriate robots.txt, for me it is not.

Am I correct?

The canonical link relationship tells search engines that the preferred location of this url (the “canonical” location, in search engine speak) is http://example.com/page.html instead of http://www.example.com/page.html?sid=asdf314159265

But the canonical link relationship tag is only a "hint" to the search engine. While they'll probably use it 99% of the time, they reserve the right to handle things any way they want, in case of errors etc.

If I use the robots.txt, using in addition the "noindex" directive, I feel it will be a far much better solution.

Or am I still missing something?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #52 (permalink)  
Old 03-03-2009, 06:23 AM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,073
inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7
Default Re: New Canonical Tag from the big 3

Quote:
But the canonical link relationship tag is only a "hint" to the search engine. While they'll probably use it 99% of the time, they reserve the right to handle things any way they want, in case of errors etc.
As I said before, none of us can decide the tags effectiveness until its been road tested. One thing I've noticed is that google are now using it all over the place so they are obviously happy with it's effectiveness.

Quote:
If I use the robots.txt, using in addition the "noindex" directive, I feel it will be a far much better solution.

Or am I still missing something?
But what a robots files doesn't do is re-assign link factors to the canonical. It only blocks the non-canonicals from being indexed. So any links that appear to duplicate pages will do nothing to help the ranking of the canonical version, which is the one we will probably be trying to improve rankings for? So why not add robots exclusion, noindex tag and a canonical tag?

Any answer? --
Quote:
"One thing that id like to know john; if the tag works correctly and all seo benefits are assigned to one page then why does it matter if duplicates are in the index?"
__________________
My LinkedIn Profile - Matt Inertia's SEO Blog - SEOers.org - My Twitter

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society
Reply With Quote
  #53 (permalink)  
Old 03-03-2009, 07:40 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia View Post
As I said before, none of us can decide the tags effectiveness until its been road tested. One thing I've noticed is that google are now using it all over the place so they are obviously happy with it's effectiveness.
Some examples?

Quote:
Originally Posted by inertia View Post
But what a robots files doesn't do is re-assign link factors to the canonical. It only blocks the non-canonicals from being indexed.
It blocks them from being indexed? Do you recall what you quoted in a previous post. You quoted:

Quote:
Originally Posted by Inertia
The tag cant possibly remove pages from the index, only tell google which one is the canonical. So duplicate pages will still be in the index and continue to get indexed but they should only appear when we hit the "repeat the search with the omitted results included." option.
1. The tag can't possibly remove pages from the index. So they remain in the index? Fine. But...

2. The duplicate pages will still be in the index and continue to get indexed but they should only appear when we hit the "repeat the search with the omitted results included." option.
Can you explain which pages show up hitting the link "repeat the search with the omitted results included"? Aren't they pages of the supplemental index? If yes, what happened to PageRank Sculpting (Siloing)? Is that dead now?

Quote:
Originally Posted by Inertia
So any links that appear to duplicate pages will do nothing to help the ranking of the canonical version, which is the one we will probably be trying to improve rankings for? So why not add robots exclusion, noindex tag and a canonical tag?
Any answer? --
If I use a robots.txt with the "disallow" and "noindex" in combination with the allow directives, what is the sense to use the canonical tag in this case?

Can you clarify?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-03-2009 at 07:43 PM.
Reply With Quote
  #54 (permalink)  
Old 03-03-2009, 08:30 PM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,073
inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7
Default Re: New Canonical Tag from the big 3

Quote:
Some examples?
https://www.google.com/support/webma...ctx=tltp&hl=en

Quote:
It blocks them from being indexed? Do you recall what you quoted in a previous post. You quoted:
A robots files blocks a page, a canonical tag tells google what the original copy is... Robots.txt files and canonical tags are two different things but you seem to think im talking about them as if theyre one and the same? Im lost...

Quote:
Can you explain which pages show up hitting the link "repeat the search with the omitted results included"? Aren't they pages of the supplemental index? If yes, what happened to PageRank Sculpting (Siloing)? Is that dead now?
The pages that will show up in the expanded search will be the duplicate/supplemental pages but the page which they state is the canonical, using the new tag, will be placed in the main index (assuming it passes the other standards).

This is page rank sculpting on a lower level, its sending pagerank to the canonical.

I think your thinking about it way too much, the tag does a simple thing as ive repeated many a times in this thread...

Quote:
If I use a robots.txt with the "disallow" and "noindex" in combination with the allow directives, what is the sense to use the canonical tag in this case?

Can you clarify?
...

Quote:
But what a robots files doesn't do is re-assign inbound link factors to the canonical and tell google that its not the original. It only blocks the non-canonicals from being indexed. So any external links that appear to duplicate pages...
will point to a noindexed page which will pass that pagerank to the other pages via its external links (e.g. menu) so why not send all that juice and the link vote to the canonical?

But my other point was... whats the need to block a page if its reassigning its pagerank to the canonical? Is there any harm in having a duplicate in the supplemental index if all its pagerank is going to the canonical?
__________________
My LinkedIn Profile - Matt Inertia's SEO Blog - SEOers.org - My Twitter

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society
Reply With Quote
  #55 (permalink)  
Old 03-03-2009, 09:47 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia
As I said before, none of us can decide the tags effectiveness until its been road tested. One thing I've noticed is that google are now using it all over the place so they are obviously happy with it's effectiveness.
All over the place? I expected real life examples. Some webmaster that are happy with it's effectiveness, and not Googles claims.

Quote:
Originally Posted by inertia
What should that tell me?

I can only say to that, that Google tells a lot when the day is long. If you believe everything they say, I would like to ask a very simple question:

How is it possible that a monster search engine (biggest on the Internet) like Google, does not even understand 300 Multiple Choices HTTP Headers and show them as web site pages in their index.
But still they provide us the list of the HTTP Status Codes http://www.google.com/support/webmas...n&answer=40132 How kind of them.

So what the hell are we talking about? Are they kidding?

I have posted previously in the thread a screenshot taken on the 27th of February 2008.

But I am not the only one who is suffering. I was reading that others have the same problem, which is also very harmful: http://www.webmasterworld.com/google/3210610.htm

So come on brother...

Quote:
Originally Posted by inertia
A robots files blocks a page, a canonical tag tells google what the original copy is... Robots.txt files and canonical tags are two different things but you seem to think im talking about them as if theyre one and the same? Im lost...
How the heck do people sculpt their PR? How do people eliminated duplicated pages? Using the "nofollow" attribute, robots.txt or redirects. What are canonical issues? Aren't those issues causing duplicated pages versions? If not what else do they do? Now I am getting more confused. So please explain.

Quote:
Originally Posted by inertia
The pages that will show up in the expanded search will be the duplicate/supplemental pages but the page which they state is the canonical, using the new tag, will be placed in the main index (assuming it passes the other standards).

This is page rank sculpting on a lower level, its sending pagerank to the canonical.
Ah! So it is just a way to take out the most lucrative pages out of the supplemental index, and the bad pages found in the main index to go into the supplementals. Is that what you want to achieve? If you are a satisfied SEO with that, then I am very sorry. ME NOT! Why?

1. Matt Cutts, head of the Google Webspam Team’s quote, speaks for itself:

Quote:
"PageRank is the primary factor determining whether a url is in the main web index vs. the supplemental results."
Source: Fall weather forecast

2. Thanks to Andy Beal, you can hear Matt Cutts say:

Quote:
"If you got 60,000 pages, and you only got ’this much’ PageRank, and you divide it [...he mumbles], some of them are going to be in the supplemental index. Given 'this many people' who link to you, we’re willing to include 'this many' pages in the main index."
Source: SMX Video - Matt Cutts Explains How to Get Out of Google’s Supplemental Index

So my question to you: Does this canonical tag help me to boost my PageRank as I can achieve with PageRank Sculpting? After all I read in the thread so far, not!

Don't the existing PageRank sculpting methods already do what that "canonical attribute does, PLUS boosts the web sites PageRank?

That said, simply using or relying on that attribute we are going backwards instead of forward. Do you get my point now?

Quote:
Originally Posted by inertia
I think your thinking about it way too much, the tag does a simple thing as ive repeated many a times in this thread...
Do you know any professional SEO that should not think too much? Oh, ok.
If I am spoiling the thread, or I am getting boring, I will drop the subject right here man. No problem.

Quote:
Originally Posted by inertia
will point to a noindexed page which will pass that pagerank to the other pages via its external links (e.g. menu) so why not send all that juice and the link vote to the canonical?
That is the only wise reason I see for implementing the "canonical" attribute. And that is the only reason I added it on all the pages of the seoworkers.com .
But we took this to another level some hours ago, which my CTO and SEO Technician will explain in a few. It is about misspelled IBLs.

Quote:
Originally Posted by inertia
But my other point was... whats the need to block a page if its reassigning its pagerank to the canonical? Is there any harm in having a duplicate in the supplemental index if all its pagerank is going to the canonical?
I think I already explained above.

P.S. Please have a look at this wonderful info a Google too: http://www.google.com/support/webmas...y?answer=66359 (Please read before responding to this post.)

And for the last time:

"The canonical link relationship tag is only a "hint" to the search engine. While they'll probably use it 99% of the time, they reserve the right to handle things any way they want, in case of errors etc."

Personally I cannot find peace with that! I will go on researching and developing new methods to have as much control as possible over my own sites and my customers over theirs. We own our web sites and not the search engines. The search engines should knock our door and ask: Which pages may I enter and share with the public. And not the way around.

Is that just me? If yes, then I am very proud about that.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-03-2009 at 11:04 PM.
Reply With Quote
  #56 (permalink)  
Old 03-03-2009, 10:45 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia View Post
A robots files blocks a page, a canonical tag tells google what the original copy is... Robots.txt files and canonical tags are two different things but you seem to think im talking about them as if theyre one and the same? Im lost...

The pages that will show up in the expanded search will be the duplicate/supplemental pages but the page which they state is the canonical, using the new tag, will be placed in the main index (assuming it passes the other standards).

This is page rank sculpting on a lower level, its sending pagerank to the canonical.
If you call that method PageRank Sculpting, then I am lost. That has nothing to do with PageRank Sculpting. You are most probably telling google to re-assign the PR values (I assume) of the individual pages without elminating one of the duplicated pages. The total PR of the both pages (duplicated pages) will remain the same. If both pages stay in the main and supplemental index, then there is not sculpting. I explained already in my previous post. Or am I miss something again?

Quote:
Originally Posted by inertia View Post
will point to a noindexed page which will pass that pagerank to the other pages via its external links (e.g. menu) so why not send all that juice and the link vote to the canonical?
A page that is disallowed on the robots.txt will not be crawled. But if an external link link to a duplicated page, still the URL can appear, or even with data from links or trusted third party data sources like the ODP.
That said, it still consumes and can pass PR.

Please study this very carefully: http://www.seobook.com/robots-txt-vs...obots-nofollow
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-03-2009 at 11:37 PM.
Reply With Quote
  #57 (permalink)  
Old 03-04-2009, 01:14 AM
Narasinha's Avatar
WebProWorld Pro
 
Join Date: Aug 2003
Location: Urbana, Illinois, US
Posts: 232
Narasinha RepRank 1
Default Re: New Canonical Tag from the big 3

As John (Webnauts) previously suggested, I came in to post the present official position of SEO Workers about the use of the canonical tag.

The shiny new canonical link tag may have its uses, but I'm not sure that pagerank sculpting is one of those uses. As Webnauts indicated in his quotes from Matt Cutts and others, the presence of the tag on a page that is not the canonical location will not remove that page from the index, and from what I can see, will not result in a recovery of any pagerank lost from the non-canonical page.

As Matt Cutts indicated, the canonical link relationship is a "hint" , not a directive/mandate/requirement. The search engines are not required to follow the instruction given them in the link tag. Matt Cutts also says, "If you're a power user, exhaust alternatives first." Personally, I think of SEO professionals as "power users" in this case. If that is the case, I prefer to primarily use existing methods to eliminate canonicalization issues and perform pagerank sculpting before relying on the canonical attribute.

This certainly does not mean the canonical link relationship tag is without its uses. In addition to implementing the tag on the bulk of the pages at SEO Workers (reinforcing our existing methods), we have used it to assist in another problem which Google has yet to tackle.

Google's indexing routines, or perhaps Googlebot on its own, does not make a distinction between an http server response code of 200 ("OK", meaning that the requested document was found) and a response code of 300 ("Multiple Choices", meaning that the requested URI does not exist, but one or more close matches are available).

I'm sure that many people are not familiar with this response code. It is a response given when the Apache "mod_speling" module has been enabled, and it finds one or more partial matches to the requested URI, but not a perfect match. In some cases, such as simple misspelling, letter transposition, or capitalization, the module will generate a 301 (permanent redirect) to the correct URI. You can see this in action by going to the link http://www.seoworkers.com/SITEMAP.HTML which will automatically redirect you to the URI with proper capitalization.

Sometimes "mod_speling" finds a situation in which it cannot be certain about the correct URI. This can occur when the given URI matches the filename, but not the extension, or when more than one filename closely match the given URI. When this is the case, the server generates a "300 Multiple Choices" page, presenting links to the close matches. This page, as generated by the server, is just as stylish as a default "404 Not Found" page.

The problem with Google is that it will index the "300 Multiple Choices" pages. This is definitely not what you want to happen. This is akin to indexing 404 error pages on your site. Google is now indexing a URI that does not actually exist on your server.

Most sites now present customized 404 response pages when a file is not found. Sometimes a comical picture is shown, or a pithy quote, or even a sitemap to assist users in finding what they were looking for. How do you make a custom 300 response page? Custom "error" pages are set up on an Apache webserver by using an .htaccess file to specify the location of the HTML file to present in the event of such an error.

With a 404 error code, this is fairly straightforward. A static HTML file can suffice quite nicely. With a 300 response, a static HTML file will not show the list of links that mod_speling generates. This can be solved by using a PHP file instead. When the 300 response is given, the server also generates a variable (REDIRECT_VARIANTS) which contains a list of the URIs which closely match the requested URI, and the reasons why mod_speling believes they match. Using PHP, we parse that list into an array which is then presented as a list within the body of the response page.

So what about the canonical relationship tag? In the event that the list of URIs contains only one element, we implement the canonical tag using that single element as the canonical URI. In all cases, the response page also uses a robots meta tag with noindex, follow, noarchive, and nosnippet directives. By using the follow directive, all links on the page will be followed by the search engines. However, only the links within the list are the ones which we wish to use to pass any possible advantage to the correct page. It may not be desirable to have search engines index some URIs which may appear in the list. Such files should be excluded using the robots.txt file.

As the response page also contains our sites primary navigation elements, we wish to exclude those links from being followed. Because we strictly do not use the "nofollow" attribute, we follow the alternative advise of Google, redirecting such links with a 301 to an intermediate page that is blocked from search engines with a robots.txt file (see Paid links - Webmasters/Site owners Help). This ensures that only the links to the suspected matches will be followed by the search engines.

For an example of a 300 response page with a single match, see http://www.seoworkers.com/contact.php.
For an example of a 300 response page with multiple matches, see http://www.seoworkers.com/sitemap.php.

Though this may not be what Google, et al., had in mind when implementing support for the canonical link relationship tag, we think that such a use will help us in guiding the search engines through our sites in a way that best reflects our wishes.

--Dan Johnson, CTO, SEO Workers

Last edited by Narasinha; 03-04-2009 at 01:24 AM. Reason: Clarification.
Reply With Quote
  #58 (permalink)  
Old 03-04-2009, 01:35 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Thank you Dan (Narasihna) for your valuable time and effort for explaining our company position about the pros and cons of the "canonical tag."
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #59 (permalink)  
Old 03-04-2009, 06:50 AM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,073
inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Narasinha
This certainly does not mean the canonical link relationship tag is without its uses. In addition to implementing the tag on the bulk of the pages at SEO Workers (reinforcing our existing methods)
So your company are now doing what i suggested...

Quote:
Originally Posted by me
So why not add robots exclusion, noindex tag and a canonical tag?
Quote:
Originally Posted by webnauts
All over the place? I expected real life examples. Some webmaster that are happy with it's effectiveness, and not Googles claims.
I said GOOGLE have started using it and gave you an example, as you requested.

Quote:
Originally Posted by You
If you believe everything they say,
I dont and if you had read my earlier posts properly you would have figured that out...

Quote:
Originally Posted by Me
I suppose it all comes down to the effectiveness of the tag but as with other SE co-creations (thinking nofollow attribute) theres probably issues which we are yet to uncover.
I'm doing my own experimentation on how the canonical tag works but we are discussing the theory. How can any of us know if it works yet!?

I assume that you are posting this link because you believe the article to be 100% correct. If that is the case then can you explain to me how blocking a duplicate page from indexing with robots.txt and a meta noindex is pagerank sculpting? A noindex page can still build pagerank(according to aaron wall)...

Quote:
Originally Posted by aaron walls blog

So what would be the best way to sculpt your pagerank then if you need those links on the page?
ravetildon on August 06, 2008 11:02 AM

rel=nofollow on the link itself...all the other techniques have you spending PageRank on the pages you do not want indexed.
Aaron Wall on August 06, 2008 11:20 AM
But you dont use nofollow so how do you scuplt pagerank? Do you add noindexed 301s to certain internal links?

Thanks for the info on 300s and thank you for the official position of the SEO Workers.
__________________
My LinkedIn Profile - Matt Inertia's SEO Blog - SEOers.org - My Twitter

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society

Last edited by inertia; 03-04-2009 at 06:54 AM.
Reply With Quote
  #60 (permalink)  
Old 03-04-2009, 07:41 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia View Post
So your company are now doing what i suggested...
I started somewhere in the the thread mentioning about the use of the "canonical" tag for misspelling and for nothing else. We did not implement for the purpose discussed in the thread, e.g. PageRank Sculpting or canonical issues. We only added that as an additional feature (reinforcing our existing methods) for possible misspelled IBLs, which our Apache Module might cannot handle.

Quote:
Originally Posted by inertia View Post
I said GOOGLE have started using it and gave you an example, as you requested.
Oh. Google is testing that on their own site? I thought that Google are power users. Didn't Matt Cutts advised power users to avoid using that?

Quote:
Originally Posted by inertia View Post
I dont and if you had read my earlier posts properly you would have figured that out...
Then we are standing on the same side there.

Quote:
Originally Posted by inertia View Post
I'm doing my own experimentation on how the canonical tag works but we are discussing the theory. How can any of us know if it works yet!?
We never claimed that it does not work. But when Matt Cutts indicated, that the canonical link relationship is a "hint" and not a directive/mandate/requirement, we do not see any reason going into to a test, since that tells us in advance how much reliable it can be. Do we disagree?

Quote:
Originally Posted by inertia View Post
I assume that you are posting this link because you believe the article to be 100% correct.
Well if Matt Cutts is lying is many interviews I read about that, then I feel confirmed and great, when we implement every possible (even if others say are redudant) Internet technologies to avoid being abused by search engines.

Quote:
Originally Posted by inertia View Post
If that is the case then can you explain to me how blocking a duplicate page from indexing with robots.txt and a meta noindex is pagerank sculpting? A noindex page can still build pagerank(according to aaron wall)...
Please read again the post of Aaron Wall. It is so clear, that I have no idea how to make that even more clear. Or post a specific example.

Quote:
Originally Posted by inertia View Post
But you dont use nofollow so how do you scuplt pagerank? Do you add noindexed 301s to certain internal links?
We do PageRank Sculpting using robots.txt, implementing the required directives like "Allow", "Disallow" and only for Google "Noindex". In addition where necessary we also use the meta robots directives and X-Robots too. Whatever needed.

Just a couple examples: http://www.gameshop.gr/robots.txt or http://www.seoworkers.com/robots.txt

And about the alternative to the nofollow" attribute, Dan tried to explain that above, but for more details have a look at my post in our forums:
Googlebot will follow and index these links? - Page 3 - SEO Workers Forums

Quote:
Originally Posted by inertia View Post
Thanks for the info on 300s and thank you for the official position of the SEO Workers.
You are always welcome man. You know that.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-04-2009 at 08:00 AM.
Reply With Quote
  #61 (permalink)  
Old 03-04-2009, 09:05 AM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,073
inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7
Default Re: New Canonical Tag from the big 3

What are your opinions on this Rand article: SEOmoz | 12 Easy Mistakes that Plague Newcomers to the SEO Field?

Specifically these points...

Quote:
5. Blocking Bots Access to Duplicate Content
I think fear plays a major part in this one. People read that duplicate content causes a penalty, so they block bots from accessing duplicate versions of their content. The problem is that you lose the link juice those pages may accumulate and potentially restrict access to pages that would let the engines better crawl and index your site. If you want to fight duplicate content, try either 301 re-directing the duplicate version back to the original or employing the new canonical URL tag (if & when appropriate). Don't just go blocking bots from pages unless you're sure you know what you're doing.
Quote:

7. Blocking Bots Rather than Using Nofollow

I think there's been some confusion about how PageRank sculpting works. You should NOT block bots access to pages you don't want to send link juice to - in fact, this doesn't even accomplish that goal. If a link points to a page, even if that page is blocked via robots.txt or meta robots noindex, it still accrues link value metrics. The only way to stop a link from passing juice is to use a nofollow (or to make it via an external Javascript redirect or embed in a non-HTML-parseable object). Please be careful about what you noindex or block to bots and why - you don't just hurt that page, you can hurt downstream areas of your site by walling off navigation paths too (and even stop link juice that flows in from flowing out).
__________________
My LinkedIn Profile - Matt Inertia's SEO Blog - SEOers.org - My Twitter

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society
Reply With Quote
  #62 (permalink)  
Old 03-04-2009, 10:00 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia View Post
What are your opinions on this Rand article: SEOmoz | 12 Easy Mistakes that Plague Newcomers to the SEO Field?

Specifically these points...
I respect the opinion of Rand, but I do not need to agree if I disagree somewhere.

Quote:
Originally Posted by inertia View Post
5. Blocking Bots Access to Duplicate Content
I think fear plays a major part in this one. People read that duplicate content causes a penalty, so they block bots from accessing duplicate versions of their content.
I agree. There is no duplicated content penalty.

Quote:
Originally Posted by inertia View Post
The problem is that you lose the link juice those pages may accumulate and potentially restrict access to pages that would let the engines better crawl and index your site.
I fully agree. I think I mentioned that in the thread already.

Quote:
Originally Posted by inertia View Post
If you want to fight duplicate content, try either 301 re-directing the duplicate version back to the original
Is this a spiral tribe? Again read my previous posts and the post of Aaron Wall.

But it would have been also a nice idea if Rand, you or someone else could explain how could we deal with that efficiently. For example with an online shop with ugly non-mod-rewritten urls .

Quote:
Originally Posted by inertia View Post
or employing the new canonical URL tag (if & when appropriate). Don't just go blocking bots from pages unless you're sure you know what you're doing.
Our position about that is stated above. And when we block pages, you can be sure that we know what we are doing.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-04-2009 at 10:09 AM.
Reply With Quote
  #63 (permalink)  
Old 03-04-2009, 10:22 AM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,073
inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Rand
The only way to stop a link from passing juice is to use a nofollow (or to make it via an external Javascript redirect or embed in a non-HTML-parseable object).
When you block a page using robots directives do you also block all the site links going to that page?
__________________
My LinkedIn Profile - Matt Inertia's SEO Blog - SEOers.org - My Twitter

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society
Reply With Quote
  #64 (permalink)  
Old 03-04-2009, 12:58 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia View Post
When you block a page using robots directives do you also block all the site links going to that page?
If you are asking, "When linking to a page blocked by robots.txt, is that link not counted against your outgoing pagerank?" The answer would be no.

Lets say you have a page with three links, and a pagerank of 2. One link is a normal ordinary link, one is a link to a page blocked by robots.txt, and one has the nofollow tag. When Google calculates the distribution of your pagerank to the pages you link to, the page with the nofollow link is ignored. As a result, as far as Google is concerned, the page contains two valid links, each of which receives 1/2 of the available pagerank, or 1 point.

Now, when Google goes to index the two linked pages that it could "see" on your page, the first page is credited with it's 1 point of pagerank. The second page is inaccessible because it is blocked by robots.txt, so Google either drops that point of pagerank, or credits it to a page that is not in the index. In either case, that pagerank point is essentially lost.

There are two processes at work here - the distribution of pagerank from the linking page, and the crediting of pagerank to the destination page. This is a one-way process. Once pagerank is distributed, if Google later determines that it can't be credited (noindex meta tag, robots.txt directive, etc) it will not go back and recalculate the distribution of pagerank to the other linked-to pages.

This was really off-topic, wasn't it?
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #65 (permalink)  
Old 03-04-2009, 01:41 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

I know, I know, I'm slow in replying and responding to topics like this. Had to do some reading and research.

Like this new tag, there are a lot of conventions, both new and old, which Google has a "typical" way of handling, but with which there are often exceptions. For example, when there is a 301 redirect, the destination URL is usually shown in the results page, however, sometimes the original URL is shown. I think this new tag needs to be looked at in light of these other conventions, as well as Sitemaps (also called hints by Google) and internal link structures.

Google has a few different goals that they seem to be trying to achieve with this new tag. The first thing is that they want to have a way of creating a cluster of the URLs that are likely to be the same content. This is the overall goal, really.

From that, they also want to know what is the preferred URL to display in the search results. When you use this tag, you are explicitly telling Google "for every duplicate URL you find in this cluster, I prefer the URL /blahblah.bla." Now, that doesn't mean Google will always follow your stated preference. If every other site on the web uses one of the other URLs in the cluster, for example, Google is likely to use that externally-preferred variation over the webmaster preference.

The other sub-goal is to keep up with changes to the content. As you know, Google crawls web sites asynchrously. By using this new tag, if Google sees one page in the cluster has changed, it knows what the rest of the pages in the cluster are, and can check all or some of the others to see if they have also changed - this gives Google a few benefits. Pages merged because of duplicate content would no longer bounce in and out of the merge when content is updated but not fully crawled, and Google can crawl less pages. Once Google knows that 5 pages are in a cluster, when it detects one page is updated, it could simply pick a random second page to check to verify the change is cluster-wide (as opposed to the updated page no longer being in the cluster) and then assume the change applies to the entire cluster.

Like sitemaps, this system seems to me to be a way for webmasters to provide the search engines with additional information about the structure of your own web site when the search engines have no other way of getting that information. I don't think the information and preferences from this tag will be considered more authoritative than external links for example, but I do think it will be weighted equivalently.

At the very least, this gives a much cleaner alternative to the method of pure guesswork that search engines have been using up to now. For example, to handle session IDs, according to the instruction manual for Google's database engine, each element of the query string would be compared to the known session identifier names from a wide range of known server technologies, and removed if they were on the list. Of course, if you named a product id field the same as a known session id, you would risk that portion of the query string being automatically removed.

Quote:
Originally Posted by Webnauts View Post
But it would have been also a nice idea if Rand, you or someone else could explain how could we deal with that efficiently. For example with an online shop with ugly non-mod-rewritten urls.
All this notwithstanding, if you want to ensure that Google indexes your content with the exact URLs you want, and without the possibility of duplication, the only way to do it would be with 301 redirects. The (seemingly) hardest part would probably be preventing duplication in query strings because the parts of the string are out of order. This could probably be handled with a simple PHP code.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #66 (permalink)  
Old 03-05-2009, 04:20 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia View Post
What are your opinions on this Rand article: SEOmoz | 12 Easy Mistakes that Plague Newcomers to the SEO Field?

Specifically these points...

Quote:
5. Blocking Bots Access to Duplicate Content
I think fear plays a major part in this one. People read that duplicate content causes a penalty, so they block bots from accessing duplicate versions of their content. The problem is that you lose the link juice those pages may accumulate and potentially restrict access to pages that would let the engines better crawl and index your site. If you want to fight duplicate content, try either 301 re-directing the duplicate version back to the original or employing the new canonical URL tag (if & when appropriate). Don't just go blocking bots from pages unless you're sure you know what you're doing.
What about a 301 redirect instead of a Javascript redirect? For example Google advises:

Quote:
Originally Posted by Google
Not all paid links violate our guidelines. Buying and selling links is a normal part of the economy of the web when done for advertising purposes, and not for manipulation of search results. Links purchased for advertising should be designated as such. This can be done in several ways, such as:
  • Adding a rel="nofollow" attribute to the <a> tag
  • Redirecting the links to an intermediate page that is blocked from search engines with a robots.txt file
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-05-2009 at 04:23 AM.
Reply With Quote
  #67 (permalink)  
Old 03-05-2009, 04:45 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Wige I see the dagling (links, pages) came into play, so I was wondering what about the following alternative:

Google advises:

Quote:
Hidden text and links

Print
Hiding text or links in your content can cause your site to be perceived as untrustworthy since it presents information to search engines differently than to visitors. Text (such as excessive keywords) can be hidden in several ways, including:
  • Using white text on a white background
  • Including text behind an image
  • Using CSS to hide text
  • Setting the font size to 0
Hidden links are links that are intended to be crawled by Googlebot, but are unreadable to humans because:
  • The link consists of hidden text (for example, the text color and background color are identical).
  • CSS has been used to make tiny hyperlinks, as little as one pixel high.
  • The link is hidden in a small character - for example, a hyphen in the middle of a paragraph.
If your site is perceived to contain hidden text and links that are deceptive in intent, your site may be removed from the Google index, and will not appear in search results pages. When evaluating your site to see if it includes hidden text or links, look for anything that's not easily viewable by visitors of your site. Are any text or links there solely for search engines rather than visitors?
We developed about a year ago a PHP script, which we can hide content (text and links) for the search engines we desire.

As we mentioned above, we have internal and/or external links we do not want search engines to follow, so we redirect them to an intermediate page which is blocked from search engines with a robots.txt file. The alternative to the "nofollow" attribute.

If we would hide those links from the search engines but not from the users, will the search engine consider that we are trying to deceive them?

As far I understand Googles guideline seems to be fine. Or not?

If that would be allowed, and we would hide those links, we would not that the dagling issues.

Any thoughts?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #68 (permalink)  
Old 03-05-2009, 04:55 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia View Post
So your company are now doing what i suggested...
I thought I must clarify this once again. We implemented the tag only on the 300 Multiple Choices page of SEO Workers (reinforcing our methods), which we have used to assist in another problem which Google has yet to tackle (not understand 300 HTTP Responses.)

Just to avoid any misunderstandings.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #69 (permalink)  
Old 03-05-2009, 05:55 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Wige a question. Matt Cutts posted:

Quote:
At a link level, you can add a nofollow tag on the granularity of individual links to prevent Googlebot from crawling individual links (you could also make the link redirect through a page that is forbidden by robots.txt). Bear in mind that if other pages link to a url, Googlebot may find the url through those other paths. If you can, I’d recommend using .htaccess or robots.txt (at a directory level) or meta tags (at a page level) to be safe. I’ve seen people try to sculpt Googlebot visits at the link level, and they always seem to forget and miss a few links.
We make some links redirect through a page that is forbidden by robots.txt.

Where do you see the difference between the "nofollow" and to what we are doing? Can you explicit? Or didn't I understand something from your previous posts?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #70 (permalink)  
Old 03-05-2009, 07:01 AM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,073
inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7
Default Re: New Canonical Tag from the big 3

Quote:
What about a 301 redirect instead of a Javascript redirect?
Quote:
Originally Posted by inertia
Do you add noindexed 301s to certain internal links?
Quote:
Originally Posted by Webnauts
We implemented the tag only on the 300 Multiple Choices page
Are you sure...
Quote:
Originally Posted by Narasinha
In addition to implementing the tag on the bulk of the pages at SEO Workers (reinforcing our existing methods), we have used it to assist in another problem which Google has yet to tackle.
I dont see how it can be of any use on 300 pages as they are a server generated response to incorrect urls. Are you dynamically generating the canonical depending on the url?

Quote:
If we would hide those links from the search engines but not from the users, will the search engine consider that we are trying to deceive them?
Thats a bit off topic john. I wouldnt hide anything intentionally. The only time when i touch that area is with image replacement.
__________________
My LinkedIn Profile - Matt Inertia's SEO Blog - SEOers.org - My Twitter

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society
Reply With Quote
  #71 (permalink)  
Old 03-05-2009, 07:14 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by inertia View Post
Are you sure...
Too sure to be true. Or did I miss any?

Quote:
Originally Posted by inertia View Post
I dont see how it can be of any use on 300 pages as they are a server generated response to incorrect urls. Are you dynamically generating the canonical depending on the url?
Yes. I mentioned many times in this thread already man.

Quote:
Originally Posted by inertia View Post
Thats a bit off topic john. I wouldnt hide anything intentionally. The only time when i touch that area is with image replacement.
It was a pure theoretical question.

Image replacement? We used to do that, but we found a better alternative to overcome its need.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-05-2009 at 07:16 AM.
Reply With Quote
  #72 (permalink)  
Old 03-05-2009, 07:30 AM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,073
inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7inertia RepRank 7
Default Re: New Canonical Tag from the big 3

Quote:
Image replacement? We used to do that, but we found a better alternative to overcome its need.
Oh right. What alternative is that?
__________________
My LinkedIn Profile - Matt Inertia's SEO Blog - SEOers.org - My Twitter

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society
Reply With Quote
  #73 (permalink)  
Old 03-05-2009, 11:01 AM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
Wige a question. Matt Cutts posted:

We make some links redirect through a page that is forbidden by robots.txt.

Where do you see the difference between the "nofollow" and to what we are doing? Can you explicit? Or didn't I understand something from your previous posts?
I think there is a point that you miss when it comes to linking through an interstitial page that is blocked by robots.txt. Google wants the destination of a paid link to not receive pagerank from the paid link. This method does accomplish that. However, you "bleed" that pagerank - it is lost and doesn't benefit anything else linked to from your page.

So, lets say you have a pagerank of 4 (using the simple example of TBPR rather than internal PR) which contains only two links. One link is paid, the other is not. If you have the paid link go through an interstitial page, and the other link is direct, Google sees the page as having two valid outgoing links. The page's pagerank is divided in half, and two points are reserved for each of the links. Because Google can't go to the interstitial page, it can't credit those two points to the link buyer. However, Google now drops those two points of pagerank.
4points available = 2points to the natural link + 2points for paid link ignored.

Now lets say you have the same page, but mark the paid link with nofollow. When Google crawls the page, it only sees one valid link on the page. That link receives all of the pagerank available from the page, a full four points.
4points available = 4points to the natural link + 0points reserved for paid link.

In both scenarios, Google's goal of pagerank not going to paid links is accomplished. However, the second scenario is more beneficial to the webmaster because all of the page's pagerank is being distributed. None gets discarded.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #74 (permalink)  
Old 03-05-2009, 11:07 AM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
I thought I must clarify this once again. We implemented the tag only on the 300 Multiple Choices page of SEO Workers (reinforcing our methods), which we have used to assist in another problem which Google has yet to tackle (not understand 300 HTTP Responses.)

Just to avoid any misunderstandings.
Just out of curiousity, because I haven't tested with 300 responses, what scenario do you use these pages to address?

And what does Google do when it encounters the 300 page? Does it consider it an HTTP error and not index the page, or index it with an error message, or treat it as something else?

My big wish is that Google would support 400 response codes, but since most webmasters don't how to implement or use them, I doubt it would ever happen.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #75 (permalink)  
Old 03-05-2009, 11:30 AM
crankydave's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Aug 2004
Location: Playing with fire!
Posts: 4,328
crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by wige View Post
So, lets say you have a pagerank of 4 (using the simple example of TBPR rather than internal PR) which contains only two links. One link is paid, the other is not. If you have the paid link go through an interstitial page, and the other link is direct, Google sees the page as having two valid outgoing links. The page's pagerank is divided in half, and two points are reserved for each of the links. Because Google can't go to the interstitial page, it can't credit those two points to the link buyer. However, Google now drops those two points of pagerank.
4points available = 2points to the natural link + 2points for paid link ignored.
If I'm reading and understanding this correctly, the link going through to the interstitial page would be treated as a "dangling link" and therefor dropped from the equation. It which case, the available PR would be sent via the other link.

Dave
Reply With Quote
  #76 (permalink)  
Old 03-05-2009, 04:53 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by wige View Post
Just out of curiousity, because I haven't tested with 300 responses, what scenario do you use these pages to address?
To automatically correct simple speling errors with the Apache Module mod_speling.c

But thats no all. Read the post of Narasinha New Canonical Tag from the big 3

Quote:
Originally Posted by wige View Post
JAnd what does Google do when it encounters the 300 page? Does it consider it an HTTP error and not index the page, or index it with an error message, or treat it as something else?
It treats it as every other single normal web page.
See screenshot and facts: New Canonical Tag from the big 3


Quote:
Originally Posted by wige View Post
My big wish is that Google would support 400 response codes, but since most webmasters don't how to implement or use them, I doubt it would ever happen.
My big wish is that Google will learn to understand the real temporary redirects like 307, instead of that 302.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #77 (permalink)  
Old 03-05-2009, 05:12 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by wige View Post
However, the second scenario is more beneficial to the webmaster because all of the page's pagerank is being distributed. None gets discarded.
Dave & Wige, what about my scenario I mentioned above? New Canonical Tag from the big 3
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-05-2009 at 05:30 PM.
Reply With Quote
  #78 (permalink)  
Old 03-05-2009, 05:30 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Wige, about the redirect technique we use for links we do not want to pass PR, Matt Cutts claims:

Quote:
We may see links to the pages with the nocrawl parameter, but we won’t crawl them. At most, we would show the url reference (the uncrawled link), but we wouldn’t ever fetch the page.
Source: Googlebot: Keep out!

About Google showing the url reference is not possible, since I have additionally implemented the in the robots.txt the "noindex" directive.

What is Google doing different when someone uses the nofollow attribute? Can't they see the links to the pages, but they just don't crawl them?

Can you please explain?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 03-05-2009 at 05:33 PM.
Reply With Quote
  #79 (permalink)  
Old 03-05-2009, 05:33 PM
crankydave's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Aug 2004
Location: Playing with fire!
Posts: 4,328
crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
Dave & Wige, what about my scenario I mentioned above? New Canonical Tag from the big 3
It seems to me that should work John.

The link should be treated as a dangling link and thus, removed from PR caculations. As long as your visitors can get to the page you'd be fine IMO. As long as the SE's can't get to and don't index the page you should be fine from a PR perspective also. I don't see where would be any issues.

Dangling links aren't a problem as I see it John. They are simply removed from PR calculations and then available PR is split among the remaining links much like the "nofollow" does.

Dave

Last edited by crankydave; 03-05-2009 at 05:36 PM.
Reply With Quote
  #80 (permalink)  
Old 03-05-2009, 05:38 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by crankydave View Post
If I'm reading and understanding this correctly, the link going through to the interstitial page would be treated as a "dangling link" and therefor dropped from the equation. It which case, the available PR would be sent via the other link.

Dave
My understanding is that it works in the opposite way. Once Google has decided how much pagerank should go to each link, if it later determines that it can't deliver the pagerank for one of the links, it will never go back and recalculate how the pagerank is distributed. It might be easier to think of it as the pagerank goes to the redirect page, but since the redirect page is not indexed, those two points just "die" there.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #81 (permalink)  
Old 03-05-2009, 05:43 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
Wige, about the redirect technique we use for links we do not want to pass PR, Matt Cutts claims:


Source: Googlebot: Keep out!

About Google showing the url reference is not possible, since I have additionally implemented the in the robots.txt the "noindex" directive.

What is Google doing different when someone uses the nofollow attribute? Can't they see the links to the pages, but they just don't crawl them?

Can you please explain?
Nofollow says a few different things to the spider:
  1. Don't add me to the discovery index.
  2. Don't include me when distributing outgoing pagerank.
When the bot divides the pagerank among the pages linked to by the page, it does not count the links with "nofollow" in that calculation. They don't get a share of the page rank.

Probably saying that you are "hiding" the link is not the best way of phrasing it in my post. What you are doing is telling Google you don't want the link to be considered in certain calculations and processes.
__________________
The best way to learn anything, is to question everything.

Last edited by wige; 03-05-2009 at 05:45 PM.
Reply With Quote
  #82 (permalink)  
Old 03-05-2009, 05:51 PM
crankydave's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Aug 2004
Location: Playing with fire!
Posts: 4,328
crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by wige View Post
My understanding is that it works in the opposite way. Once Google has decided how much pagerank should go to each link, if it later determines that it can't deliver the pagerank for one of the links, it will never go back and recalculate how the pagerank is distributed. It might be easier to think of it as the pagerank goes to the redirect page, but since the redirect page is not indexed, those two points just "die" there.
Here's an exerpt from the original paper...

Quote:
"Dangling links are simply links that point to any page with no outgoing links. They affect the model because it is not clear where their weight should be distributed, and there are a large number of them. Often these dangling links are simply pages that we have not downloaded yet..........Because dangling links do not affect the ranking of any other page directly, we simply remove them from the system until all the PageRanks are calculated. After all the PageRanks are calculated they can be added back in without affecting things significantly."
PR preservation would seem to indicate that only the link would be removed and not the available PR that they would pass ie. links are removed, PR divvied up among the remaining links and calculated. We already know that "nofollow" works in this manner. It wouldn't make sense to have dangling links to work otherwise.

Dave

Last edited by crankydave; 03-05-2009 at 05:58 PM.
Reply With Quote
  #83 (permalink)  
Old 03-05-2009, 06:06 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Andy Beard tells:

Quote:
Dangling Pages

One of the best descriptions of dangling pages is on the Webworkshop site, though they are assuming that links are totally taken out of the equation based on what they quote from the PageRank paper.

Quote:
"Dangling links are simply links that point to any page with no outgoing links. They affect the model because it is not clear where their weight should be distributed, and there are a large number of them. Often these dangling links are simply pages that we have not downloaded yet……….Because dangling links do not affect the ranking of any other page directly, we simply remove them from the system until all the PageRanks are calculated. After all the PageRanks are calculated they can be added back in without affecting things significantly." - extract from the original PageRank paper by Google’s founders, Sergey Brin and Lawrence Page.
Alternate interpretation

This is just an aside, as the amount of juice lost to dangling pages currently is hard to determine, and could be handled differently

They are assuming that if page A links to 6 other pages, 5 of them being dangling links, then the website will be treated as only having 2 pages until the end of the calculation.

Whilst I haven't delved into the maths (and probably couldn't through lack of information and lack of knowledge), it also seems to me that at the time the pages are taken out of the cyclic calculation, a percentage of the link value can still be taken with them.

Thus though the site for cyclic calculations will be just 2 pages, the link from A to B might only transfer 1/6 of the juice on each cycle.

At the time the original paper was written, Google only had a small proportion of the web indexed due to hardware and operating system restraints.
In modern times they have a lot more indexed, thus a more complex way of handling dangling pages could be possible.

More food for thought, a link to a page that is considered supplemental could be treated as a full link or as a link to a dangling page, or some other variant.

Even more food for thought, a site with multiple interlinked pages with no external links at all could be looked on as a "dangling site".

Ultimately what is important is that dangling pages are a juice leak, though it is difficult to determine exactly how much.
Source: SEO Linking Gotchas Even The Pros Make | Andy Beard - Niche Marketing

So whats next?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #84 (permalink)  
Old 03-05-2009, 06:08 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

I know, I am going out of order. Some people type too fast...

Quote:
Originally Posted by Webnauts View Post
To automatically correct simple speling errors with the Apache Module mod_speling.c

But thats no all. Read the post of Narasinha New Canonical Tag from the big 3

It treats it as every other single normal web page.
See screenshot and facts: New Canonical Tag from the big 3

My big wish is that Google will learn to understand the real temporary redirects like 307, instead of that 302.
My first question (unless you wrote the mod, this is mostly rhetorical) would be, why does the mod_speling.c utility use a 301 redirect? Syntactically, that is not the proper way to handle it - it should be a 302 redirect if only a single match is found.

Just a review of the (pertinent to this discussion) response codes, for those following along. This is in the format Code - Official definition - Translation:
300 - Multiple Choice - "The URL you requested doesn't exist. I found several matches." or "The URL you requested used to exist. The contents have been split into multiple files."
301 - Moved Permanently - "The requested URL used to exist, but the file is now at another URL"
302 - Found - The URL you requested does not exist. However, I found a match."
400 - Gone - "The URL you requested used to exist. It doesn't anymore. You don't need to ask again."
404 - Not Found - "The URL you requested is not available. I don't know if it ever existed, but I can't find anything similar to what you are looking for. Keep checking though, it might be created/come back."

Now, only two of these error codes are supposed to be considered permanent, 301 and 400. These are used when files are deleted or moved. All of the other codes are intended to indicate temporary situations. Well, 404 is considered both - the server is not sure if the condition is temporary or permanent. If it knew, it would respond with one of the other mentioned codes.

So what does this mean in practice? Lets say you have a page, contact.html. Now lets say someone requests contact.php. If you have the right software, your server should respond with a 302 message ("You requested contact.php. That is invalid. I FOUND contact.html. Go there.") or it could respond with a 300 message ("You requested contact.php. That is invalid. I found contact.html and contact.asp. CHOOSE one.")

In both situations, the server is guessing what the user is looking for.

So, for the way search engines handle 300, I am not suprised that they index the page. There is no reason not to, since 1) the page doesn't exist, 2) it has links to pages that do exist, 3) it is subject to change, 4) in this case, it requires user action - the user has to decide which page they actually want.

Now, with all that said, in general use, 301, 302 and 404 are used much differently, and 300 and 400 are rarely used at all. The search engines have to cater to both the power users who actually know how this stuff works, and the casual webmaster who just has the basics down and just wants things to work. For 300 pages even, even browsers don't supprt it, and there are no standards for implementation. Ideally, upon seeing the 300 response, a supporting browser would just show the user the list of options and then the user would pick and continue browsing. I believe that was the original intent. However, the code is so rarely used it isn't supported at all.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #85 (permalink)  
Old 03-05-2009, 06:18 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by wige View Post
I know, I am going out of order. Some people type too fast...

My first question (unless you wrote the mod, this is mostly rhetorical) would be, why does the mod_speling.c utility use a 301 redirect? Syntactically, that is not the proper way to handle it - it should be a 302 redirect if only a single match is found.
1. There are cases where you can find 3 matches. Go for example here: http://www.seoworkers.com/sitemap.php

2. Can you pass PR with 302 redirects?

Quote:
Originally Posted by wige View Post
Now, only two of these error codes are supposed to be considered permanent, 301 and 400. These are used when files are deleted or moved. All of the other codes are intended to indicate temporary situations. Well, 404 is considered both - the server is not sure if the condition is temporary or permanent. If it knew, it would respond with one of the other mentioned codes.

So what does this mean in practice? Lets say you have a page, contact.html. Now lets say someone requests contact.php. If you have the right software, your server should respond with a 302 message ("You requested contact.php. That is invalid. I FOUND contact.html. Go there.") or it could respond with a 300 message ("You requested contact.php. That is invalid. I found contact.html and contact.asp. CHOOSE one.")

In both situations, the server is guessing what the user is looking for.
Isn't that what we do already? Check my link above.

Quote:
Originally Posted by wige View Post
So, for the way search engines handle 300, I am not suprised that they index the page. There is no reason not to, since 1) the page doesn't exist, 2) it has links to pages that do exist, 3) it is subject to change, 4) in this case, it requires user action - the user has to decide which page they actually want.

Now, with all that said, in general use, 301, 302 and 404 are used much differently, and 300 and 400 are rarely used at all. The search engines have to cater to both the power users who actually know how this stuff works, and the casual webmaster who just has the basics down and just wants things to work. For 300 pages even, even browsers don't supprt it, and there are no standards for implementation. Ideally, upon seeing the 300 response, a supporting browser would just show the user the list of options and then the user would pick and continue browsing. I believe that was the original intent. However, the code is so rarely used it isn't supported at all.
Can you tell me which browser cannot deal with our Multiple Choices custom page?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #86 (permalink)  
Old 03-05-2009, 06:23 PM
crankydave's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Aug 2004
Location: Playing with fire!
Posts: 4,328
crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9
Default Re: New Canonical Tag from the big 3

I'd read Andy's thoughts on it John but thank you for reminding me.

Here's part of the problem as I see it...

Once you start "removing" PR from the system, you start deviating from the sum of all PR=1... PR preservation. A deviation that would increase with each successive iteration.

Any PR that is removed, the remaining PR becomes less and less and less with each successive iteration because of the dampening factor.

While it's certainly possible that's how it works, the amount of PR that would be transferred by a dangling link gets "lost", it would really surprise me if it did.

Additionally, we already know that "nofollow" links do not cause the amount of PR they would ordinarily pass to be lost. It doesn't make sense (to me) to treat a dangling link any differently.

Dave

Last edited by crankydave; 03-05-2009 at 06:27 PM.
Reply With Quote
  #87 (permalink)  
Old 03-05-2009, 06:27 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by crankydave View Post
Here's an exerpt from the original paper...

Quote:
"Dangling links are simply links that point to any page with no outgoing links. They affect the model because it is not clear where their weight should be distributed, and there are a large number of them. Often these dangling links are simply pages that we have not downloaded yet……….Because dangling links do not affect the ranking of any other page directly, we simply remove them from the system until all the PageRanks are calculated. After all the PageRanks are calculated they can be added back in without affecting things significantly." - extract from the original PageRank paper by Google’s founders, Sergey Brin and Lawrence Page.
PR preservation would seem to indicate that only the link would be removed and not the available PR that they would pass ie. links are removed, PR divvied up among the remaining links and calculated. We already know that "nofollow" works in this manner. It wouldn't make sense to have dangling links to work otherwise.

Dave
A few things about this.

First, no I am not rereading the patent, its too late in the day, so I am hoping I am fairly accurate in this. I can reread it tommorow and fact check this post then.

The first thing is that the quote in question relates to the recursion aspect of applying pagerank. Basically, pagerank constantly changes because after each crawl cycle, the system reapplies the pagerank to take into account the pages discovered in that crawl. During the recursive process, dead end pages represent an anomaly so they are removed, the recursion is processed, and then they are reinserted. But this is something that happens later, after Google decides how much of the pages rank gets shared

However, Google no longer does recursive processing when calculating pagerank. They now use a system called everflux. As a result, I think the entire passage is outdated.

Also, I believe it was stated that robots.txt excluded pages were one of very few special case that were not addressed in the original paper, and that they were one of a few sources of "pr leak". Not sure how I would find that document at this point though.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #88 (permalink)  
Old 03-05-2009, 06:35 PM
crankydave's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Aug 2004
Location: Playing with fire!
Posts: 4,328
crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9
Default Re: New Canonical Tag from the big 3

One point wige... I read it as the links are removed and not the pages.

If Google did indeed remove the PR from the calculations for every "dead link" they had in their system their PR calculation wouldn't even be close.

Dave
Reply With Quote
  #89 (permalink)  
Old 03-05-2009, 06:43 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
1. There are cases where you can find 3 matches. Go for example here: http://www.seoworkers.com/sitemap.php

2. Can you pass PR with 302 redirects?
No. But I am looking more at the technical end than the SEO end. Using a 301 redirect, you are telling clients that all requests for that URL should always be considered requests for the new URL. Worse, it is an automated process that is guessing and doing this. What happens down the road when you see in your logs a lot of requests for some URL, so you create a page there? Because you have been giving 301s, most bots including search engines will take longer to actually check the URL again.


Quote:
Originally Posted by Webnauts View Post
Isn't that what we do already? Check my link above.
Mostly, yes. But you said above that the search engines shouldn't index that page. However, there are a few reasons why it should - the matches discovered could change as pages are added/removed/altered, your server can't figure out which is the best match so the page rank might as well get divided up anyway, and if the indexing of the page means that for some search term that page ends up showing up in the SERPs, why not let users have that option of selecting which page is the best match - at that point, neither your server nor Google could decide.

Quote:
Originally Posted by Webnauts View Post
Can you tell me which browser cannot deal with our Multiple Choices custom page?
Name one that can handle 300 as gracefully as a 301 or 302.

Deal with, all of them. However, "supporting" the code would mean simply presenting the user with a list of pages to choose from - no displaying or rendering of a web page, no requesting linked files, etc. I don't think "defaulting to pretending the 300 was a 200" is the same thing as actually fully supporting the code.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #90 (permalink)  
Old 03-05-2009, 06:49 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by crankydave View Post
One point wige... I read it as the links are removed and not the pages.

If Google did indeed remove the PR from the calculations for every "dead link" they had in their system their PR calculation wouldn't even be close.

Dave
Don't suppose there's any chance you have a link to the original paper do you?

I skimmed the patent, and couldn't find it. I seem to remember this as pertaining to another part of the calculation, I thought it was the recursion, it might have been the normalization. I don't think it was the raw distribution, it had to do with the averaging and recalculating of the pagerank values, something that happened later than the initial calculation I was trying to describe in my post.

Edit: Never mind, I found it. This passage has to do with the random surfer idea. If a page has no links, it becomes a "sink". This is resolved by taking the pagerank that has been received by that page, and dividing it among every page in the index - because each page has an equal chance of being visited by the random surfer. However, this represents an anomoly when recalculating pagerank. At that time, the dead ends are removed, the index is normalized, and the pages are replaced. Bah, I'll read it closer tommorrow.
__________________
The best way to learn anything, is to question everything.

Last edited by wige; 03-05-2009 at 06:58 PM.
Reply With Quote
  #91 (permalink)  
Old 03-05-2009, 06:57 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Eric Enge interview with Matt Cutts:
Quote:
Matt Cutts: … So, with robots.txt for good reasons we've shown the reference even if we can't crawl it, whereas if we crawl a page and find a Meta tag that says NoIndex, we won't even return that page. For better or for worse that's the decision that we've made. I believe Yahoo and Microsoft might handle NoIndex slightly differently which is little unfortunate, but everybody gets to choose how they want to handle different tags.
Look at the HTTP Headers of this page: http://www.seoworkers.com/mailing-li...3&bots=nocrawl
Code:
GET /mailing-list/?p=unsubscribe&id=3&bots=nocrawl HTTP/1.1
Host: www.seoworkers.com
User-Agent: Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-us,en;q=0.5
Accept-Encoding: gzip,deflate
Accept-Charset: UTF-8,*
Keep-Alive: 300
Connection: keep-alive
Referer: http://www.seoworkers.com/
Cookie: __utma=215523349.1795271455.1236145108.1236285263.1236290809.17; __utmz=215523349.1236191584.8.6.utmccn=(referral)|utmcsr=seowatchblog.com|utmcct=/|utmcmd=referral; _csuid=49ae09512794c5b5; __utmc=215523349; PHPSESSID=e95s1j11u2ktchf65pcai8t075; __utmb=215523349
X-lori-time-1: 1236292418383

HTTP/1.x 200 OK
Date: Thu, 05 Mar 2009 22:33:47 GMT
Server: Apache
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Content-Encoding: gzip
Vary: Accept-Encoding
P3P: policyref="/w3c/p3p.xml"
Pics-label: (pics-1.1 "http://www.icra.org/pics/vocabularyv03/" l gen true for "http://seoworkers.com" r (n 0 s 0 v 0 l 0 oa 0 ob 0 oc 0 od 0 oe 0 of 0 og 0 oh 0 c 3)  gen true for "http://www.seoworkers.com" r (n 0 s 0 v 0 l 0 oa 0 ob 0 oc 0 od 0 oe 0 of 0 og 0 oh 0 c 3))
Imagetoolbar: no
X-Robots-Tag: noindex,nofollow,noarchive,nosnippet
Content-Length: 1372
Connection: close
Content-Type: text/html
Content-Language: en-us
As you see the page shall not be indexed since we implemented for that page the X-Robots directives "noindex,nofollow,noarchive,nosnippet". In other words we have the same results as we would have if we have implemented the robots meta tags directives "noindex,follow,noarchive,nosnippet".

So when Google crawls that page and finds the X-Robots directive "NoIndex", so they won't even return that page.

In addition, the link to that page found on all our pages have that "bots=nocrawl" in the url, which in the robots.txt we have:

User-agent: Googlebot
Disallow: *bots=nocrawl$
Noindex: *bots=nocrawl$

So what is happening in this case. Is that link still accumulating PR?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by rah; 03-06-2009 at 12:43 PM. Reason: Requested by webnauts
Reply With Quote
  #92 (permalink)  
Old 03-05-2009, 06:59 PM
crankydave's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Aug 2004
Location: Playing with fire!
Posts: 4,328
crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9crankydave RepRank 9
Default Re: New Canonical Tag from the big 3

Here's the PDF wige...

http://ilpubs.stanford.edu:8090/422/1/1999-66.pdf

Dave
Reply With Quote
  #93 (permalink)  
Old 03-05-2009, 07:02 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
Is that link still accumulating PR?
Does it matter? My thinking is that whether or not the PR gets credited to the blocked page, that page's portion of the linking page's PR is not being distributed to the other pages linked to.

I reread that, and I know its time for me to go home. See you all tommorrow.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #94 (permalink)  
Old 03-05-2009, 07:02 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

POST CONTINUED...

Quote:
Eric Enge: Can a NoIndex page accumulate PageRank?

Matt Cutts: A NoIndex page can accumulate PageRank, because the links are still followed outwards from a NoIndex page.

Eric Enge: So, it can accumulate and pass PageRank.

Matt Cutts: Right, and it will still accumulate PageRank, but it won't be showing in our Index. So, I wouldn't make a NoIndex page that itself is a dead end. You can make a NoIndex page that has links to lots of other pages. For example you might want to have a master Sitemap page and for whatever reason NoIndex that, but then have links to all your sub Sitemaps.
That is not the case because we have set for that the X-Robots "Nofollow" directive too, as I mentioned above.

Whats next?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #95 (permalink)  
Old 03-05-2009, 10:17 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
As you see the page shall not be indexed since we implemented for that page the X-Robots directives "noindex,follow,noarchive,nosnippet". In other words we have the same results as we would have if we have implemented the robots meta tags directives "noindex,follow,noarchive,nosnippet".

So when Google crawls that page and finds the X-Robots directive "NoIndex", so they won't even return that page.

In addition, the link to that page found on all our pages have that "bots=nocrawl" in the url, which in the robots.txt we have:

User-agent: Googlebot
Disallow: *bots=nocrawl$
Noindex: *bots=nocrawl$

So what is happening in this case. Is that link still accumulating PR?
I have mistaken:

As you see the page shall not be indexed since we implemented for that page the X-Robots directives "noindex,nofollow,noarchive,nosnippet".
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #96 (permalink)  
Old 03-05-2009, 10:53 PM
Narasinha's Avatar
WebProWorld Pro
 
Join Date: Aug 2003
Location: Urbana, Illinois, US
Posts: 232
Narasinha RepRank 1
Default Re: New Canonical Tag from the big 3

Hi wige, long time no see.

Quote:
Originally Posted by wige View Post
My first question (unless you wrote the mod, this is mostly rhetorical) would be, why does the mod_speling.c utility use a 301 redirect? Syntactically, that is not the proper way to handle it - it should be a 302 redirect if only a single match is found.
mod_speling, from what I can tell, only has one option other than being turned on or off, and that is whether or not to check capitalization only. This is controlled through .htaccess when the mod is available. From what I can tell, the module made its appearance in June of 1996.

The conditions that mod_speling finds correspond to one of the following: "identical", "miscapitalized", "transposed characters", "character missing", "extra character", "mistyped character", "common basename". When redirection is called for, HTTP_MOVED_PERMANENTLY is the only redirection used, and it is hard-coded in mod_speling.c. From the source code: "Conditions for immediate redirection: a) the first candidate was not found by stripping the suffix AND b) there exists only one candidate OR the best match is not ambiguous then return a redirection right away."

Note that the "identical" case does not redirect to the found URL. Why not? This is what it says in comments in the source: "If we end up with a "fixed" URL which is identical to the requested one, we must have found a broken symlink or some such. Do _not_ try to redirect this, it causes a loop!"

I agree that a 302 redirect might be more appropriate than 301 for some results. However, the 300 response is just as temporary as a 404: maybe not at all. If someone were to compile their Apache server from the source code, they could change the response to 302.

Quote:
Originally Posted by wige View Post
So, for the way search engines handle 300, I am not suprised that they index the page. There is no reason not to, since 1) the page doesn't exist, 2) it has links to pages that do exist, 3) it is subject to change, 4) in this case, it requires user action - the user has to decide which page they actually want.
I'm perplexed. Why index a page that does not exist, particularly when it requires decision-making on the part of the user? I see the 300 response as a "somewhat kinder" 404 response.

(Hey, I just thought of a tool that I want! Maybe someone already makes it. For a given web page, display a list of the response codes given by each link on said page.)

Quote:
Originally Posted by wige View Post
For 300 pages even, even browsers don't support it, and there are no standards for implementation. Ideally, upon seeing the 300 response, a supporting browser would just show the user the list of options and then the user would pick and continue browsing. I believe that was the original intent. However, the code is so rarely used it isn't supported at all.
It's done pretty much the way a 404 response is, if the conditions for immediate redirect are not met. You get a simple HTML 2.0 page telling you that the page wasn't found, matches were found, here's a list of the matches. Browsers, of course, can handle the 300 response in any way they're designed to, but to my knowledge, none have ever done anything besides simply display the HTML result from the server. I guess you could say that it's as well-supported by the browsers as a 404 response is.

Last edited by Narasinha; 03-05-2009 at 10:59 PM. Reason: Clarification on 302
Reply With Quote
  #97 (permalink)  
Old 03-06-2009, 11:41 AM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
POST CONTINUED...



That is not the case because we have set for that the X-Robots "Nofollow" directive too, as I mentioned above.

Whats next?
Well, that quote exactly supports my point. Even if a page is not indexed, it still collects pagerank. Even if it is a dead end (no outgoing links, or blocked by robots.txt) that page still gets pagerank. This is spelled out in the original Stanford paper (section 2.5 and 2.6) where the pagerank of a dead end gets distributed to every page in the entire index. (Random Surfer model)

Some dead end pages have no benefit - 302 redirectors, robots.txt blocked pages, etc. - to the webmaster. In these cases, a nofollow on the linking page is the only way you can use that pagerank in a way that benefits your site.

I think the misunderstanding is on this point: if a page accumulates pagerank but has a nofollow tag, thus becoming a dead end, the pagerank does not go back to the pages that linked to it. It goes forward and is divided among every page in the index.

I think this is where the quote crankydave posted comes into play. Lets assume Google's index is composed of three pages, A B and C. A contains no links, but B and C both link to A. A is thus a dead end (same as if it were not indexable because of robots.txt). When Google calculates PR, at the first iteration, all three pages have a pagerank of .3333 since they are each 1/3 of the known web. At the next iteration, Google takes into account the links. The .333 that B and C have all gets sent to A, since both pages only link there. But how does A's outgoing PR get calculated? A's rank of .333 needs to be given to A, B and C - every page in the index, including itself. But if it gives itself pagerank, it has more pagerank to give, so it should be giving more pagerank... This becomes circular logic, and would make for an endless loop. The solution is that the dead end is removed from the equation as the pagerank is calculated (and page A gets a rank of ~ .900 after dampening and other factors). Now, the pagerank of A can be safely distributed to all of the other pages in the index.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #98 (permalink)  
Old 03-06-2009, 12:03 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Narasinha View Post
Hi wige, long time no see.



mod_speling, from what I can tell, only has one option other than being turned on or off, and that is whether or not to check capitalization only. This is controlled through .htaccess when the mod is available. From what I can tell, the module made its appearance in June of 1996.
Likewise. I tend to hide from these discussions. And I really tried this time

I realize mod_speling does not have the option to change the way it redirects, my question was more theoretical. Clients are supposed to remember 301 redirects. If they are told to go to a URL they no is permanently redirected somewhere else, they should not even bother requesting the original URL. This is something that search engines strongly adhere to. Is it really appropriate that an internal search function is telling clients never to request that URL? What if you at some point decide to create a landing page to capitalize on the misspelling? It will take a long time to convince the search engines to return and index that new landing page.

Personally, I think an automated search function should only be giving temporary responses (300, 302 and 404 are not supposed to be cached. 301 and 400 are).

Quote:
Originally Posted by Narasinha View Post
I'm perplexed. Why index a page that does not exist, particularly when it requires decision-making on the part of the user? I see the 300 response as a "somewhat kinder" 404 response.
What would you prefer? Someone does a search, and Google can't decide what the best result is. Would you prefer Google guess and possibly give the user an inappropriate link, or worse, a competitor's link instead, or that Google presents the user with your multiple choice selection page? Personally, I prefer Google index as much of my stuff as possible that it might display instead of my competitor's stuff.

Quote:
Originally Posted by Narasinha View Post
(Hey, I just thought of a tool that I want! Maybe someone already makes it. For a given web page, display a list of the response codes given by each link on said page.)
I believe there are several Firefox plugins that do this. I use a tool that shows what redirections I pass through when I follow links, but never bothered with one that checks the links on pages. It requires extra bandwidth because most don't only retreive the page headers.

Quote:
Originally Posted by Narasinha View Post
It's done pretty much the way a 404 response is, if the conditions for immediate redirect are not met. You get a simple HTML 2.0 page telling you that the page wasn't found, matches were found, here's a list of the matches. Browsers, of course, can handle the 300 response in any way they're designed to, but to my knowledge, none have ever done anything besides simply display the HTML result from the server. I guess you could say that it's as well-supported by the browsers as a 404 response is.
This is something I would like to see, proper handling of response codes by browsers. I know it will never happen, but oh well. I believe the proper handling would be as follows:

300: Browser only processes the headers. No body content is parsed or displayed, and is only served by the server if the browser doesn't support 300s. The user just sees a dialog box prompting to select the appropriate page. This allows the server to cut down on the overhead of dynamically generating the disambiguation page.
301: The browser should actually cache the redirection (for how long, who knows) the way search engines already do. Any time a link to a known 301 is followed, or the URL of a 301 page is manually entered, the browser should automatically request the destination page. This would cut down the bandwidth of repeat requests for moved documents. Only if the destination gets a 300, 301, 302, 400 or 404 should the original URL be rechecked.
302: Browsers and search engines handle this fine.
400: Again, this should be cached. Future requests for a URL that is GONE should display either a browser error message, or the cached error page. This would again cut down on repeat requests for removed content.
404: This is handled correctly.

It's nice to dream, you know?
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #99 (permalink)  
Old 03-06-2009, 12:46 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,463
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: New Canonical Tag from the big 3

Wige lets take this scenario:

I block the bots to crawl a link with the above mentioned method i.e bots=nocrawl.

Then I add in the targeted page the canonical element.
For example I have this link:
Code:
http://www.seoworkers.com/mailing-list/?p=unsubscribe&id=3&bots=nocrawl
On the destination page I add this:
Code:
<link rel="Canonical" href="http://www.seoworkers.com" />
What happens in that case?

Or if I send the bots to another page than I send the users, which page has robots meta tag directives "noindex,follow,noarchive,nosnippet" and on that page I have the links of my sitemap.

Would that be an alternative?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #100 (permalink)  
Old 03-06-2009, 05:56 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,822
wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10wige RepRank 10
Default Re: New Canonical Tag from the big 3

Quote:
Originally Posted by Webnauts View Post
Wige lets take this scenario:

I block the bots to crawl a link with the above mentioned method i.e bots=nocrawl.

Then I add in the targeted page the canonical element.
For example I have this link:
Code:
http://www.seoworkers.com/mailing-list/?p=unsubscribe&id=3&bots=nocrawl
On the destination page I add this:
Code:
<link rel="Canonical" href="http://www.seoworkers.com" />
What happens in that case?

Or if I send the bots to another page than I send the users, which page has robots meta tag directives "noindex,follow,noarchive,nosnippet" and on that page I have the links of my sitemap.

Would that be an alternative?
I am not sure exactly what it would be an alternative to. All the canonical element does is tell Google "If this page has the same content as <url>, consider both copies to be the same page". Google already tries to merge duplicated pages in it's index, to combine the pagerank of these pages and have a single page to display in the results. This tag is simply geared to give the webmaster an additional tool to explicitly indicate what pages might be a variation of an original. Nothing more.

If a page is marked as nofollow, or noindex, I don't see any reason why Google would care if it has the canonical tag or not - it doesn't pass pagerank or get displayed in the regular index, so why would Google merge it with an existing page that doesn't have these limitations?
__________________
The best way to learn anything, is to question everything.
Reply With Quote
Reply

  WebProWorld > Search Engines > Search Engine Optimization Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Canonical issue in MSN greg artim MSN Search Discussion Forum 1 04-06-2008 10:06 AM
Canonical Issue Techyolk Google Discussion Forum 1 09-13-2007 12:46 PM
More Canonical Stuff Psychobel Google Discussion Forum 1 05-17-2006 10:56 AM
Canonical issues...again crankydave Google Discussion Forum 1 02-03-2006 06:23 PM
Canonical Hostnames Mod Rewrite bjbtexas Web Programming Discussion Forum 2 04-22-2005 05:55 PM


All times are GMT -4. The time now is 03:52 PM.



Search Engine Optimization by vBSEO 3.3.0