Quote:
|
Originally Posted by emptymirror
I just discovered something odd (at least I think it's odd).
My robots.txt file disallows a page, bio.html
(Actually, most of the pages on my site are disallowed - it's mostly a portfolio site. But that's beside the point.)
A Google site: search finds 5 pages from my website indexed, but not bio.html
Yet, bio.html has a PR2
Bio.html has been off-limits to the search engines for a long time now. Why would it have a PR? Does that mean that Google is still crawling the page?
If anyone can enlighten me, I'd be most grateful!
Thanks!
Denise
|
PR is stored in a different database than the page information. If there any links pointing to the bio.html that Google is aware of, either external or from the pages of your site that are indexed,
PR will be assigned to the target URL regardless of whether or not the page information gets crawled.
Dave