iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Search Engine Optimization Forum SEO is much easier with help from peers and experts! The WebProWorld SEO forum is for the discussion and exploration of various search engine optimization topics. Any non (engine) specific SEO or SEM topics should go here.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 09-24-2008, 04:11 PM
cz's Avatar
cz cz is offline
WebProWorld Veteran
 
Join Date: Mar 2004
Posts: 443
cz RepRank 3cz RepRank 3cz RepRank 3
Default Page Can't Be Spidered??

Hi,

I have been checking GG Analytics and used a couple of utilities and they are saying that this page can't be accessed due to "Robot Text". I never put any of that in the page and the rest of the items within the same section are fine.

Can someone take a look at this page and clue me in about how to find this robot text that is blocking the page from GG and maybe other engines?

CN, CS, Tear Gas, Information, Effects, Reports this is the page that Analytics is flagging as being inaccessible.

Thanks!
Reply With Quote
  #2 (permalink)  
Old 09-24-2008, 06:06 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,661
wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9
Default Re: Page Can't Be Spidered??

In robots.txt you have the following directive:
Disallow: /cs
The page URL is:
/csteargasfaq.html

You also have a few issues in the code structure that may cause issues with some browsers and spiders (javascript between <html> and <head>, for example) that you may want to check and correct.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #3 (permalink)  
Old 09-24-2008, 06:20 PM
cz's Avatar
cz cz is offline
WebProWorld Veteran
 
Join Date: Mar 2004
Posts: 443
cz RepRank 3cz RepRank 3cz RepRank 3
Default Re: Page Can't Be Spidered??

Thanks wige,

I never even knew there was robots.txt in my site? The site is a Yahoo! store as you probably noticed. They insert code in the craziest places trying to implement features and there's nothing that I can access to correct it. No Yahoo! Store that I'm aware of unless created and uploaded to the hosting side of their platform, will even come anywhere close to validating.

I've spent hours trying to correct errors found by utilities - only to discover that I have no access to change 98% of the errors - that are built in to their product.

I'll try to find that but if I did for instance, I'd be afraid of deleting it or my site might go "haywire"!

Thanks again!
Reply With Quote
  #4 (permalink)  
Old 09-25-2008, 06:51 PM
SemAdvance's Avatar
WebProWorld Veteran
WebProWorld MVP
 
Join Date: Dec 2005
Location: In Your Mind
Posts: 792
SemAdvance RepRank 4SemAdvance RepRank 4SemAdvance RepRank 4
Default Re: Page Can't Be Spidered??

I would not remove anything from your robots.txt as it is used to operate your site properly.

If you look at the file there are a few other directories as well which are blocked.

You might want to prepare to migrate to a new host and cart in the event Yahoo goes bye bye and you are forced to move out.

Peace!
Reply With Quote
  #5 (permalink)  
Old 09-25-2008, 08:16 PM
cz's Avatar
cz cz is offline
WebProWorld Veteran
 
Join Date: Mar 2004
Posts: 443
cz RepRank 3cz RepRank 3cz RepRank 3
Default Re: Page Can't Be Spidered??

Since that page can't be spidered, I could copy the material to a page that has no "domain/cs" and that would open it up again? Somehow it has managed to have PR1.

SemAdavance, is there something I should know about?

There are solutions like storehost that will transfer a Y! store over in one day.

Thanks!
Reply With Quote
  #6 (permalink)  
Old 09-26-2008, 11:51 AM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,661
wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9
Default Re: Page Can't Be Spidered??

If you have control over your robots.txt file, you might want to change the directories from /cz to /cz/, for example. Otherwise, if you are able to do redirects you could do a 301 redirect to a new URL. The only problem is that you will lose any benefit of links to the current page, since the search engines can't crawl the redirecting page.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #7 (permalink)  
Old 09-26-2008, 01:52 PM
dharrison's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Essex, UK
Posts: 1,289
dharrison RepRank 4dharrison RepRank 4dharrison RepRank 4
Default Re: Page Can't Be Spidered??

Hi cz

Make a backup of the original robots.txt, then if it does go haywire then you can easily revert back to the original.

I don't reckon it will go haywire though. I find it hard to believe that even Yahoo are that thick that they create an unspiderable page (?)
__________________
Deb Harrison
DVH Design
Essex Web Design
Reply With Quote
  #8 (permalink)  
Old 10-05-2008, 11:33 AM
WebProWorld New Member
 
Join Date: Mar 2006
Posts: 17
tnt7 RepRank 0
Default Re: Page Can't Be Spidered??

I Hate to be the bearer of bad news, but Yahoo! Store owners have no control over the robot.txt file.
Reply With Quote
  #9 (permalink)  
Old 10-05-2008, 05:58 PM
cz's Avatar
cz cz is offline
WebProWorld Veteran
 
Join Date: Mar 2004
Posts: 443
cz RepRank 3cz RepRank 3cz RepRank 3
Default Re: Page Can't Be Spidered??

tnt7

what are your recommendations? the page, domain.com/cs... has PR (only PR1) do you think forget about it or change the page url and dump the old one? It just keeps sitting there in GG analytics like I should do somehting about it. Funny thing is it never showed up that way in all of the link checkers or in analytics until a few weeks ago?? Now it shows up in different site shceking tools as robot txt disallowed??

Thanks
Reply With Quote
  #10 (permalink)  
Old 10-09-2008, 09:36 PM
WebProWorld New Member
 
Join Date: Mar 2006
Posts: 17
tnt7 RepRank 0
Default Re: Page Can't Be Spidered??

Quote:
Originally Posted by cz View Post
tnt7

what are your recommendations? the page, domain.com/cs... has PR (only PR1) do you think forget about it or change the page url and dump the old one? It just keeps sitting there in GG analytics like I should do somehting about it. Funny thing is it never showed up that way in all of the link checkers or in analytics until a few weeks ago?? Now it shows up in different site shceking tools as robot txt disallowed??

Thanks
I personally would do one of two things:
1. Call Yahoo! and complain and see if they fix it - Wait.
2. Change Url - You should get pr 1 back with your internal linking, and still call Yahoo! and complain.

They probably changed the global htacess with one of their recent updates, and that is why it has just become an issue.

They really have no excuse for this issue. I Wonder how many pages in the thousands of stores that are dissallowed??

Good Luck
Reply With Quote
  #11 (permalink)  
Old 10-09-2008, 10:38 PM
cz's Avatar
cz cz is offline
WebProWorld Veteran
 
Join Date: Mar 2004
Posts: 443
cz RepRank 3cz RepRank 3cz RepRank 3
Default Re: Page Can't Be Spidered??

Quote:
I Wonder how many pages in the thousands of stores that are dissallowed??
Thanks for the advice, I think I'll do just that. That's a very interesting thought you mentioned as well!

Thanks
Reply With Quote
Reply

  WebProWorld > Search Engines > Search Engine Optimization Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Need code to NOT pass page rank or let a link get spidered. MeanSEO Search Engine Optimization Forum 16 07-26-2006 08:44 PM
2 domains - one spidered, second - only 1 page; sandis.viksna Yahoo! Discussion Forum 0 12-26-2005 08:12 PM
Why I am not being spidered??? pcm535 Search Engine Optimization Forum 4 11-29-2005 04:52 AM
Amount of words spidered per page? ergobob Search Engine Optimization Forum 2 08-13-2004 08:37 PM
only first page is spidered ireneherz Google Discussion Forum 3 04-09-2004 04:17 PM


All times are GMT -4. The time now is 02:52 AM.



Search Engine Optimization by vBSEO 3.3.0