iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Search Engine Optimization Forum SEO is much easier with help from peers and experts! The WebProWorld SEO forum is for the discussion and exploration of various search engine optimization topics. Any non (engine) specific SEO or SEM topics should go here.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 10-12-2004, 05:55 AM
mantawebsolutions's Avatar
WebProWorld Veteran
 
Join Date: Jun 2004
Location: Pretoria, South Africa
Posts: 307
mantawebsolutions RepRank 0
Default robots file for spiders

I came accross this URL (http://www.searchengineworld.com/cgi-bin/sim_spider.cgi) to test what spiders see when accessing a site, so i tested my URL

The first part of the results came back fine but further down i saw something that worries me.

It picked up the URL's fine but for most it showed two variants
example:
http://www.divesouthafrica.co.za/aboutsouthafrica.html
and
http://aboutsouthafrica.html

why does it pick up the url without the domain ???
If this is a problem in my robots.txt file and, if so, how do i correct it ?
Reply With Quote
  #2 (permalink)  
Old 10-12-2004, 06:31 AM
WebProWorld 1,000+ Club
 
Join Date: Sep 2003
Location: Texas
Posts: 1,156
flood6 RepRank 0
Default Spider Flaw

I'd say it's a design flaw in the spider; it doesn't seem to like relative links. I tried one of my sites with a "base href" tag and got the same problems with relative links.

I imagine you could search webmasterworld (searchengine world's sister-site) and maybe find some discussion about it.

I don't think you have anything to worry about.
Reply With Quote
  #3 (permalink)  
Old 10-12-2004, 06:55 AM
mantawebsolutions's Avatar
WebProWorld Veteran
 
Join Date: Jun 2004
Location: Pretoria, South Africa
Posts: 307
mantawebsolutions RepRank 0
Default

thanks flood, much appreciated
Reply With Quote
  #4 (permalink)  
Old 10-19-2004, 10:13 PM
WebProWorld New Member
 
Join Date: Oct 2004
Location: Moorpark, California
Posts: 15
ebonage RepRank 0
Default Re: Spider Flaw

Quote:
Originally Posted by flood6
I'd say it's a design flaw in the spider; it doesn't seem to like relative links. I tried one of my sites with a "base href" tag and got the same problems with relative links.

I imagine you could search webmasterworld (searchengine world's sister-site) and maybe find some discussion about it.

I don't think you have anything to worry about.
Cool! Thanks for the tip! Here is another site that covers robots.txt configurating robots.txt that site also references searchengineworld.com.

Regards,
ebonage
Reply With Quote
Reply

  WebProWorld > Search Engines > Search Engine Optimization Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



All times are GMT -4. The time now is 10:28 PM.



Search Engine Optimization by vBSEO 3.3.0