WebProWorld Part of WebProNews.com
Yahoo! Discussion Forum Yahoo Search discussion. Any topic or subject specific to Yahoo should go here. You will also find a subforum dedicated to YPN & Panama.

#1 (permalink)
08-17-2007, 02:28 PM
wige
WebProWorld Moderator
Join Date: Jun 2006
Location: United States
Posts: 1,764
RepRank: 4

Yahoo Crawl Errors

I'll admit, I don't go through my raw web site error logs as often as I should. I have the server set up to show me only the errors caused by users, because spambots and HackerSafe generate a lot of useless errors that don't affect the site. That condensed report is what I monitor.

However, I was doing an experiment in another thread to see if Google would try to crawl a link, and as I was reading through the raw data, I realized that there were hundreds of errors caused by Yahoo looking for nonexistent files.

The problem I am seeing is as follows:
Suppose I have a page at http://www.mysite.com/section/subsection/page.php. Most search engines will also check http://www.mysite.com/section/ and http://www.mysite.com/section/subsection/ to look for index files that may have been missed. Yahoo, however, is looking for http://www.mysite.com/section and http://www.mysite.com/section/subsection (without the trailing slash). This generates a huge number of errors, and I want to find out if there is a way to stop Yahoo from doing this. I can't even see a reason why the spider would be looking for files structured like this, because they are unlikely to exist.
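
To make the two request patterns concrete, here is a minimal Python sketch that derives both sets of directory URLs from the example page above. The function name `parent_directory_urls` is made up for illustration, and this only mimics the pattern described in the post; it is not a claim about how any crawler actually builds its URLs.

```python
from urllib.parse import urlsplit, urlunsplit

def parent_directory_urls(page_url, trailing_slash=True):
    """Return the URL of each directory above page_url, shallowest first.

    Hypothetical helper for illustration only.
    """
    parts = urlsplit(page_url)
    segments = [s for s in parts.path.split("/") if s]
    urls = []
    # Skip the last segment (the file itself) and walk the directories above it.
    for depth in range(1, len(segments)):
        path = "/" + "/".join(segments[:depth])
        if trailing_slash:
            path += "/"
        urls.append(urlunsplit((parts.scheme, parts.netloc, path, "", "")))
    return urls

page = "http://www.mysite.com/section/subsection/page.php"

# What most engines appear to request: directory URLs ending in a slash.
print(parent_directory_urls(page, trailing_slash=True))

# What Yahoo appears to request: the same paths without the trailing slash.
print(parent_directory_urls(page, trailing_slash=False))
```

On a site where those paths are not real directories on disk, the server has no file to match the slashless form, which would explain why each such request ends up in the error log.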

Has anyone else encountered this issue? More important, does anyone have any suggested fixes?

I have considered that, because my site is dynamic, there might be bad links somewhere. But this happens in every single directory and subdirectory on the site, and Yahoo is the only search engine bot acting this way. Yahoo also seems to have a lower crawl rate than any of the other engines, so I would expect Google or MSN to have picked up a bad link first.
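
One way to check whether Yahoo really is the only bot doing this is to tally the slashless errors in the raw log by user agent. The sketch below assumes the log is in Apache's "combined" format and that the errors are recorded as 404s; neither is stated in the post. The function name and the sample log lines are invented for illustration.

```python
import re
from collections import Counter

# Apache "combined" log format (an assumption about the server's log layout).
LOG_LINE = re.compile(
    r'^\S+ \S+ \S+ \[[^\]]+\] "(?:GET|HEAD) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

def slashless_404s_by_agent(lines):
    """Count 404s per user agent for paths that look like a directory
    requested without its trailing slash (no slash, no file extension)."""
    counts = Counter()
    for line in lines:
        m = LOG_LINE.match(line)
        if not m or m.group("status") != "404":
            continue
        path = m.group("path").split("?", 1)[0]  # ignore any query string
        last = path.rsplit("/", 1)[-1]
        if last and "." not in last:
            counts[m.group("agent")] += 1
    return counts

# Made-up sample lines for illustration only; not taken from a real log.
YAHOO = "Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)"
GOOGLE = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
sample = [
    f'192.0.2.10 - - [17/Aug/2007:14:00:01 -0400] "GET /section HTTP/1.0" 404 208 "-" "{YAHOO}"',
    f'192.0.2.10 - - [17/Aug/2007:14:00:05 -0400] "GET /section/subsection HTTP/1.0" 404 208 "-" "{YAHOO}"',
    f'192.0.2.10 - - [17/Aug/2007:14:00:09 -0400] "GET /missing.php HTTP/1.0" 404 208 "-" "{YAHOO}"',
    f'192.0.2.20 - - [17/Aug/2007:14:01:00 -0400] "GET /section/subsection/ HTTP/1.1" 200 5120 "-" "{GOOGLE}"',
]

for agent, count in slashless_404s_by_agent(sample).most_common():
    print(count, agent)
```

If a real log showed the same paths failing for several different agents, a bad link somewhere on the site would become the more likely explanation.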
__________________
The best way to learn anything, is to question everything.
Interestingly Average Security Blog
#2 (permalink)
08-17-2007, 05:26 PM
cyberkid
WebProWorld Member
Join Date: Jan 2007
Posts: 30
RepRank: 0

Re: Yahoo Crawl Errors

I've had the same or a similar experience. I use Yahoo Site Explorer, which shows the pages Yahoo has crawled. That's all well and good, except that even though I now submit a site map (before, I didn't), Yahoo continues to index an HTML site that no longer exists. You might want to try addressing your question at the Yahoo! Site Explorer Suggestion Board.