WebProWorld Part of WebProNews.com
Page One Link To Us Edit Profile Private Messages Archives FAQ RSS Feeds  
 

Go Back   WebProWorld > Search Engines > MSN Search Discussion Forum
Subscribe to the Newsletter FREE!


Register FAQ Members List Calendar Arcade Chatbox Mark Forums Read

MSN Search Discussion Forum Topics and discussions specific to MSN and/or Live Search. There is also a subforum for the discussion of Microsoft AdCenter.

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 12-06-2007, 09:04 PM
WebProWorld Member
 

Join Date: Apr 2004
Location: N.E.
Posts: 39
memaggiem RepRank 0
Default LIVE bot looking for my pages in the images folder(?)

Just noticed that an IP that resolves to a MSN Live bot is going through my site and looking for pages in the /images/ folder! Anyone know the whats or whys of this? Looking through the day's log, I see this has been going on all day!

Thanks!
Reply With Quote
  #2 (permalink)  
Old 12-07-2007, 01:50 PM
incrediblehelp's Avatar
Moderator
WebProWorld Moderator
 

Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,657
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default Re: LIVE bot looking for my pages in the images folder(?)

Why wouldnt MSN look for images?
Reply With Quote
  #3 (permalink)  
Old 12-07-2007, 01:56 PM
WebProWorld Member
 

Join Date: Apr 2004
Location: N.E.
Posts: 39
memaggiem RepRank 0
Default Re: LIVE bot looking for my pages in the images folder(?)

Looking for images within the images folder is the norm, yes I am aware of that

However, the logs showed the bot looking for ALL my html pages within the /images/ directory and NOT with the root directory.

Therefore, my logs were filled with "file not found" images/images/index.htm etc and so forth.

The access logs did not show the bot accessing or indexing the pages at all, only the /images/gif or images/jpg files.........

Odd?
Reply With Quote
  #4 (permalink)  
Old 12-11-2007, 04:31 PM
WebProWorld Veteran
 

Join Date: Dec 2005
Location: In Your Mind
Posts: 614
SemAdvance RepRank 1
Default Re: LIVE bot looking for my pages in the images folder(?)

Quote:
Originally Posted by memaggiem View Post
Looking for images within the images folder is the norm, yes I am aware of that

However, the logs showed the bot looking for ALL my html pages within the /images/ directory and NOT with the root directory.

Therefore, my logs were filled with "file not found" images/images/index.htm etc and so forth.

The access logs did not show the bot accessing or indexing the pages at all, only the /images/gif or images/jpg files.........

Odd?
Without knowing the site involved at best you will receive speculation.

Your robots.txt could be causing errors, your websites pages i.e. html coding may be causing errors hard to say or as is often the case MSN is still full of bugs as are most Microsoft products...
__________________
Payday Loans
Costumes
Reply With Quote
  #5 (permalink)  
Old 12-17-2007, 09:30 PM
WebProWorld Member
 

Join Date: Apr 2004
Location: N.E.
Posts: 39
memaggiem RepRank 0
Default Re: LIVE bot looking for my pages in the images folder(?)

Thanks for your reply! I didn't know you had replied!

My site has just vanished from MSN but I was able to find it with a search term. The result is this:

MAKE HOMEMADE BABY FOOD RECIPES, HOMEMADE BABY FOOD RECIPES, EASY ...
Baby Food Recipes - Make Homemade Baby Food - Recipes for Healthy Homemade Baby Food with Tips for Making and Feeding Baby Homemade Baby Food. Step-By-Step instructions ... ...

MAKE HOMEMADE BABY FOOD RECIPES, HOMEMADE BABY FOOD RECIPES, EASY SOLID BABY FOOD TIPS, BABY NUTRITION and MORE at wholesomebabyfood.com | Making Baby Food with Wholesome Goodness & Love · 12/15/2007 · Cached page

**Notice that it has my images folder in the result!** WTH is going on with that?

I have not change the robots.txt in over 1 year nor have I changed any meta-tag info

thanks !!!!
Reply With Quote
  #6 (permalink)  
Old 12-18-2007, 12:24 AM
WebProWorld Pro
 

Join Date: Dec 2007
Location: Brussels, Belgium
Posts: 110
Jean-Luc RepRank 1
Default Re: LIVE bot looking for my pages in the images folder(?)

The problem is that your site does not process non-existing pages correctly.

http://wholesomebabyfood.com/images should return an HTTP error code. It presently returns a "200 OK" (after redirecting to http://wholesomebabyfood.com/images/.

Another invalid situation appears when I try to go to http://wholesomebabyfood.com/does-not-exist. It redirects to http://www.wholesomebabyfood.com/error.htm that returns a "200 OK" code.

Search engines get confused because your server answers that invalid URL's are "200 OK".

This second address should return a "404 File not found". The first address should return a "404 File not found" or a "403 Forbidden" code.

Jean-Luc
__________________
Checking redirects is now as easy as 1 2 3, even if you are not a HTTP-header guru !
AWStats Support: installation assistance, add-on's, extra sections, dedicated forum,...
Reply With Quote
  #7 (permalink)  
Old 12-18-2007, 08:21 AM
WebProWorld Member
 

Join Date: Apr 2004
Location: N.E.
Posts: 39
memaggiem RepRank 0
Default Re: LIVE bot looking for my pages in the images folder(?)

Hmmm.............so why, after 4 years on the web, would this all of a sudden be an issue? Maybe the MSNBOT changed it's crawiling practices.....

Also, why did it NOT find the pages where they actually are....within the main directory and not a sub such as images? Very odd! Some of the pages are actually indexed correctly...

I have the index.htm page in the images dir. to stop the files from being displayed. Should I remove the error.htm page (which is used to display all the links and content in my site so people might stay and find what they were looking for)? I'm not sure I want to do that! I'll have a look-see at a solution.

ETA - I just did a disallow MSNBot /images/
Maybe this will fix it!


Thanks for your reply! This is really driving me nuts!!!!

Last edited by memaggiem : 12-18-2007 at 08:44 AM.
Reply With Quote
  #8 (permalink)  
Old 12-18-2007, 09:19 AM
WebProWorld Pro
 

Join Date: Dec 2007
Location: Brussels, Belgium
Posts: 110
Jean-Luc RepRank 1
Default Re: LIVE bot looking for my pages in the images folder(?)

Add this in your .htaccess file:
Code:
Options +FollowSymlinks
RewriteEngine on 

RewriteRule   ^images\/?$   http://wholesomebabyfood.com/ [L,R=301]

ErrorDocument  404  /error.htm
It will correctly redirect visitors of /images to the home page and it will send a valid error code for non-existing pages.

Jean-Luc
__________________
Checking redirects is now as easy as 1 2 3, even if you are not a HTTP-header guru !
AWStats Support: installation assistance, add-on's, extra sections, dedicated forum,...
Reply With Quote
Reply

  WebProWorld > Search Engines > MSN Search Discussion Forum


Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

Similar Threads
Thread Thread Starter Forum Replies Last Post
BREAKING NEWS WINDOWS LIVE Search Goes Live as BETA scanmonkey MSN Search Discussion Forum 0 03-07-2006 11:26 PM
How to link to sub folder index pages Ellio Search Engine Optimization Forum 2 10-19-2005 01:08 PM
Yahoo Site Explorer Live: New Way To See All Your Pages, Li WPW_Feedbot Search Engine Optimization Forum 0 09-29-2005 08:30 PM
Google Testing Images On Web Results Pages WPW_Feedbot Search Engine Optimization Forum 0 01-05-2005 06:30 PM
Help wanted on images folder. kneelsit Graphics & Design Discussion Forum 3 02-19-2004 10:07 AM


Search Engine Optimization by vBSEO 3.2.0