Submit Your Article Forum Rules

Results 1 to 8 of 8

Thread: LIVE bot looking for my pages in the images folder(?)

  1. #1

    LIVE bot looking for my pages in the images folder(?)

    Just noticed that an IP that resolves to a MSN Live bot is going through my site and looking for pages in the /images/ folder! Anyone know the whats or whys of this? Looking through the day's log, I see this has been going on all day!

    Thanks!

  2. #2
    WebProWorld MVP incrediblehelp's Avatar
    Join Date
    Jan 2004
    Posts
    7,567

    Re: LIVE bot looking for my pages in the images folder(?)

    Why wouldnt MSN look for images?

  3. #3

    Re: LIVE bot looking for my pages in the images folder(?)

    Looking for images within the images folder is the norm, yes I am aware of that

    However, the logs showed the bot looking for ALL my html pages within the /images/ directory and NOT with the root directory.

    Therefore, my logs were filled with "file not found" images/images/index.htm etc and so forth.

    The access logs did not show the bot accessing or indexing the pages at all, only the /images/gif or images/jpg files.........

    Odd?

  4. #4
    WebProWorld MVP SemAdvance's Avatar
    Join Date
    Dec 2005
    Posts
    1,037

    Re: LIVE bot looking for my pages in the images folder(?)

    Quote Originally Posted by memaggiem View Post
    Looking for images within the images folder is the norm, yes I am aware of that

    However, the logs showed the bot looking for ALL my html pages within the /images/ directory and NOT with the root directory.

    Therefore, my logs were filled with "file not found" images/images/index.htm etc and so forth.

    The access logs did not show the bot accessing or indexing the pages at all, only the /images/gif or images/jpg files.........

    Odd?
    Without knowing the site involved at best you will receive speculation.

    Your robots.txt could be causing errors, your websites pages i.e. html coding may be causing errors hard to say or as is often the case MSN is still full of bugs as are most Microsoft products...

  5. #5

    Re: LIVE bot looking for my pages in the images folder(?)

    Thanks for your reply! I didn't know you had replied!

    My site has just vanished from MSN but I was able to find it with a search term. The result is this:

    MAKE HOMEMADE BABY FOOD RECIPES, HOMEMADE BABY FOOD RECIPES, EASY ...
    Baby Food Recipes - Make Homemade Baby Food - Recipes for Healthy Homemade Baby Food with Tips for Making and Feeding Baby Homemade Baby Food. Step-By-Step instructions ... ...

    MAKE HOMEMADE BABY FOOD RECIPES, HOMEMADE BABY FOOD RECIPES, EASY SOLID BABY FOOD TIPS, BABY NUTRITION and MORE at wholesomebabyfood.com | Making Baby Food with Wholesome Goodness & Love · 12/15/2007 · Cached page

    **Notice that it has my images folder in the result!** WTH is going on with that?

    I have not change the robots.txt in over 1 year nor have I changed any meta-tag info

    thanks !!!!

  6. #6
    Senior Member
    Join Date
    Dec 2007
    Posts
    212

    Re: LIVE bot looking for my pages in the images folder(?)

    The problem is that your site does not process non-existing pages correctly.

    http://wholesomebabyfood.com/images should return an HTTP error code. It presently returns a "200 OK" (after redirecting to http://wholesomebabyfood.com/images/.

    Another invalid situation appears when I try to go to http://wholesomebabyfood.com/does-not-exist. It redirects to http://www.wholesomebabyfood.com/error.htm that returns a "200 OK" code.

    Search engines get confused because your server answers that invalid URL's are "200 OK".

    This second address should return a "404 File not found". The first address should return a "404 File not found" or a "403 Forbidden" code.

    Jean-Luc
    200ok.eu Broken Link Checker finds 404 errors, error pages with 200 ok status, missing images, protocol errors, password protected pages, bad domain names, redirect loops, parking pages, ...

  7. #7

    Re: LIVE bot looking for my pages in the images folder(?)

    Hmmm.............so why, after 4 years on the web, would this all of a sudden be an issue? Maybe the MSNBOT changed it's crawiling practices.....

    Also, why did it NOT find the pages where they actually are....within the main directory and not a sub such as images? Very odd! Some of the pages are actually indexed correctly...

    I have the index.htm page in the images dir. to stop the files from being displayed. Should I remove the error.htm page (which is used to display all the links and content in my site so people might stay and find what they were looking for)? I'm not sure I want to do that! I'll have a look-see at a solution.

    ETA - I just did a disallow MSNBot /images/
    Maybe this will fix it!


    Thanks for your reply! This is really driving me nuts!!!!

  8. #8
    Senior Member
    Join Date
    Dec 2007
    Posts
    212

    Re: LIVE bot looking for my pages in the images folder(?)

    Add this in your .htaccess file:
    Code:
    Options +FollowSymlinks
    RewriteEngine on 
    
    RewriteRule   ^images\/?$   http://wholesomebabyfood.com/ [L,R=301]
    
    ErrorDocument  404  /error.htm
    It will correctly redirect visitors of /images to the home page and it will send a valid error code for non-existing pages.

    Jean-Luc
    200ok.eu Broken Link Checker finds 404 errors, error pages with 200 ok status, missing images, protocol errors, password protected pages, bad domain names, redirect loops, parking pages, ...

Similar Threads

  1. redirect from folder only to index in parent folder
    By weegillis in forum Webmaster Resources Discussion Forum
    Replies: 3
    Last Post: 10-05-2009, 10:36 AM
  2. Excluding the "good" BOTS from /images folder
    By memaggiem in forum Search Engine Optimization Forum
    Replies: 16
    Last Post: 05-20-2008, 09:19 AM
  3. images not displaying on SSL pages
    By Britishcandycom in forum Web Programming Discussion Forum
    Replies: 14
    Last Post: 02-28-2008, 11:38 AM
  4. How to link to sub folder index pages
    By Ellio in forum Search Engine Optimization Forum
    Replies: 2
    Last Post: 10-19-2005, 01:08 PM
  5. Help wanted on images folder.
    By kneelsit in forum Graphics & Design Discussion Forum
    Replies: 3
    Last Post: 02-19-2004, 10:07 AM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •