Thread: How do you know if your site has been crawled or indexed

  1. #1

    How do you know if your site has been crawled or indexed

    I was just wondering: how do you know when or if your site has been crawled or indexed by the search engines? I see people saying that their site was crawled many times, but how do they know, and how can I find out if mine has been crawled?
    Thank you
    kevin

  2. #2
    minstrel (WebProWorld MVP)
    Join Date: Jul 2003 | Location: Ottawa, Canada | Posts: 2,553
    Didn't I just see this question posted in another thread?

    Anyway, the answer is: look at your website logs. Depending on which stats package your host uses, you may see spiders identified by name (Slurp, Googlebot, MSNbot, etc.), or you'll see the spider names or "refering" (sic) agents in the "agents" section, appended to an entry like "Internet Explorer... MSNbot".
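
    If your host gives you the raw access log rather than a stats package, a rough sketch of the same check in Python might look like this. It assumes the log is in the Apache "combined" format, where the user agent is the last quoted field on each line. The function name, the sample lines and the IP addresses are made up for illustration.

```python
import re
from collections import Counter

# Spider names mentioned in this thread; extend the tuple as needed.
BOT_NAMES = ("Googlebot", "Slurp", "msnbot")

# Assumption: combined log format, so the user agent is the last quoted field.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')


def count_bot_hits(lines):
    """Count requests per spider name, matched case-insensitively."""
    hits = Counter()
    for line in lines:
        match = UA_PATTERN.search(line)
        if not match:
            continue  # line is not in the expected format
        agent = match.group(1).lower()
        for bot in BOT_NAMES:
            if bot.lower() in agent:
                hits[bot] += 1
                break
    return hits


# Invented sample lines (documentation-range IPs), for illustration only.
SAMPLE_LOG = [
    '192.0.2.10 - - [10/Oct/2004:13:55:36 -0700] "GET /index.html HTTP/1.1" 200 2326 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '198.51.100.4 - - [10/Oct/2004:14:01:02 -0700] "GET /about.html HTTP/1.0" 200 1510 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp)"',
    '203.0.113.9 - - [10/Oct/2004:14:05:40 -0700] "GET / HTTP/1.1" 200 512 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"',
    '192.0.2.77 - - [10/Oct/2004:14:09:13 -0700] "GET /robots.txt HTTP/1.0" 200 68 "-" "msnbot/1.0"',
]

if __name__ == "__main__":
    for bot, count in count_bot_hits(SAMPLE_LOG).items():
        print(bot, count)
```

    To run it against a real log, pass an open file instead of SAMPLE_LOG, e.g. count_bot_hits(open("access.log")), where the path is whatever your host uses. Keep in mind that anyone can fake a user agent, so this only tells you who claimed to be a spider.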

  3. #3
    sfowler (Senior Member)
    Join Date: May 2004 | Posts: 947
    If you don't fancy wading through the logs or don't have easy access to them, just copy a unique sentence from a page and paste it in as a search. If the page has been indexed by that SE, your page should come up at the top of the list.
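
    If you check a lot of pages this way, a small helper can build the search URL for you. This is only a sketch: it wraps the sentence in double quotes so the engine treats it as an exact phrase, and it assumes Google's usual "search?q=" URL. The function name and the base parameter are my own, so swap in another engine's search URL as needed.

```python
from urllib.parse import urlencode


def exact_phrase_url(sentence, base="https://www.google.com/search"):
    """Build a search URL for an exact-phrase query on the given sentence."""
    # Collapse stray whitespace from copy and paste, then quote the phrase.
    phrase = '"' + " ".join(sentence.split()) + '"'
    return base + "?" + urlencode({"q": phrase})


if __name__ == "__main__":
    print(exact_phrase_url("a unique sentence copied from my page"))
```

    Open the printed URL in a browser and see whether your page comes up.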

  4. #4
    ronniethedodger (WebProWorld MVP)
    Join Date: Aug 2003 | Posts: 1,396
    C'mon people ... is this SE-101 stuff or what? =)
    1. Use the site:www.domain.com query. It lists the pages from that domain that the engine has in its index.

    2. For server log analysis ... software like AWstats or Sawmill will do the trick. They will identify bot activity (although Sawmill does a better job of it). No need to wade thru the raw log files.

    3. Another way is with scripts that you can attach to the footers of all your pages. Some of these scripts are designed to trigger an entry for bot activity (amongst other things). The scripts come in a variety of flavors: Perl, PHP, ASP, etc.
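
    For the footer-script idea in point 3, the core logic is small. Here is a minimal sketch in Python rather than Perl, PHP or ASP. It is not any particular script from the wild: the function name, the marker list and the tab-separated log layout are all assumptions, and how you obtain the user agent and page path depends on your server setup.

```python
import io
from datetime import datetime, timezone

# Lowercase fragments of the spider names mentioned in this thread.
BOT_MARKERS = ("googlebot", "slurp", "msnbot")


def record_bot_visit(user_agent, page, out, now=None):
    """Write one log line to `out` if the user agent looks like a known spider.

    Returns True when an entry was written, False otherwise.
    """
    agent = user_agent.lower()
    for marker in BOT_MARKERS:
        if marker in agent:
            stamp = (now or datetime.now(timezone.utc)).isoformat()
            out.write(f"{stamp}\t{marker}\t{page}\n")
            return True
    return False


if __name__ == "__main__":
    buffer = io.StringIO()  # stand-in for a real log file opened in append mode
    record_bot_visit("Mozilla/5.0 (compatible; Googlebot/2.1)", "/index.html", buffer)
    print(buffer.getvalue(), end="")
```

    In a real page you would call this once per request with the visitor's user agent and the requested path, writing to a file opened in append mode.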

  5. #5
    sfowler (Senior Member)
    Join Date: May 2004 | Posts: 947
    Sure, but I find this way is the easiest way to check if an updated text is genuinely in the index, especially when I know that the page was there beforehand.

  6. #6
    ronniethedodger (WebProWorld MVP)
    Join Date: Aug 2003 | Posts: 1,396
    Quote Originally Posted by sfowler
    Sure, but I find this way is the easiest way to check if an updated text is genuinely in the index, especially when I know that the page was there beforehand.
    I wish I could remember the link, but there is a site that uses this technique for member profiles.

    They have a system of coding the profiles of members into this teeny-tiny text onto their pages. It looks something like this (almost barcode looking):

    CQ10J KL90Y JK74Y IS72K SO34A SL45W AI78E
    AO23A AI23A IC22I AU223 AY2343F OA232D IF839A
    YA34A OD098I GA387E NX73D HA399U EW939U EY9449D
    JK4940D FH3030A AK2928Y CA3930J AK399D HF3030A


    Then they leverage Google search to find the closest matches for your own profile ... since this code is part of the page. Pretty ingenious use of Google.

    So yep ... a unique string on your pages will do the trick. Unless it is Yahoo, of course, where there is a delay between the crawl and the actual indexing of the page.
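
    If you would rather not hunt for a naturally unique sentence, you can plant your own marker. A rough sketch, assuming a short token derived from the page URL is unlikely to appear anywhere else on the web; the "zq" prefix, the length and the function name are arbitrary choices of mine, and this is not how the profile site above builds its codes.

```python
import hashlib


def page_marker(url, length=10):
    """Derive a stable, unlikely-to-collide token from a page URL."""
    digest = hashlib.sha1(url.encode("utf-8")).hexdigest()
    # The prefix just makes the token easy to spot in the page source.
    return "zq" + digest[:length]


if __name__ == "__main__":
    print(page_marker("http://www.domain.com/index.html"))
```

    Put the printed token somewhere in the page text, wait for the next crawl, then search for it. If the page turns up, that version of the page is in the index.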
