Contact Us Forum Rules Search Archive
WebProWorld Part of WebProNews.com
Page One Link To Us Edit Profile Private Messages Archives FAQ RSS Feeds  
 

Go Back   WebProWorld > Search Engines > Yahoo! Discussion Forum
Subscribe to the Newsletter FREE!


Register FAQ Members List Calendar Arcade Chatbox Mark Forums Read

Yahoo! Discussion Forum Yahoo Search discussion. Any topic or subject specific to Yahoo should go here. You will also find a subforum dedicated to YPN & Panama.

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 08-07-2004, 12:06 PM
WebProWorld Member
 

Join Date: Aug 2004
Posts: 60
loseriam RepRank 0
Default How do you know if your site has been crawled or indexed

I was just wondering how do you know when or if your site was crawled or indexed with these search engines. I see people saying that there site was crawled many times but how do they know and how can I find out if mine has been crawled.
Thank you
kevin
Reply With Quote
  #2 (permalink)  
Old 08-07-2004, 01:06 PM
minstrel's Avatar
WebProWorld 1,000+ Club
 

Join Date: Jul 2003
Location: Ottawa, Canada
Posts: 3,619
minstrel RepRank 0
Default

Didn't I just see this question posted in another thread?

Anyway, the answer is: look at your website logs - depending on which stats package your host uses, you may see spiders identified by their names (Slurp, Googlebot, MSNbot, etc.) or you'll see the spidernames or "refering" (sic) agents in the "agents" section appended to an entry like "Internet Explorer... MSNbot".
Reply With Quote
  #3 (permalink)  
Old 08-07-2004, 04:47 PM
sfowler's Avatar
WebProWorld Veteran
 

Join Date: May 2004
Location: Vienna, Austria
Posts: 967
sfowler RepRank 0
Default

If you don't fancy wasding through the logs or don't have easy access to them, then just cut out a unique sentence from a page and paste it in as a search. If the page has been indexed by that SE, your pagee has to come up top of the list.
__________________
Simon - Biogas, Rauchgasanalysator and Rauchgasanalysegerät
Reply With Quote
  #4 (permalink)  
Old 08-10-2004, 06:31 AM
ronniethedodger's Avatar
WebProWorld 1,000+ Club
 

Join Date: Aug 2003
Location: Central US
Posts: 1,581
ronniethedodger RepRank 0
Default

C'mon people ... is this SE-101 stuff or what? =)
  • 1. Use the site:www.domain.com query.

    2. For server log analysis ... software like AWstats or Sawmill will do the trick. They will identify bot activity (although Sawmill does a better job of it.) No need to wade thru the raw log files.

    3. Another way is with scripts that you can attach to the footers of all your pages. Some of these scripts are designed to trigger an entry for bot activity (amongst other things). The scripts are in a variety of flavors Perl, PHP, ASP, etc.
Reply With Quote
  #5 (permalink)  
Old 08-11-2004, 08:27 AM
sfowler's Avatar
WebProWorld Veteran
 

Join Date: May 2004
Location: Vienna, Austria
Posts: 967
sfowler RepRank 0
Default

Sure, but I find this way is the easiest way to check if an updated text is genuinely in the index, especially when I know that the page was there beforehand.
__________________
Simon - Biogas, Rauchgasanalysator and Rauchgasanalysegerät
Reply With Quote
  #6 (permalink)  
Old 08-12-2004, 11:18 PM
ronniethedodger's Avatar
WebProWorld 1,000+ Club
 

Join Date: Aug 2003
Location: Central US
Posts: 1,581
ronniethedodger RepRank 0
Default

Quote:
Originally Posted by sfowler
Sure, but I find this way is the easiest way to check if an updated text is genuinely in the index, especially when I know that the page was there beforehand.
I wish I could remember the link, but there is a site that uses this technique for member profiles.

They have a system of coding the profiles of members into this teeny-tiny text onto their pages. It looks something like this (almost barcode looking):

CQ10J KL90Y JK74Y IS72K SO34A SL45W AI78E
AO23A AI23A IC22I AU223 AY2343F OA232D IF839A
YA34A OD098I GA387E NX73D HA399U EW939U EY9449D
JK4940D FH3030A AK2928Y CA3930J AK399D HF3030A


Then they leverage Google search to find the closest matches for your own profile ... since this code is part of the page. Pretty ingenious use of Google.

So yep ... a unique string on your pages will do the trick. Unless it is Yahoo of course, then there is a delay between the crawl and the actual indexing of the page.
Reply With Quote
Reply

  WebProWorld > Search Engines > Yahoo! Discussion Forum
Tags: , , ,



Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Search Engine Optimization by vBSEO 3.2.0