iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Webmaster Resources Discussion Forum Sitemaps and robots and logfiles -- Oh My! If you have any questions, comments, concerns and/or ideas about the tools currently available to webmasters to make their lives... 'easier'. Here's where you need to be. Know of a good tool? Post it here. Got something funny in your logfiles? Maybe we can help.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 11-19-2008, 12:14 PM
WebProWorld New Member
 
Join Date: Nov 2008
Posts: 2
CraigH RepRank 0
Lightbulb Web Crawler frequency and access to the Deep Web

Hi all,
I am working on a personal project looking at web crawler frequency and the accessibility of the Deep Web to crawlers. From a webmaster's point of view, the problem as I see it is getting content into a search engine's index quickly after publication, and getting exposure to deep-web content within the search engines.

The project website is at Site Update Notification where I have tried to describe the problems and a solution based on an idea of web server agents notifying crawlers that content has changed (preventing unnecessary re-crawling and use of your bandwidth, too!).

I'd be really interested to hear everyone's views on the problem domain and the described Site Update Notification system. It is technically feasible (I have some prototypes working nicely which I may open-source if the project has significant interest), but would the webmaster community embrace the technology if major search engines were involved? Any feedback, suggestions or general comment would be great.

Thanks
-- Craig
Reply With Quote
  #2 (permalink)  
Old 11-28-2008, 11:28 PM
WebProWorld New Member
 
Join Date: Nov 2008
Posts: 11
muckle.martin RepRank 0
Default Re: Web Crawler frequency and access to the Deep Web

That's an excellent idea and blogs had introduced 'pinging' concept keeping the same problem in mind. I had though about another way to get browser to post the md5 checksum for the page, but that obviously had too many queries going to the search engines.

Sitemaps were probably a step towards finding out if the pages have been updated but obviously they are not as accurate.
Reply With Quote
Reply

  WebProWorld > Webmaster, IT and Security Discussion > Webmaster Resources Discussion Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Google crawler & webmaster crawler sally Search Engine Optimization Forum 4 03-08-2008 12:09 PM
crawler frequency VisualMind Search Engine Optimization Forum 1 10-18-2005 12:28 AM
GoogleBot Frequency Ankushm Google Discussion Forum 3 03-04-2005 10:06 AM
Search term frequency sfowler Search Engine Optimization Forum 7 05-14-2004 03:51 AM
What's the frequency, Kenneth? minstrel The Castle Breakroom (General: Any Topic) 4 11-24-2003 08:23 AM


All times are GMT -4. The time now is 11:06 AM.



Search Engine Optimization by vBSEO 3.3.0