Google Discussion Forum: for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects.
I have been checking my server logs and have noticed that when Googlebot spiders my website, it now requests the robots.txt file and nothing else. It then returns 30-60 minutes later and requests the index page. Does anyone know why it does this?
Thanks in advance,
littlemansearch
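For anyone who wants to measure this pattern rather than eyeball it, here is a rough Python sketch that pulls Googlebot requests out of an access log and reports the gap between consecutive hits. It assumes the Apache "combined" log format, which may not match your server, and the sample lines (IP addresses, dates, sizes) are made up for illustration. The names `LOG_LINE` and `googlebot_hits` are hypothetical.

```python
import re
from datetime import datetime

# Assumed format: Apache "combined" access log. Adjust the pattern if your
# server writes a different layout.
LOG_LINE = re.compile(
    r'\S+ \S+ \S+ \[(?P<time>[^\]]+)\] "(?P<method>\S+) (?P<path>\S+)[^"]*" '
    r'\d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)

def googlebot_hits(lines):
    """Return (timestamp, path) for every request whose user agent mentions Googlebot."""
    hits = []
    for line in lines:
        match = LOG_LINE.match(line)
        if match and "Googlebot" in match.group("agent"):
            when = datetime.strptime(match.group("time"), "%d/%b/%Y:%H:%M:%S %z")
            hits.append((when, match.group("path")))
    return hits

# Made-up sample lines, not taken from any real log.
sample = [
    '192.0.2.1 - - [01/May/2007:10:00:00 +0000] "GET /robots.txt HTTP/1.1" 200 120 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '192.0.2.7 - - [01/May/2007:10:05:00 +0000] "GET /index.html HTTP/1.1" 200 5120 "-" '
    '"Mozilla/5.0"',
    '192.0.2.1 - - [01/May/2007:10:45:00 +0000] "GET /index.html HTTP/1.1" 200 5120 "-" '
    '"Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
]

hits = googlebot_hits(sample)

# Minutes between each Googlebot request and the one before it.
gaps = [
    (later[1], (later[0] - earlier[0]).total_seconds() / 60)
    for earlier, later in zip(hits, hits[1:])
]
for path, minutes in gaps:
    print(f"{path} requested {minutes:.0f} minutes after the previous Googlebot hit")
```

Matching on the user agent string alone can be fooled by other bots that claim to be Googlebot, so treat the result as a first look only.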
For example, you have in your robots.txt:
user-agent: Slurp
Although directive names are not case sensitive, I advise you to write exactly "User-agent", that is, all lowercase except for the capital "U". Maybe that will help.
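As a quick sanity check on the case question, here is a small sketch using Python's standard `urllib.robotparser`. It only shows how that one parser treats the two spellings and says nothing definite about how Googlebot or Slurp parse the file. The `Disallow` rule, the `example.com` URLs and the helper name `allowed` are assumptions made up for the test.

```python
from urllib.robotparser import RobotFileParser

def allowed(robots_txt, agent, url):
    """Parse robots.txt text and report whether `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)

# Same rules, differing only in the capitalisation of the field name.
# The Disallow line is a made-up rule for illustration.
lower = "user-agent: Slurp\nDisallow: /private/\n"
upper = "User-agent: Slurp\nDisallow: /private/\n"

blocked_url = "http://example.com/private/page.html"
open_url = "http://example.com/index.html"

# Compare the answers each spelling gives for both URLs.
lower_result = (allowed(lower, "Slurp", blocked_url), allowed(lower, "Slurp", open_url))
upper_result = (allowed(upper, "Slurp", blocked_url), allowed(upper, "Slurp", open_url))
print(lower_result == upper_result)
```

If both spellings give the same answers here, the capital "U" is more a matter of convention than a likely cause of the crawl pattern, at least for parsers that behave like this one.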
Thanks for the reply. I will try changing the "u" to a capital, although I can't see this making much difference. I had problems with the robots.txt file until recently, which I fixed, and everything was working fine when my site was crawled. This has only just started happening again. I was wondering whether it could have something to do with the way Google crawls sites; maybe they are changing it.
I know that the search engine I run will only let me crawl every link on a site when it runs on auto, which can be a bit of a pain for other webmasters. I am introducing some new routines to let the engine spider only a few links on a site when on automatic run, then move on to the next domain and spider a few there, then return to these sites another time. Perhaps Google is changing its crawling technique too. Just a thought.
Thanks,
Littleman Search
http://www.littlemansearch/index.html
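The "few links per site, then move on and come back later" routine could be sketched roughly like this in Python. This is only an illustration of the idea, not how the Littleman engine or Googlebot actually schedules crawls; the function name `schedule`, the `per_visit` parameter and the `.example` URLs are all hypothetical.

```python
from collections import deque
from urllib.parse import urlparse

def schedule(urls, per_visit=2):
    """Yield URLs so that at most `per_visit` come from one domain per visit.

    Domains are visited round-robin; a domain with URLs left over goes to
    the back of the line and is revisited later.
    """
    # Group the pending URLs by domain, keeping their original order.
    queues = {}
    for url in urls:
        queues.setdefault(urlparse(url).netloc, deque()).append(url)

    domains = deque(queues)
    while domains:
        domain = domains.popleft()
        queue = queues[domain]
        for _ in range(min(per_visit, len(queue))):
            yield queue.popleft()
        if queue:
            # Still work to do on this site: come back after the others.
            domains.append(domain)

# Hypothetical URLs for illustration.
pending = [
    "http://site-a.example/1",
    "http://site-a.example/2",
    "http://site-a.example/3",
    "http://site-b.example/1",
]
order = list(schedule(pending, per_visit=2))
print(order)
```

A real crawler would also want a delay between visits to the same domain and a robots.txt check before each fetch; both are left out here to keep the sketch short.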
Do you have a Google Webmaster Tools account? Set one up and check their spider summary page for web crawl errors such as broken links, server errors or missing files.
I noticed that you don't have an XML sitemap. Crawl your site and create an XML sitemap with GSiteCrawler (Google Sitemap Generator for Windows). You can easily add a link to your new sitemap file in your robots.txt file with the following string:
Code:
Sitemap: http://www.littlemansearch.co.uk/sitemap.xml
# end of file
Find broken links on your site with Xenu's Link Sleuth (TM).
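To confirm that a parser picks up the Sitemap line once it is in robots.txt, something like the following Python sketch may help. It uses the standard `urllib.robotparser` (the `site_maps()` method needs Python 3.8 or later) and only shows what that parser sees, not what Googlebot does. The `User-agent: *` / `Disallow:` lines are an assumed allow-everything rule set, since the real file is not shown in the thread.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt contents: the first two lines are a hypothetical
# "allow everything" rule; the Sitemap line is the one suggested above.
robots_txt = """User-agent: *
Disallow:

Sitemap: http://www.littlemansearch.co.uk/sitemap.xml
# end of file
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)
```

To test the live file instead of a pasted copy, `parser.set_url(...)` followed by `parser.read()` fetches it over HTTP before the same `site_maps()` call.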
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved. iEntry, Inc., 2549 Richmond Rd., Lexington, KY 40509