iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Google Discussion Forum Google Discussion forum is for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 07-01-2007, 01:10 PM
WebProWorld New Member
 
Join Date: Jul 2007
Posts: 2
jb1702 RepRank 0
Default robots.txt unreachable - googlebot stopped crawling

Hallo everyone
My website has been reguralry crawled by googlebot
for two years now. Suddenly on June 8 everything stopped because of a
missing robots.txt file. Until then googlebot was crawling the site
daily and I had over 10.000 pages in the index. I did't have a
robots.txt and my server returned 404s

GET /robots.txt HTTP/1.1" 404 2157 "-" "Mozilla/5.0 (compatible;
Googlebot/2.1...

On that day googlebot asked for the robots file, crawled some pages
and never returned (I never got another googlebot request according to
my logs).

On my webmasters tools now I have:
robots.txt URL http://www.tospitimou.gr/robots.txt
Last downloaded June 8, 2007 2:04:45 AM PDT
Status 404 (Not found) [?]

and almost 5.000 "Unreachable URLs" errors, with cause "robots.txt
unreachable". I have added a robots file about two weeks now and
changed it a few times in that period too, but googlebot has never
made another request for anything since that day. Yet many of the
"Unreachable URL" erros where registered during that time, but not a single
page or robots request in my logs.

Can you think of anything that could get me out of this?

Thanks in advance

Last edited by jb1702; 07-01-2007 at 01:12 PM.
Reply With Quote
  #2 (permalink)  
Old 07-01-2007, 06:21 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,132
Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8
Default Re: robots.txt unreachable - googlebot stopped crawling

Your robots.txt as they are now cannot be a problem. From what you claimed above, or you have broken links or your server was not available.

Another post possible problem can be your DNS errors: DNS Stuff: DNS tools, DNS hosting tests, WHOIS, traceroute, ping, and other network and domain name tools.
You better check that out with your provider as soon as possible.

When all issues above are clarified and solved, then create an XML site map and submit it to Google.

I just checked and you are not banned from Google.

Good luck patrida! I am half Greek. LOL
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #3 (permalink)  
Old 07-01-2007, 06:46 PM
WebProWorld New Member
 
Join Date: Jul 2007
Posts: 2
jb1702 RepRank 0
Default Re: robots.txt unreachable - googlebot stopped crawling

Euxaristw for your answer

Well, I know about my dns issues, they were there from the beginning and they never caused any problem. I also used to have a sitemap, but I removed it after the problem with google appeared. Now, whenever I try to upload a new one, google says it's unreachable too.

What puzzles me is that I have absolutely no Googlebot requests in my access and error logs. I also tried to translate some pages of my site using the google language tools and they were unreachable from there as well.

So what I'm thinking is that somehow the google ips have been blocked from by server. Do you know of a way to verify what ips are blocked from my host's machine?


Quote:
Originally Posted by Webnauts View Post
Your robots.txt as they are now cannot be a problem. From what you claimed above, or you have broken links or your server was not available.

Another post possible problem can be your DNS errors: DNS Stuff: DNS tools, DNS hosting tests, WHOIS, traceroute, ping, and other network and domain name tools.
You better check that out with your provider as soon as possible.

When all issues above are clarified and solved, then create an XML site map and submit it to Google.

I just checked and you are not banned from Google.

Good luck patrida! I am half Greek. LOL
Reply With Quote
  #4 (permalink)  
Old 07-01-2007, 08:46 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,132
Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8
Default Re: robots.txt unreachable - googlebot stopped crawling

I see you are on an Apache server. You should check if the IPs are blocked in your .htaccess file.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #5 (permalink)  
Old 07-01-2007, 08:56 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,132
Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8
Default Re: robots.txt unreachable - googlebot stopped crawling

I cannot imagine that your site is crawlable anyway, if you have on the homepage alone 1490 HTML errors: Result for http://www.tospitimou.gr/main/index.jsp - W3C Markup Validator

For God's sake man.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #6 (permalink)  
Old 07-01-2007, 08:59 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,132
Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8Webnauts RepRank 8
Default Re: robots.txt unreachable - googlebot stopped crawling

And this is damn weird brother: Tospitimou.gr - To Spiti Mou

I get: %ERROR:101: no entries found

You seriously need professional help!
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
Reply

  WebProWorld > Search Engines > Google Discussion Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Googlebot gets robots.txt then leaves Littlemansearch Google Discussion Forum 4 06-20-2007 04:18 PM
Why is googlebot ignoring my robots.txt file? Littlemansearch Google Discussion Forum 8 05-24-2007 05:40 PM
500 internal server error stops googlebot crawling my site? internet-marketing-cr Search Engine Optimization Forum 10 01-18-2007 02:57 PM
Googlebot Crawling AdSense WebSearch Pages ronniethedodger Google Discussion Forum 0 08-16-2004 03:36 AM
Googlebot only visting index.asp and robots.txt only pbatson Google Discussion Forum 14 03-04-2004 12:17 AM


All times are GMT -4. The time now is 06:33 AM.



Search Engine Optimization by vBSEO 3.3.0