iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Search Engine Optimization Forum SEO is much easier with help from peers and experts! The WebProWorld SEO forum is for the discussion and exploration of various search engine optimization topics. Any non (engine) specific SEO or SEM topics should go here.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 02-20-2008, 05:50 AM
WebProWorld New Member
 
Join Date: Dec 2007
Location: California
Posts: 24
jetskiron RepRank 0
Default Google ignores robot block

I placed the following robot blocker at the top of my header code on 2-17-08 and one day later, Google crawled the website anyway:
<META NAME="ROBOTS" CONTENT="noindex,nofollow">

I'm also wondering if my new website is going to be penalized by google now since this new website is exactly the same as as my existing website under a different domain name? I don't have all my content uploaded to the new website yet. I was going to redirect my old domain name to the new one after I got the new website completely installed, but Google jumped the gun. I did not submit the new domain name to Google either. I suspect the host on the new website, Godaddy, may have alerted Google?

I set up a new hosting account for the new domain name so I could have full control of the hosting account. My existing website is under the control of my web developers group hosting plan, but my web developer just went out of business.

thanks,
Ron
Reply With Quote
  #2 (permalink)  
Old 02-20-2008, 06:21 AM
subhzash's Avatar
WebProWorld Veteran
 
Join Date: Apr 2007
Posts: 340
subhzash RepRank 1
Smile Re: Google ignores robot block

Quote:
Originally Posted by jetskiron View Post
I placed the following robot blocker at the top of my header code on 2-17-08 and one day later, Google crawled the website anyway:
<META NAME="ROBOTS" CONTENT="noindex,nofollow">

I'm also wondering if my new website is going to be penalized by google now since this new website is exactly the same as as my existing website under a different domain name? I don't have all my content uploaded to the new website yet. I was going to redirect my old domain name to the new one after I got the new website completely installed, but Google jumped the gun. I did not submit the new domain name to Google either. I suspect the host on the new website, Godaddy, may have alerted Google?

I set up a new hosting account for the new domain name so I could have full control of the hosting account. My existing website is under the control of my web developers group hosting plan, but my web developer just went out of business.

thanks,
Ron

Instead of using robots in meta tags use the robot.txt file. That is useful for you


Regards
Subhash
Reply With Quote
  #3 (permalink)  
Old 02-20-2008, 07:18 AM
WebProWorld New Member
 
Join Date: Dec 2007
Location: California
Posts: 24
jetskiron RepRank 0
Default Re: Google ignores robot block

I found the robotstxt.org and it was real simple.

But Google has already indexed my pages from the new site. Whats going to happen? Will they stop showing my pages after they do their next crawl and see the robots.txt file?

thanks much,
Ron
Reply With Quote
  #4 (permalink)  
Old 02-20-2008, 08:23 AM
WebProWorld Veteran
 
Join Date: Jul 2004
Posts: 913
activeco RepRank 2
Default Re: Google ignores robot block

Q: “When I change a robots.txt to exclude more existing files from being crawled, how long does it take for them to be removed from the index? Perhaps the answer is a function of how often the site is crawled and it’s PR?”

A: It is a function of how often the site is crawled. I believe in the past that every several hundred page fetches or several days, the bot would re-check the robots.txt. Note that for supplemental results, you need recrawling to happen by the supplemental Googlebot in order for the robots.txt file to take affect on those pages. If you’re really sure you never want those pages to be seen, you can use our url removal tool to remove urls for six months at a time. But I’d be very careful with the url removal tool unless you’re an expert. If you make a mistake and (for example) remove your entire site, that’s your responsibility. Google can sometimes clear out self-removals, but we don’t guarantee it.

Taken rom muttcutts.com.
__________________
Impossible? You just underestimate the time.
Reply With Quote
  #5 (permalink)  
Old 02-20-2008, 11:37 AM
WebProWorld Pro
 
Join Date: Dec 2007
Location: Brussels, Belgium
Posts: 164
Jean-Luc RepRank 2
Default Re: Google ignores robot block

Sorry, but the problem comes from the fact that you didn't do things as they had to.

If the content of the new site is the same as the content of the old web site:
1. upload all contents to the new site
2. put a 301 redirect from every old page to the matching new page (assuming your old host allows it)
3. that's all.

No need for robots.txt or "noindex;nofollow".

Jean-Luc
__________________
Checking redirects made easy | | Professional AWStats Services
Reply With Quote
  #6 (permalink)  
Old 02-20-2008, 11:58 AM
WebProWorld Veteran
 
Join Date: Jul 2004
Posts: 913
activeco RepRank 2
Default Re: Google ignores robot block

Jean-Luc,

As I understood he couldn't have any control of the old server and he's putting the same content on a new server and for some reason under a new domain name.

Jetskiron,
unless your site was banned there was no reason for a new domain. Just change dns settings at your registrar.
__________________
Impossible? You just underestimate the time.
Reply With Quote
  #7 (permalink)  
Old 02-20-2008, 01:32 PM
incrediblehelp's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Jan 2004
Location: Live in Cincy Now
Posts: 7,573
incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4incrediblehelp RepRank 4
Default Re: Google ignores robot block

Quote:
Originally Posted by activeco View Post
Jean-Luc,

As I understood he couldn't have any control of the old server and he's putting the same content on a new server and for some reason under a new domain name.

Jetskiron,
unless your site was banned there was no reason for a new domain. Just change dns settings at your registrar.
I disagree. It sounds like he was fully aware that he needed to 301 redirect the old website and should have immediately after the new one went online. I dont see any need to block spiders here.

Anyways , just get the new website up and live with your content fully and put the 301 redirection in place. ofc ourse I am wondering like the others what the reason to make such a change is.
Reply With Quote
  #8 (permalink)  
Old 02-21-2008, 05:33 AM
davidweb's Avatar
WebProWorld Pro
 
Join Date: Mar 2007
Posts: 209
davidweb RepRank 1
Default Re: Google ignores robot block

Quote:
Originally Posted by jetskiron View Post
I placed the following robot blocker at the top of my header code on 2-17-08 and one day later, Google crawled the website anyway:
<META NAME="ROBOTS" CONTENT="noindex,nofollow">

I'm also wondering if my new website is going to be penalized by google now since this new website is exactly the same as as my existing website under a different domain name? I don't have all my content uploaded to the new website yet. I was going to redirect my old domain name to the new one after I got the new website completely installed, but Google jumped the gun. I did not submit the new domain name to Google either. I suspect the host on the new website, Godaddy, may have alerted Google?

I set up a new hosting account for the new domain name so I could have full control of the hosting account. My existing website is under the control of my web developers group hosting plan, but my web developer just went out of business.

thanks,
Ron
Google always follow its own rule. Google places restrictions for website owners, and surely not for himself.

Google can crawl almost anything from your website ( regardless of whether it is nofollow links, or robots blocking. Google will take your data and silently add into their database. If anything goes wrong ( like some adult link added for example), then google will use blocked data to pass their own judgement.
__________________
SEO Optimization Company - SEO Hawk - UK, US, Canada, and Australia
SEO Optimisation UK | Latest SEO Blog on the Planet http://www.seohawk.com/blog/
Reply With Quote
  #9 (permalink)  
Old 02-21-2008, 09:04 AM
WebProWorld Veteran
 
Join Date: Jul 2003
Location: Mass, U.S.A.
Posts: 399
Conficio RepRank 0
Default Re: Google ignores robot block

DON'T PANIC!

I think you have to better understand how a web crawler like Googlebot works. It gets a huge list of pages to crawl and follows it religiously. When you put a meta tag on your page (or pages) then Google does not stop crawling immediately if it already has the URL's in its list. Later it will analyze the pages read in the crawl and eliminate the URLs from its crawl list, as to your instructions, and also see which pages it should add to the index.

Remember, Google needed to read you page in order to even discover your instructions in the meta-tag.

All what happened so far is that the Google Bot did crawl a few pages. That in itself is not evidence that it will include it in the index (prevented by "NOINDEX") or does follow the links from this page for further crawls (prevented by "NOFOLLOW"). also it takes weeks for Google to purge pages from its index tagged by NOINDEX.

Actually, Google will come back to crawl pages that have links to it from the outside world. And there are some automatic links from sites like About US, which makes a wiki page for every domain registered.

To accelerate the purging of your pages from the Index, sign up for a Google Webmaster account and use Tools --> Remove URLs. Unfortunately ,for that to work you have to first authenticate your site.

Good luck

K<o>
Reply With Quote
  #10 (permalink)  
Old 02-22-2008, 06:23 AM
WebProWorld Member
 
Join Date: Feb 2008
Posts: 27
PAUL-B RepRank 0
Default Re: Google ignores robot block

Quote:
Originally Posted by Jean-Luc View Post
Sorry, but the problem comes from the fact that you didn't do things as they had to.

If the content of the new site is the same as the content of the old web site:
1. upload all contents to the new site
2. put a 301 redirect from every old page to the matching new page (assuming your old host allows it)
3. that's all.

No need for robots.txt or "noindex;nofollow".

Jean-Luc
Sound advice.
Reply With Quote
  #11 (permalink)  
Old 03-06-2008, 04:16 PM
WebProWorld Member
 
Join Date: Mar 2008
Location: Ramah
Posts: 38
newoptimizer RepRank 0
Default Re: Google ignores robot block

I have similar issues, and would like to add a question here..

For a few years I've had geocities pages, and in the last year purchased domain names for a couple of the sites..
I found that I would get kicked out of the search engines--just the new Domain name, not the geocities site itself--unless I used a specific Frames Forward that included a "No Frames" Tag...

I haven't had any trouble with having duplicate pages out there, except yahoo has recently kicked out the new Domain names, while the other major search engines give me a rather high ranking...

Should I have any concerns here?

And any suggestions how to remedy future issues with this?
Reply With Quote
  #12 (permalink)  
Old 03-06-2008, 06:07 PM
WebProWorld New Member
 
Join Date: Oct 2004
Posts: 13
gridsix RepRank 0
Default Re: Google ignores robot block

Quote:
Originally Posted by newoptimizer View Post
I have similar issues, and would like to add a question here..

For a few years I've had geocities pages, and in the last year purchased domain names for a couple of the sites..
I found that I would get kicked out of the search engines--just the new Domain name, not the geocities site itself--unless I used a specific Frames Forward that included a "No Frames" Tag...

I haven't had any trouble with having duplicate pages out there, except yahoo has recently kicked out the new Domain names, while the other major search engines give me a rather high ranking...

Should I have any concerns here?

And any suggestions how to remedy future issues with this?
Uhhh, I'd start a site off of geocities and begin the slow migration to that domain. In the short term, whatever keeps you in the search results is a good thing.
Reply With Quote
  #13 (permalink)  
Old 03-06-2008, 06:28 PM
WebProWorld Member
 
Join Date: Mar 2008
Location: Ramah
Posts: 38
newoptimizer RepRank 0
Default Re: Google ignores robot block

thanks for your reply...

the sites are non-income producing, so i have them at geocities because its free..

any suggestions where to place them?

and actually, yahoo didn't kick the domain out, just made it unsearchable by keyword after that domain url had been placed on the first page...

Last edited by newoptimizer; 03-06-2008 at 08:15 PM.
Reply With Quote
  #14 (permalink)  
Old 03-07-2008, 05:26 PM
WebProWorld Member
 
Join Date: Mar 2008
Location: Ramah
Posts: 38
newoptimizer RepRank 0
Default Re: Google ignores robot block

i think i will post this as a new thread..
Reply With Quote
Reply

  WebProWorld > Search Engines > Search Engine Optimization Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Unknown robot (identified by 'robot') chandrika Webmaster Resources Discussion Forum 3 08-23-2007 01:32 AM
WebPosition = Google Block J-Spider Google Discussion Forum 11 03-09-2007 11:52 PM
Firefox ignores <comment> RikR Graphics & Design Discussion Forum 1 11-03-2004 01:10 AM
Google Ignores <h1> tag janeth Google Discussion Forum 84 10-07-2004 08:40 PM
Google Ignores Robots.txt jestep Google Discussion Forum 1 09-03-2004 01:39 PM


All times are GMT -4. The time now is 01:40 AM.



Search Engine Optimization by vBSEO 3.3.0