iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Internet Security Discussion Forum This forum is for the discussion of security related issues. If you find a new Phishing scheme, spyware, virus or malicious site - let us know about it. If any of the above found you... here's where you ask for help.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 09-28-2009, 06:35 PM
simonm's Avatar
WebProWorld Veteran
 
Join Date: Jul 2003
Location: UK Kent
Posts: 309
simonm RepRank 2
Default robots.txt an open window?

I use robots.txt to identify what I don't want on the search engines. But it struck me after an increase in hack attempts that maybe all that robots.txt was doing was telling the hackers where I didn't want them to go and therefore where they might want to go!

I've ditched robots.txt and just made sure any pages I want off search engine listings have no inbound links from public pages.

Maybe I'm paranoid!
Reply With Quote
  #2 (permalink)  
Old 09-29-2009, 10:08 AM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,022
inertia RepRank 6inertia RepRank 6inertia RepRank 6inertia RepRank 6inertia RepRank 6inertia RepRank 6inertia RepRank 6
Default Re: robots.txt an open window?

What's your set-up? Are you using a CMS?

Pages can be found by G in a variety of ways, not just from inbound links. Do the pages link out anywhere?

If I want to block a page from the index I use the noindex meta tag.
__________________
Latest Blog Post: Google Consultant - Should this Job Title be Allowed? - Matt Inertia's SEO Blog - SEOers.org

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society
Reply With Quote
  #3 (permalink)  
Old 09-29-2009, 07:08 PM
Smeagol's Avatar
WebProWorld New Member
 
Join Date: Apr 2009
Location: Netherlands
Posts: 16
Smeagol RepRank 1
Default Re: robots.txt an open window?

I fear there be more to be done here than just a 'nofollow' alone, but it is good start. Like 'nofollow', noarchive' and 'noindex'....
However. Been there, tried it and did not get the t-shirt. If bad intentions are out to get you, they never follow the rules!
Try putting a "bad bot/Hack" trap as the first exclude in the robots.txt file, and possibly a few other nice tricks to put off the unlawfull access.
And if they do go there.. where you told them not to go, fload then with (what looks like honey) garbage. Knowing the search engines engines advise against this kind of tactic, it has worked wonders for me, and the 'B', "Y" and 'G' has not tried it yet. So look after yourself and your site and not what the search engines advise!
Reply With Quote
  #4 (permalink)  
Old 09-29-2009, 07:16 PM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

Quote:
Originally Posted by simonm View Post
I've ditched robots.txt and just made sure any pages I want off search engine listings have no inbound links from public pages.
Well, unless there are also no internal links pointing to said pages, they're easy discovered.

If there are, all you've accomplished is to implicitly allow the well behaved bots to do as they like.
Reply With Quote
  #5 (permalink)  
Old 09-29-2009, 07:37 PM
Smeagol's Avatar
WebProWorld New Member
 
Join Date: Apr 2009
Location: Netherlands
Posts: 16
Smeagol RepRank 1
Default Re: robots.txt an open window?

I agree completely with you deepsand. But when you have a private site and want (select informed) others to see it and give comments (private url) and suddenly every search engine is on your site! (exaduration).

For me this means that other Internet users supplied with the URL in question either willingly gave it out or there is foul play?

I vote for foul play.tMany fantastic 'helpers' that plugin into your browser that even (previously) respected software companies force onto users are to blame for this espionage of detection. The result is quickly know. You are idenitfied and tracked.

At least that is how I have experienced it. The search engine 'recomendations' are one way and not to your benifit. As for the rest. I see there is more bad than good out on the internet at present. So deal with it.

However. robots.txt is good it helps the 'good people' to help themselves and everyone else to be moe efficient and respect privacy. It does not take away that they are many more others out there that want to exploit it.

I chose to deal with the prblem from a claisical point of view. 'If you don't listen, then feel'.
Reply With Quote
  #6 (permalink)  
Old 09-29-2009, 08:27 PM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

One must bear in mind the the web is a public space, not a private one. And, it is a long ago and well established principle that there can be no reasonable expectation of privacy in a public space.

Therefore, if one is desirous of some modicum of privacy in a public space, it is incumbent on that person to take such steps as are possible to shield that which is to go unseen and unheard by the public at large.

The use of robots directives is one method of undertaking such shielding; this will easily at least handle the case of well behaved bots. For those ill behaved, one might consider IP blocking; while not a perfect solution, it is better than none at all.

As in all things, physical or virtual, the maxim is that, when in public, expect no privacy. For that one should remain in private spaces.
Reply With Quote
  #7 (permalink)  
Old 09-30-2009, 08:27 AM
WebProWorld Member
 
Join Date: Mar 2007
Posts: 50
luigip RepRank 1
Default Re: robots.txt an open window?

Quote:
Originally Posted by simonm View Post
I use robots.txt to identify what I don't want on the search engines. But it struck me after an increase in hack attempts
Am I right that these pages are password protected? So these pages are already "protected from the search engines" and there is no need for your robots.txt solution.
__________________
Wetter - Regenradar
Reply With Quote
  #8 (permalink)  
Old 10-01-2009, 04:23 AM
simonm's Avatar
WebProWorld Veteran
 
Join Date: Jul 2003
Location: UK Kent
Posts: 309
simonm RepRank 2
Default Re: robots.txt an open window?

Hi, Some good advice here and yes, I'll put the nofollow / noindex on page rather than robots.txt, should have thought of that.
Yes, pages are password protected and unlinked from the outside world and yes, doing a regular trawl of what the search engines list, they do seem to be able to get into every nook and cranny!
Reply With Quote
  #9 (permalink)  
Old 10-01-2009, 04:33 AM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

Quote:
Originally Posted by simonm View Post
... I'll put the nofollow / noindex on page rather than robots.txt ...
If a bot is not so well behaved as to heed robots directives, why believe that it will heed meta tags?
Reply With Quote
  #10 (permalink)  
Old 10-01-2009, 01:18 PM
inertia's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Apr 2006
Location: Lancaster, UK
Posts: 1,022
inertia RepRank 6inertia RepRank 6inertia RepRank 6inertia RepRank 6inertia RepRank 6inertia RepRank 6inertia RepRank 6
Default Re: robots.txt an open window?

Why would a password protected page be indexed? Bots cant fill in logins. There must be something dodgy going on there!
__________________
Latest Blog Post: Google Consultant - Should this Job Title be Allowed? - Matt Inertia's SEO Blog - SEOers.org

"Carpe diem, seize the day boys, make your lives extraordinary"
- Dead Poets Society
Reply With Quote
  #11 (permalink)  
Old 11-06-2009, 04:11 PM
WebProWorld Member
 
Join Date: Nov 2009
Posts: 53
moneymen RepRank 1
Default Re: robots.txt an open window?

U can do basic auth the pages under basic outh unless u give them permision by entering the login for the basic auth in webmaster tools
__________________
Website protection---Lot of tutorials to help u protect your website,penetration and protecting services and more...
Reply With Quote
  #12 (permalink)  
Old 11-06-2009, 06:05 PM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

Quote:
Originally Posted by moneymen View Post
U can do basic auth the pages under basic outh unless u give them permision by entering the login for the basic auth in webmaster tools
What has Google Webmaster Tools to do with it?
Reply With Quote
  #13 (permalink)  
Old 11-06-2009, 06:45 PM
WebProWorld Member
 
Join Date: Nov 2009
Posts: 53
moneymen RepRank 1
Default Re: robots.txt an open window?

I think i saw option somewhere when u are adding your site to put also user and password for password protected things so google bot can crawl it
__________________
Website protection---Lot of tutorials to help u protect your website,penetration and protecting services and more...
Reply With Quote
  #14 (permalink)  
Old 11-06-2009, 07:18 PM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

Quote:
Originally Posted by moneymen View Post
I think i saw ...
So, you are guessing?

Citation required.

Even were such the case, would have no effect on SEs other than Google.

Blocking Google

Last edited by deepsand; 11-06-2009 at 07:22 PM.
Reply With Quote
  #15 (permalink)  
Old 11-06-2009, 08:13 PM
WebProWorld Member
 
Join Date: Nov 2009
Posts: 53
moneymen RepRank 1
Default Re: robots.txt an open window?

NO am not guessing i saw it

not sure for other SE i was saying about google although it think same works for other ones also since i had a site that was protected that way "private one" and only public parts was index and 2 other files.After full 2 years of existing nothing exchept those 3 public ones is not indexed
and on the link u provided Blocking Google first
first method mentioned htaccess one is the one am talking about-- basic auth
__________________
Website protection---Lot of tutorials to help u protect your website,penetration and protecting services and more...
Reply With Quote
  #16 (permalink)  
Old 11-06-2009, 08:30 PM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

Quote:
Originally Posted by moneymen View Post
I think i saw
Quote:
Originally Posted by moneymen View Post
NO am not guessing i saw it
Not what you first said. "I think I saw" and "I saw" are two quite different things.

Quote:
Originally Posted by moneymen View Post
U can do basic auth the pages under basic outh unless u give them permision by entering the login for the basic auth in webmaster tools
Quote:
Originally Posted by moneymen View Post
... on the link u provided Blocking Google first
first method mentioned htaccess one is the one am talking about-- basic auth
Again, not what you first said. htaccess is not in Webmaster Tools!

It is important that one both precise and sufficiently complete when speaking.
Reply With Quote
  #17 (permalink)  
Old 11-06-2009, 08:59 PM
WebProWorld Member
 
Join Date: Nov 2009
Posts: 53
moneymen RepRank 1
Default Re: robots.txt an open window?

First there is a part of text missing from my first post i don't know why

the full one was

U can do basic auth of the pages, pages under basic auth can't be crawled unless u give them permision by entering the login for the basic auth in webmaster tools

maybe that is what confused u since i didn't said that u are seting up basic auth in webmaster tools
and later i said that u posted link to a resource where first method explained is the one am talking about basic auth = htaccess password protection = popup login
__________________
Website protection---Lot of tutorials to help u protect your website,penetration and protecting services and more...
Reply With Quote
  #18 (permalink)  
Old 11-06-2009, 09:13 PM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

Quote:
Originally Posted by moneymen View Post
U can do basic auth of the pages, pages under basic auth can't be crawled unless u give them permision by entering the login for the basic auth in webmaster tools

maybe that is what confused u since i didn't said that u are seting up basic auth in webmaster tools
To the contrary, that is precisely what you said.

"u give them permision by entering the login for the basic auth in webmaster tools"

One does not do anything "in Webmaster Tools."

Quote:
Originally Posted by moneymen View Post
... and later i said that u posted link to a resource where first method explained is the one am talking about basic auth = htaccess password protection = popup login
That does not change the fact that you first mis-spoke.

As previously noted, precision of speech is very important.
Reply With Quote
  #19 (permalink)  
Old 11-06-2009, 09:42 PM
WebProWorld Member
 
Join Date: Nov 2009
Posts: 53
moneymen RepRank 1
Default Re: robots.txt an open window?

Are u an english teacher ?
I think its pretty clear am not explaining to a baby am i ?

And when u take half of the statement from the sentence it gets different meaning so if u quote something quote all mr.eng teacher


p.s. point of the thread was to help him and not to corect things i say to first grade english its clear enough
i don't know what is wrong with some people here they will turn whole subject in different way only so they can get into debate with someone maybe admins should make new section called debating room if people here are so horny on debates
__________________
Website protection---Lot of tutorials to help u protect your website,penetration and protecting services and more...

Last edited by moneymen; 11-06-2009 at 09:46 PM.
Reply With Quote
  #20 (permalink)  
Old 11-06-2009, 10:24 PM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

That you know what you intend to communicate is not sufficient for the understanding of others.

One cannot help when what one says is not clearly stated.

Quote:
I think its pretty clear am not explaining to a baby am i ?
Having reviewed some of your posts elsewhere as well, it is abundantly clear as to who is being puerile.
Reply With Quote
  #21 (permalink)  
Old 11-06-2009, 10:50 PM
WebProWorld Member
 
Join Date: Nov 2009
Posts: 53
moneymen RepRank 1
Default Re: robots.txt an open window?

ya because am in hostile environment
__________________
Website protection---Lot of tutorials to help u protect your website,penetration and protecting services and more...
Reply With Quote
  #22 (permalink)  
Old 11-06-2009, 11:26 PM
deepsand's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2004
Location: Philadelphia, PA
Posts: 3,288
deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9deepsand RepRank 9
Default Re: robots.txt an open window?

Any appearance of hostility is of your own making.
Reply With Quote
Reply

  WebProWorld > Webmaster, IT and Security Discussion > Internet Security Discussion Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
Can Flash files open up in own window Photoartomation Graphics & Design Discussion Forum 6 09-17-2009 11:55 AM
How to open it in a new window alexcl Google AdWords/Google AdSense 12 11-01-2007 01:39 AM
How to open a link in a new window while being valid html nseidm1 Graphics & Design Discussion Forum 3 01-16-2007 10:19 PM
Discussion: Avoid forcing links to open in a new window Webnauts Graphics & Design Discussion Forum 0 01-20-2005 03:37 AM
Links to open in a new window ? malcolm Google AdWords/Google AdSense 14 12-12-2003 02:45 AM


All times are GMT -4. The time now is 09:58 AM.



Search Engine Optimization by vBSEO 3.3.0