|
|
||||||
|
||||||
| Index Link To US Private Messages Archive FAQ RSS | ||||||
| Web Programming Discussion Forum Working with an API? Developing a plugin? Writing a Mod or script for your favorite blog, Web 2.0 site or Forum? Welcome. |
Share Thread: & Tags
|
||||
|
![]() |
|
|
LinkBack | Thread Tools | Display Modes |
|
|||
|
1. Do a web search e.g. (www.dogpile.com) on the term: NBX NetSet
2. Now pick out ALL the pages that show something like this: NBX NetSet Version: RX_X_Xx Created: Xxx xx XXXX, Now, you've been successful in finding user companies that assign an public IP address to their corporate telephone systems. Besides being a really dumb thing to do- assigning a public IP to their system- the manufacturer isn't too bright either. Isn't there some simple html that could prevent the page you see (NBX NetSet splash page) from being indexed by the so called Webots or Webcrawlers ? I know the obvious answer is not to assign a public IP to the box and put the box behind a firewall- which the user/customers will open up port 80/html traffic to the box...so wouldn't the Webots/Webcrawlers still index the page ? What do you think ? thank you |
|
|||
|
There are two common ways to prevent spiders from indexing your data, excluding them with robots.txt and excluding them with meta data. Both methods are described here.
For bots that misbehave by disregarding robots.txt and the meta instructions, you can try this method to trap them and automatically ban them. I hope that was what you were asking... Good luck. |
|
||||
|
Code:
<meta name="robot" content="noindex,follow">
__________________
http://www.usalug.org USA Linux Users Group usalug.org is an online forum for Linux users. |
![]() |
|
| Thread Tools | |
| Display Modes | |
|
|
|
WebProWorld |
Advertise |
Contact Us |
About |
Forum Rules |
MVP's |
Archive |
Newsletter Archive |
Top |
WebProNews
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved Privacy Policy and Legal iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509 |