Google Discussion Forum: for topics specifically related to Google. There is a subforum dedicated to AdSense/AdWords subjects.
My scenario is that bots are already disallowed from visiting the things they are not supposed to access. The problem is that there may be LINKS to such threads (for example, links to private threads in the forum's "recent posts" section) or, more interestingly, links behind the various 'Reply', 'Add Post' and similar buttons.
Search bots see such buttons and attempt to follow the related links, which results in errors similar to those we can see. I don't really like serving different content (i.e. some sort of conditional code that would remove such buttons for search bots), especially since some software sets custom or empty user agents that look similar to some bots. In such cases it might be a user who wants to protect his privacy and hide browser information, and we need to ensure that such users can still see all those 'add post' buttons. With that in mind, I wanted to check whether there is a good way, in terms of SEO, to specify that this or that link (meaning links to private threads, or buttons that bring the user to the 'add post' page) is not meant to be crawled. Please share your thoughts on this.
|
||||
|
It seems to me that you could use the nofollow attribute ...
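To make that concrete, here is a minimal sketch of a template helper that emits action links carrying rel="nofollow". It is in Python only because the thread never names the forum software; the function name `action_link` and the URL `newreply.php?t=123` are made up for illustration. Every visitor gets the same markup, so no user-agent sniffing is involved, but keep in mind that nofollow is a hint to crawlers, not access control.

```python
from html import escape


def action_link(href: str, label: str) -> str:
    """Return an anchor tag that asks crawlers not to follow the link."""
    # escape() quotes &, <, > and quote characters, so both values are
    # safe to place inside the attribute and the element body.
    return f'<a href="{escape(href)}" rel="nofollow">{escape(label)}</a>'


if __name__ == "__main__":
    # Hypothetical 'Add Post' button URL, not taken from the thread.
    print(action_link("newreply.php?t=123", "Add Post"))
```

The same attribute can be written by hand in whatever template renders the 'Reply' and 'Add Post' buttons; the helper only keeps it consistent.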
__________________
M.-J. Taylor SEO Web Design by Cyber Key Search Smart Design® SEO Copywriter & Traveling Vacation Gypsy |
|
|||
|
I would include the noindex,follow version of the META tag in such pages. However, the bots get redirected to 304 BEFORE getting to those inaccessible pages (thus increasing our 304 statistics). That's why I asked whether there is a way to specify that EXACT links should not be FOLLOWED by bots.
|
|
|||
|
For crawlers, meta tag attributes (nofollow) and robots.txt are both there to control their activity. But something to specify EXACT links... I can't think of anything. Let me know if you do find something.
|
|
||||
|
You can probably use a server side referrer script.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Depending on how selective you want to be, here are my options:
|
|
||||
|
You should include a robots.txt file to tell the good bots not to visit your reply links.
The only problem is that only the good bots will follow the rules you set. I suggest you also install Bad Behavior: http://www.ioerror.us/software/bad-behavior It can be installed with any PHP script and doesn't need the WordPress database that it uses on WP.
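As a rough sketch of what such rules could look like and how to check them before deploying, the snippet below feeds a sample robots.txt to Python's standard `urllib.robotparser`. The paths `/newreply.php` and `/private/` and the example.com URLs are placeholders, not taken from this thread; substitute whatever your forum software uses for its reply links.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: the paths are placeholders for the forum's real
# reply and private-thread URLs.
ROBOTS_TXT = """\
User-agent: *
Disallow: /newreply.php
Disallow: /private/
"""


def allowed(user_agent: str, url: str) -> bool:
    """Report whether the rules above let user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "http://example.com/newreply.php?t=123",
        "http://example.com/private/thread-1",
        "http://example.com/showthread.php?t=123",
    ):
        print(url, allowed("Googlebot", url))
```

This only tells you how a well-behaved parser reads the file. As noted above, bots that ignore robots.txt are unaffected by it.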
__________________
--- * SLMR v2.0 * Have many Nice days |
|
||||
|
Quote:
|
||||
|
Hey, that sounds cool. I usually do that with PHP. Can you explain how that can work?
Last edited by Webnauts; 02-21-2008 at 04:36 PM.
|
|||
|
Your .htaccess file is a ticket to solving a lot of your search engine problems. Not all of them, but some of them. It's a file on your server that tells the web server how to handle requests from browsers and search engine spiders. One common use of this file is to redirect old web pages to new web pages. Webmasters frequently update their information and, in doing so, change the URL of a web page. If you do that, the old web page is still indexed, and people who try to visit it will get a 404 error page. To prevent that from happening, you can add a 301 redirect command to your .htaccess to send traffic to your new page.
|
|
||||
|
Quote:
The question once again: How can someone disallow crawling an individual page with the .htaccess file?
|
||||
|
Quote:
Quote:
Take your time, read, reread and enjoy your increased security: Network and System Administration I 10
__________________
Mini Network:: Financial information at your fingertips Learn object oriented programming where it started Last edited by kgun; 02-21-2008 at 06:54 PM. |
|
||||
|
Kjell, thanks for the valuable information, but be aware that what you are saying is nothing new to me.
Our member Torpengkute said that we can disallow crawling pages with .htaccess, and I am asking him or anybody else here how that can work! I use a rule in my .htaccess files to make sure that my robots.txt file will not get indexed and won't show up in the search results, as happens here: http://www.whitehouse.gov/robots.txt I do that because I do not want to experience PR dilution as they do, since their robots.txt has a PR of 4. And please don't start now with "John, they do not need to care." It is just an example. Hey, this is my 6,000th post here at WebProWorld.
Last edited by Webnauts; 02-22-2008 at 07:31 AM.
|
|||
|
6,000 posts. John, get outside and smell the flowers!
M
__________________
Non Piercing Nipple Jewelry - All the pleasure and none of the pain! - Body Jewelry |
|
||||
|
Quote:
Updated .htaccess to Block spammers / scammers. You find related threads by the following WPW site search:
So to conclude: start by using server-side configuration files like .htaccess. Then go on to robots.txt to be more specific, then meta tags... And it has one great advantage: it saves bandwidth and load on your web server if you can do it in web server configuration files. Finally, it is more secure.
Last edited by kgun; 02-22-2008 at 08:44 AM.
|
||||
|
OK Kgun. I think you did not get my point.
The user was claiming that we can use .htaccess rules to stop crawlers from indexing and following pages. Individual pages should be accessible to users, but not to certain crawlers, e.g. Googlebot. It looks like he gave up, as I assume he could not provide us with any examples. To be honest, I use several server-side scripting or configuration methods to achieve all of the above, but I want to hear how others do it. Got it now? So can you please post an example of code here and stop referring to other threads, etc.? Thanks bro.
|
||||
|
Point taken.
Save bandwidth and choose the most secure solution where you have an option.
|
||||
|
Quote:
If-Modified-Since = "If-Modified-Since" ":" HTTP-date is the most appropriate solution. By the way, do you think this could stop crawlers from indexing a page?
<FilesMatch "example\.html">
Header set X-Robots-Tag "noindex"
</FilesMatch>
I think I have to test that myself anyway. But just asking...
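For anyone who wants to run that test, here is a small sketch that checks whether a page really comes back with an X-Robots-Tag noindex directive. Two caveats: `Header set` relies on Apache's mod_headers being enabled, and a crawler can only see the header if it is allowed to fetch the page in the first place. The function names and the example URL are assumptions for illustration, and the parser deliberately handles only the plain comma-separated form, not values prefixed with a specific bot name.

```python
from urllib.request import Request, urlopen


def blocks_indexing(x_robots_tag):
    """True if an X-Robots-Tag header value contains a noindex directive."""
    if not x_robots_tag:
        return False
    # Directives are comma separated and case-insensitive, e.g. "noindex, nofollow".
    directives = [part.strip().lower() for part in x_robots_tag.split(",")]
    return "noindex" in directives


def check(url):
    """Request only the headers of url and report whether indexing is blocked."""
    request = Request(url, method="HEAD")
    with urlopen(request, timeout=10) as response:
        return blocks_indexing(response.headers.get("X-Robots-Tag"))


if __name__ == "__main__":
    # Placeholder URL; point this at the page matched by the FilesMatch rule.
    print(check("http://www.example.com/example.html"))
```

This confirms the server is sending the header. Whether a given crawler honors it is up to that crawler.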
|
|||
|
Quote:
As you said, "To be honest I use several server side scripting or configuration methods to achieve all above." So why do you want to hear other suggestions? It means you did not really know that. Sorry John, before, I wanted to help you with how to do that, but... anyway, good luck to you, bro!
|
||||
|
Quote:
I am sure that if someone (you) becomes an Apache server expert, you will soon laugh at the spammers, hackers and intruders. Related links: Apache Tutorial: .htaccess files Documentation: Apache HTTP Server - The Apache HTTP Server Project 30 minutes of training each day, and you can offer network security services in addition to SEO services. And John, you should set up your own paid extranet.
Last edited by kgun; 02-22-2008 at 02:11 PM.
|
||||
|
Quote:
You wanted to help? Did you all hear that? (Sorry for my arrogance, but that was honestly too funny.) If you follow the link to the other thread Kgun posted, you will see that I am not telling stories. And the mods and other older members here can confirm that I am not a guy who tells stories here, but facts! So if what you are claiming is true, show your facts or just give up the theories, as they do not help anybody here. OK?
Last edited by Webnauts; 02-22-2008 at 03:06 PM.
|
||||
|
Quote:
"If the facts don't fit the theory, change the facts."
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Prevent referrer spam, block bad bots from stealing bandwith | kgun | Internet Security Discussion Forum | 0 | 04-04-2007 08:08 AM |
| Google is not crawling the links in the index page | bobkom | Google Discussion Forum | 1 | 09-12-2005 05:28 PM |
| about not crawling links pages | bobkom | Google Discussion Forum | 0 | 09-08-2005 10:38 AM |
| Shallow Bots - Deep Bots? | jonathan-uk | Google Discussion Forum | 1 | 02-01-2004 09:32 PM |
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved Privacy Policy and Legal iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509 |