|
|
||||||
|
||||||
| Index Link To US Private Messages Archive FAQ RSS | ||||||
| Search Engine Optimization Forum SEO is much easier with help from peers and experts! The WebProWorld SEO forum is for the discussion and exploration of various search engine optimization topics. Any non (engine) specific SEO or SEM topics should go here. |
Share Thread: & Tags
|
||||
|
![]() |
|
|
LinkBack | Thread Tools | Display Modes |
|
||||
|
Hi All,
When i want to see the internal links of SEO Workers Search Engine Optimization Consulting Company through dead-links.com and also with XENU It is giving following error Dead-links.com shows this error SEO Workers Search Engine Optimization Consulting Company 403 Forbidden XENU shows that No such host Regards Subhzash
__________________
http://hipaacompliancesoftware.net/ |
|
||||
|
Probably that implies that that bot is blocked.
|
|
||||
|
To be specific here is the rule which can be helpful for everyone else too: Code:
RewriteEngine on
RewriteBase /
RewriteCond %{HTTP_USER_AGENT} ADSARobot|ah-ha|almaden|aktuelles|Anarchie|amzn_assoc|ASPSeek|ASSORT|ATHENS|Atomz|attach|attache|autoemailspider|BackWeb|Bandit|BatchFTP|bdfetch|big.brother|BlackWidow|bmclient|Boston\ Project|BravoBrian\ SpiderEngine\ MarcoPolo|Bot\ mailto:craftbot@yahoo.com|Buddy|Bullseye|bumblebee|capture|CherryPicker|ChinaClaw|CICC|clipping|Collector|Copier|Crescent|Crescent\ Internet\ ToolPak|Custo|cyberalert|Deweb|diagem|Digger|Digimarc|DIIbot|DISCo|DISCo\ Pump|DISCoFinder|Download\ Demon|Download\ Wonder|Downloader|Drip|DSurf15a|DTS.Agent|EasyDL|eCatch|ecollector|efp@gmx\.net|Email\ Extractor|EirGrabber|email|EmailCollector|EmailSiphon|EmailWolf|Express\ WebPictures|ExtractorPro|EyeNetIE|FavOrg|fastlwspider|Favorites\ Sweeper|Fetch|FEZhead|FileHound|FlashGet\ WebWasher|FlickBot|fluffy|FrontPage|GalaxyBot|Generic|Getleft|GetRight|GetSmart|GetWeb!|GetWebPage|gigabaz|Girafabot|Go\!Zilla|Go!Zilla|Go-Ahead-Got-It|GornKer|gotit|Grabber|GrabNet|Grafula|Green\ Research|grub-client|Harvest|hhjhj@yahoo|hloader|HMView|HomePageSearch|http\ generic|HTTrack|httpdown|httrack|ia_archiver|IBM_Planetwide|Image\ Stripper|Image\ Sucker|imagefetch|IncyWincy|Indy*Library|Indy\ Library|informant|Ingelin|InterGET|Internet\ Ninja|InternetLinkagent|Internet\ Ninja|InternetSeer\.com|Iria|Irvine|JBH*agent|JetCar|JOC|JOC\ Web\ Spider|JustView|KWebGet|Lachesis|larbin|Leacher|LeechFTP|LexiBot|lftp|libwww|likse|Link|Link*Sleuth|LINKS\ ARoMATIZED|LinkWalker|LWP|lwp-trivial|Mag-Net|Magnet|Mac\ Finder|Mag-Net|Mass\ Downloader|MCspider|MJ12bot/v1\.0\.8|Memo|Microsoft.URL|MIDown\ tool|Mirror|Missigua\ Locator|Mister\ PiX|MMMtoCrawl\/UrlDispatcherLLL|^Mozilla$|Mozilla.*Indy|Mozilla.*NEWT|Mozilla*MSIECrawler|MS\ FrontPage*|MSFrontPage|MSIECrawler|MSProxy|multithreaddb|nationaldirectory|Navroad|NearSite|NetAnts|NetCarta|NetMechanic|netprospector|NetResearchServer|NetSpider|Net\ Vampire|NetZIP|NetZip\ Downloader|NetZippy|NEWT|NICErsPRO|Ninja|NPBot|Octopus|Offline\ Explorer|Offline\ Navigator|OpaL|Openfind|OpenTextSiteCrawler|OrangeBot|PageGrabber|Papa\ Foto|PackRat|pavuk|pcBrowser|PersonaPilot|Ping|PingALink|Pockey|Proxy|psbot|PSurf|psycheclone|puf|Pump|PushSite|QRVA|RealDownload|Reaper|Recorder|ReGet|replacer|RepoMonkey|Robozilla|Rover|RPT-HTTPClient|Rsync|Scooter|SearchExpress|searchhippo|searchterms\.it|Second\ Street\ Research|Seeker|Shai|Siphon|sitecheck|sitecheck.internetseer.com|SiteSnagger|SlySearch|SmartDownload|snagger|Snake|SpaceBison|Spegla|SpiderBot|sproose|SqWorm|Stripper|Sucker|SuperBot|SuperHTTP|Surfbot|SurfWalker|Szukacz|tAkeOut|tarspider|Teleport\ Pro|Templeton|TrueRobot|TV33_Mercator|UIowaCrawler|UtilMind|URLSpiderPro|URL_Spider_Pro|Vacuum|vagabondo|vayala|visibilitygap|VoidEYE|vspider|Web\ Downloader|w3mir|Web\ Data\ Extractor|Web\ Image\ Collector|Web\ Sucker|Wweb|WebAuto|WebBandit|web\.by\.mail|Webclipping|webcollage|webcollector|WebCopier|webcraft@bea|webdevil|webdownloader|Webdup|WebEMailExtrac|WebFetch|WebGo\ IS|WebHook|Webinator|WebLeacher|WEBMASTERS|WebMiner|WebMirror|webmole|WebReaper|WebSauger|Website|Website\ eXtractor|Website\ Quester|WebSnake|Webster|WebStripper|websucker|webvac|webwalk|webweasel|WebWhacker|WebZIP|Wget|Whacker|whizbang|WhosTalking|Widow|WISEbot|WWWOFFLE|x-Tractor|^Xaldon\ WebSpider|WUMPUS|Xenu|XGET|Zeus.*Webster|Zeus [NC]
RewriteRule ^.* - [F,L]
Code:
SetEnvIfNoCase User-Agent "8484 Boston Project v 1.0" bad_bot SetEnvIfNoCase User-Agent "charlotte/" bad_bot SetEnvIfNoCase User-Agent "curl/7.15.5 (i686-redhat-linux-gnu) libcurl/7.15.5 OpenSSL/0.9.8b zlib/1.2.3 libidn/0.6.5" bad_bot SetEnvifNoCase User-Agent "ISC Systems iRc Search 2.1" bad_bot SetEnvIfNoCase User-Agent "^Jakarta\ Commons-HttpClient/" bad_bot SetEnvIfNoCase User-Agent "Java 1.5 / IBM HTML Commons" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.1_01" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.1_04" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_01" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_02" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_03" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_04" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_05" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_07" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_08" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_09" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_10" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_12" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_13" bad_bot SetEnvIfNoCase User-Agent "Java/1.4.2_16" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0-p3" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_01" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_02" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_03" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_04" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_05" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_06" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_07" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_08" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_09" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_10" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_11" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_12" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_13" bad_bot SetEnvIfNoCase User-Agent "Java/1.5.0_14" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0-beta" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0-beta2" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0-beta" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0-beta2" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0-dp" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0-oem" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0-rc" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0_01" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0_01-ea" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0_02" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0_03" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0_04" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0_05" bad_bot SetEnvIfNoCase User-Agent "Java/1.6.0_06" bad_bot SetEnvIfNoCase User-Agent "Java1.1.8" bad_bot SetEnvIfNoCase User-Agent "Java1.2.2" bad_bot SetEnvIfNoCase User-Agent "Java1.3.0" bad_bot SetEnvIfNoCase User-Agent "Java1.3.1" bad_bot SetEnvIfNoCase User-Agent "Java1.3.1_07" bad_bot SetEnvIfNoCase User-Agent "Java1.3.1_18" bad_bot SetEnvIfNoCase User-Agent "Java1.4.0" bad_bot SetEnvIfNoCase User-Agent "Java1.4.0_01" bad_bot SetEnvIfNoCase User-Agent "libwww-perl/" bad_bot SetEnvIfNoCase User-Agent "^libcurl-agent/" bad_bot SetEnvIfNoCase User-Agent "^Microsoft\ URL\ Control.*$" bad_bot SetEnvIfNoCase User-Agent "MJ12bot/v1.0.8" bad_bot SetEnvIfNoCase User-Agent "^Missigua" bad_bot SetEnvIfNoCase User-Agent "^Mozilla/4\.0\ .*Win\ 9x\ 4\.90.*$" bad_bot SetEnvIfNoCase User-Agent "Nutch" bad_bot SetEnvIfNoCase User-Agent "phpversion" bad_bot SetEnvIfNoCase User-Agent "TencentTraveler" bad_bot SetEnvIfNoCase User-Agent "^Web Downloader" bad_bot <FilesMatch "(.*)"> Order Allow,Deny Allow from all Deny from env=bad_bot </FilesMatch> Code:
User-agent: nutch Disallow: / User-Agent: OmniExplorer_Bot Disallow: / User-agent: MJ12bot Disallow: / User-agent: Bitacle bot/1.1 Disallow: / User-agent: Bitacle bot Disallow: / User-agent: Bitacle * Disallow: / User-agent: Bitacle* Disallow: / User-agent: Bitacle Disallow: / The ones we could not block so far, we let Distributed Spam Harvester Tracking Network | Project Honey Pot to catch them. Any further problems please? Thanks for trying to support SEO Workers. Great promotion attempt!!!
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO Last edited by Webnauts; 05-29-2008 at 12:19 PM. |
|
||||
|
I just wanted to bring to your consideration that I just found some bots I did not manage to exclude so far.
I updated my files and the same time I updated my post above too. Stay tuned.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
I updated this moment again. I added all Java bots I could find, and others too now.
I can't express with words how much bandwidth it saved us so far. And how much spam is reduced. Here you can follow up the most recent results: Statistics for seoworkers.com (2008-05) Try it and you will be amazed too buddy.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Wige thanks for the tip in your PM.
I just replaced the all SetEnvIfNoCase for the Java User-Agents with the one you advised me Code:
BrowserMatch "^Java/?[1-9_\.]*" bad_bot
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Quote:
So here are the security updated settings in my .htaccess so far: Code:
### Protect against DOS attacks by limiting file upload size ### LimitRequestBody 10240000 Code:
### Prevent .htaccess and .htpasswd files from being viewed by web clients ### <Files "^\.ht"> Order allow,deny Deny from all </Files> Code:
RewriteEngine on
RewriteBase /
RewriteCond %{HTTP_USER_AGENT} ADSARobot|ah-ha|almaden|aktuelles|Anarchie|amzn_assoc|Arachmo|ASPSeek|ASSORT|ATHENS|Atomz|attach|attache|autoemailspider|BackWeb|Bandit|BatchFTP|bdfetch|BecomeBot|big.brother|BlackWidow|bmclient|Boston\ Project|bot/1.0|BravoBrian\ SpiderEngine\ MarcoPolo|Bot\ mailto:craftbot@yahoo.com|Buddy|Bullseye|bumblebee|capture|CherryPicker|ChinaClaw|CICC|clipping|Clushbot|Collector|Copier|Crescent|Crescent\ Internet\ ToolPak|Custo|cyberalert|Deweb|diagem|Digger|Digimarc|DIIbot|DISCo|DISCo\ Pump|DISCoFinder|Download\ Demon|Download\ Wonder|Downloader|Drip|DSurf15a|DTS.Agent|EasyDL|eCatch|ecollector|efp@gmx\.net|Email\ Extractor|EirGrabber|email|EmailCollector|EmailSiphon|EmailWolf|Express\ WebPictures|ExtractorPro|EyeNetIE|FavOrg|fastlwspider|Favorites\ Sweeper|Fetch|FEZhead|FileHound|FlashGet\ WebWasher|FlickBot|fluffy|FrontPage|GalaxyBot|Generic|Getleft|GetRight|GetSmart|GetWeb!|GetWebPage|gigabaz|Gigabot|Girafabot|Go\!Zilla|Go!Zilla|Go-Ahead-Got-It|GornKer|gotit|Grabber|GrabNet|Grafula|Green\ Research|grub-client|Harvest|hhjhj@yahoo|hloader|HMView|HomePageSearch|http\ generic|HTTrack|httpdown|httrack|ia_archiver|IBM_Planetwide|Image\ Stripper|Image\ Sucker|imagefetch|IncyWincy|Indy*Library|Indy\ Library|informant|Ingelin|InterGET|Internet\ Ninja|InternetLinkagent|Internet\ Ninja|InternetSeer\.com|Iria|Irvine|JBH*agent|JetCar|JOC|JOC\ Web\ Spider|JustView|kalooga|KWebGet|Lachesis|larbin|Leacher|LeechFTP|LexiBot|lftp|libwww|likse|Link|Link*Sleuth|LINKS\ ARoMATIZED|LinkWalker|LWP|lwp-trivial|Mag-Net|Magnet|Mac\ Finder|Mag-Net|Mass\ Downloader|MCspider|MJ12bot/v1\.0\.8|Memo|Microsoft.URL|MIDown\ tool|Mirror|Missigua\ Locator|Mister\ PiX|MMMtoCrawl\/UrlDispatcherLLL|^Mozilla$|Mozilla.*Indy|Mozilla.*NEWT|Mozilla*MSIECrawler|MS\ FrontPage*|MSFrontPage|MSIECrawler|MSProxy|MSR-ISRCCrawler|multithreaddb|my-heritrix-crawler|nationaldirectory|Navroad|NearSite|NetAnts|NetCarta|NetMechanic|netprospector|NetResearchServer|NetSpider|Net\ Vampire|NetZIP|NetZip\ Downloader|NetZippy|NEWT|NICErsPRO|Ninja|NPBot|NicheBot|noxtrumbot|Octopus|Offline\ Explorer|Offline\ Navigator|OpaL|Openfind|OpenTextSiteCrawler|OrangeBot|PageGrabber|Papa\ Foto|PackRat|pavuk|pcBrowser|PersonaPilot|Ping|PingALink|Pingdom|Pockey|POE-Component-Client-HTTP|Proxy|psbot|PSurf|psycheclone|puf|Pump|PushSite|QRVA|RealDownload|Reaper|Recorder|ReGet|replacer|RepoMonkey|Robozilla|Rover|RPT-HTTPClient|Rsync|Scooter|SearchExpress|searchhippo|searchterms\.it|Second\ Street\ Research|Seeker|Shai|Siphon|sitecheck|sitecheck.internetseer.com|SiteSnagger|SlySearch|SmartDownload|snagger|Snake|SpaceBison|Spegla|SpiderBot|sproose|SqWorm|Stripper|Sucker|SuperBot|SuperHTTP|Surfbot|SurfWalker|Szukacz|tAkeOut|tarspider|Teleport\ Pro|Templeton|TrueRobot|TV33_Mercator|UIowaCrawler|UtilMind|URLSpiderPro|URL_Spider_Pro|Vacuum|vagabondo|vayala|visibilitygap|VoidEYE|vspider|Web\ Downloader|w3mir|Web\ Data\ Extractor|Web\ Image\ Collector|Web\ Sucker|Wweb|WebAuto|WebBandit|web\.by\.mail|Webclipping|webcollage|webcollector|WebCopier|webcraft@bea|webdevil|webdownloader|Webdup|WebEMailExtrac|WebFetch|WebGo\ IS|WebHook|Webinator|WebLeacher|WEBMASTERS|WebMiner|WebMirror|webmole|WebReaper|WebSauger|Website|Website\ eXtractor|Website\ Quester|WebSnake|Webster|WebStripper|websucker|webvac|webwalk|webweasel|WebWhacker|WebZIP|Wget|Whacker|whizbang|WhosTalking|Widow|WISEbot|WWWOFFLE|x-Tractor|^Xaldon\ WebSpider|WUMPUS|Xenu|XGET|Yeti|zermelo|Zeus.*Webster|Zeus [NC]
RewriteRule ^.* - [F,L]
Code:
### Deny Fake Bots ### BrowserMatch "^Java/?[1-9_\.]*" bad_bot SetEnvIfNoCase User-Agent "8484 Boston Project v 1.0" bad_bot SetEnvIfNoCase User-Agent "charlotte/" bad_bot SetEnvIfNoCase User-Agent "curl/7.15.5 (i686-redhat-linux-gnu) libcurl/7.15.5 OpenSSL/0.9.8b zlib/1.2.3 libidn/0.6.5" bad_bot SetEnvifNoCase User-Agent "ISC Systems iRc Search 2.1" bad_bot SetEnvIfNoCase User-Agent "^Jakarta\ Commons-HttpClient/" bad_bot SetEnvIfNoCase User-Agent "libwww-perl/" bad_bot SetEnvIfNoCase User-Agent "^libcurl-agent/" bad_bot SetEnvIfNoCase User-Agent "^Microsoft\ URL\ Control.*$" bad_bot SetEnvIfNoCase User-Agent "MJ12bot/v1.0.8" bad_bot SetEnvIfNoCase User-Agent "^Missigua" bad_bot SetEnvIfNoCase User-Agent "^Mozilla/4\.0\ .*Win\ 9x\ 4\.90.*$" bad_bot SetEnvIfNoCase User-Agent "Nutch" bad_bot SetEnvIfNoCase User-Agent "phpversion" bad_bot SetEnvIfNoCase User-Agent "TencentTraveler" bad_bot SetEnvIfNoCase User-Agent "^Web Downloader" bad_bot <FilesMatch "(.*)"> Order Allow,Deny Allow from all Deny from env=bad_bot </FilesMatch> I was thinking also to block proxy servers, but I am not sure if that would have been a good idea. What do you think about this? Code:
### Block proxy servers from site access ###
RewriteCond %{HTTP:VIA} !^$ [OR]
RewriteCond %{HTTP:FORWARDED} !^$ [OR]
RewriteCond %{HTTP:USERAGENT_VIA} !^$ [OR]
RewriteCond %{HTTP:X_FORWARDED_FOR} !^$ [OR]
RewriteCond %{HTTP:PROXY_CONNECTION} !^$ [OR]
RewriteCond %{HTTP:XPROXY_CONNECTION} !^$ [OR]
RewriteCond %{HTTP:HTTP_PC_REMOTE_ADDR} !^$ [OR]
RewriteCond %{HTTP:HTTP_CLIENT_IP} !^$
RewriteRule ^(.*)$ - [F]
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO Last edited by Webnauts; 05-29-2008 at 01:49 PM. |
|
||||
|
Quote:
Ideally it blocks all non-Norwegian Ip's. Not a single attack in months.
__________________
Mini Network:: Financial information at your fingertips Learn object oriented programming where it started Last edited by kgun; 05-29-2008 at 01:58 PM. |
|
||||
|
Quote:
Is there also an ideal solution to block all India IPs?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Quote:
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Quote:
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
May be. John try.
|
|
||||
|
Quote:
Additional information here: DigitalStart.net: The starting point for English speaking surfers and webmasters |
|
||||
|
I do not need to try. If I come through a Norwegian proxy server to your forums, I will have access.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Quote:
Quote:
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Well, already I have seen a distinct drop in bandwidth usage today. I am using the BrowserMatch "whatever" bad_bot method, because I can set this in the server's configuration and apply it to all of the virtual hosts, which I could not get working with the RewriteRule for some reason. Probably had a conflict somewhere.
__________________
The best way to learn anything, is to question everything. |
|
||||
|
Quote:
By the way I would like to ask you probably use Skype or IM? And do you speak Perl?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Quote:
Not really. I dont think still some thing is wrong in your website. But my main intension is to prove my self by finding bugs in your site. Some times your words really worst and those hurt me. Anyway not a problem...But still i do some efforts to find bugs in your site. Regards Subhzash
__________________
http://hipaacompliancesoftware.net/ |
|
||||
|
do you detect spammers as easy as that?
__________________
Hawaii Events|Oahu Events|Honolulu Events |led signs|outdoor led sign |
|
||||
|
Quote:
Quote:
If you mean spambots, I use a database to log visits so I can track how popular different products are, and to allow certain customization of the site for visitors. I tweaked the logging system to spot user agents that are only on a few IP addresses, or that visit on unusual rates, and from that I can spot both new browsers (for example, an upswing in traffic from mobile browsers) and spam bots for blocking. If you are referring to forum and form spam, a careful approach to security and input validation, combined with a system that tracks unexpected user input and blocks the source, can go a long way toward eliminating the problem.
__________________
The best way to learn anything, is to question everything. |
|
||||
|
Quote:
Can we be friends now?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Wige are you sure GigaBot is a spambot? I made a research and I found out that it is the bot of the search engine Gigablast.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Quote:
Even the old well established ex Norwegian SE FAST - Enterprise Search now owned by Microsoft was blocked by some members.
__________________
Mini Network:: Financial information at your fingertips Learn object oriented programming where it started Last edited by kgun; 06-01-2008 at 12:03 AM. |
|
||||
|
Quote:
Off-topic: My online support is active at the moment.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO Last edited by Webnauts; 06-01-2008 at 12:27 AM. |
|
||||
|
Quote:
Thanks a lot Regards Subhzash
__________________
http://hipaacompliancesoftware.net/ |
|
||||
|
I hope dead-links.com is not bad bot. I use to list out URLs of the website which have less than 250 pages.
Why do you stop dead-links.com to crawl the website??
__________________
http://hipaacompliancesoftware.net/ |
|
||||
|
Quote:
What is wrong with that? By the way a great news! Where did the bad bots go? Statistics for seoworkers.com (2008-06)
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
Quote:
__________________
http://hipaacompliancesoftware.net/ |
|
||||
|
Quote:
Quote:
Yeah, I noticed some even recommend blocking Google's Mobile Transcoder, a human operated (thus not a bot) service that converts standard web sites for mobile users, because it doesn't obey robots.txt.
__________________
The best way to learn anything, is to question everything. |
|
||||
|
I don't kow if you use this Help : when you search that forum.
I think you find it in a thread with the following KW's ban a country using htaccess Here is the thread: WebmasterWorld Login "i want to banned certain ip adresses all coming from: UserAgent: FAST-WebCrawler/3.6 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp) IP address: 66.77.73.151 i get way to many visits from this spiderbot" My bolding. Note the answer (last post) from heini: "Just a quick and offtopic addition: you are about to block the spider of one of the few big worldwide search engines".
__________________
Mini Network:: Financial information at your fingertips Learn object oriented programming where it started Last edited by kgun; 06-03-2008 at 06:12 AM. |
|
||||
|
Quote:
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO |
|
||||
|
What do you mean by best?
I started a new thread about this important topic here: Google the best search engine? But I noted an interesting difference while using this search form on the big three and the KW's link selling Google 10 200 hits Yahoo 9 700 hits MSN 20 200 hits Is that a result of
link selling site:WebmasterWorld News and Discussion for the Web Professional (www-version) 10 100 hits link selling site:webmasterworld.com 88 500 from webmasterworld.com OR webmasterworld.com for link selling. So the reason was as I thought how WMW has programmed that form. On MSN, the www and non-www version produces approximately the same number of hits. link selling site:webmasterworld.com 20 000 hits link selling site:WebmasterWorld News and Discussion for the Web Professional (www-version) 20 300 hits
__________________
Mini Network:: Financial information at your fingertips Learn object oriented programming where it started Last edited by kgun; 06-03-2008 at 08:34 AM. |
![]() |
|
| Thread Tools | |
| Display Modes | |
|
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| This page is showing an error....why? | watto | Web Programming Discussion Forum | 4 | 05-14-2007 05:33 PM |
| G. isn't showing my site locally, but is showing it globally | writergrrrl48 | Google Discussion Forum | 11 | 05-17-2006 08:57 PM |
| Runtime Error - Line 1 - Error Syntex Error | charms987 | Graphics & Design Discussion Forum | 6 | 07-29-2005 12:56 PM |
| Error: Syntax Error in IE | web-content-king | Web Programming Discussion Forum | 3 | 03-20-2005 02:42 AM |
| Error Pages showing in stats | kengeddes | Graphics & Design Discussion Forum | 1 | 06-06-2004 01:46 PM |
|
WebProWorld |
Advertise |
Contact Us |
About |
Forum Rules |
MVP's |
Archive |
Newsletter Archive |
Top |
WebProNews
WebProWorld is an iEntry, Inc. ® site - © 2009 All Rights Reserved Privacy Policy and Legal iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509 |