iEntry 10th Anniversary Forum Rules Search
WebProWorld
Register FAQ Calendar Mark Forums Read
Search Engine Optimization Forum SEO is much easier with help from peers and experts! The WebProWorld SEO forum is for the discussion and exploration of various search engine optimization topics. Any non (engine) specific SEO or SEM topics should go here.

Share Thread: & Tags

Share Thread:

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 05-29-2008, 07:13 AM
subhzash's Avatar
WebProWorld Veteran
 
Join Date: Apr 2007
Posts: 340
subhzash RepRank 1
Unhappy www.seoworkers.com showing error

Hi All,

When i want to see the internal links of SEO Workers Search Engine Optimization Consulting Company through dead-links.com and also with XENU
It is giving following error

Dead-links.com shows this error

SEO Workers Search Engine Optimization Consulting Company 403 Forbidden

XENU shows that

No such host



Regards
Subhzash
Reply With Quote
  #2 (permalink)  
Old 05-29-2008, 07:34 AM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,678
kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9
Default Re: www.seoworkers.com showing error

Probably that implies that that bot is blocked.
Reply With Quote
  #3 (permalink)  
Old 05-29-2008, 10:21 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

We deny domain access to spammers, spies and other scumbags with our .htaccess file. Bad bots attacks are reduced up to at least 95% so far!!!

To be specific here is the rule which can be helpful for everyone else too:
Code:
RewriteEngine on
RewriteBase /
RewriteCond %{HTTP_USER_AGENT} ADSARobot|ah-ha|almaden|aktuelles|Anarchie|amzn_assoc|ASPSeek|ASSORT|ATHENS|Atomz|attach|attache|autoemailspider|BackWeb|Bandit|BatchFTP|bdfetch|big.brother|BlackWidow|bmclient|Boston\ Project|BravoBrian\ SpiderEngine\ MarcoPolo|Bot\ mailto:craftbot@yahoo.com|Buddy|Bullseye|bumblebee|capture|CherryPicker|ChinaClaw|CICC|clipping|Collector|Copier|Crescent|Crescent\ Internet\ ToolPak|Custo|cyberalert|Deweb|diagem|Digger|Digimarc|DIIbot|DISCo|DISCo\ Pump|DISCoFinder|Download\ Demon|Download\ Wonder|Downloader|Drip|DSurf15a|DTS.Agent|EasyDL|eCatch|ecollector|efp@gmx\.net|Email\ Extractor|EirGrabber|email|EmailCollector|EmailSiphon|EmailWolf|Express\ WebPictures|ExtractorPro|EyeNetIE|FavOrg|fastlwspider|Favorites\ Sweeper|Fetch|FEZhead|FileHound|FlashGet\ WebWasher|FlickBot|fluffy|FrontPage|GalaxyBot|Generic|Getleft|GetRight|GetSmart|GetWeb!|GetWebPage|gigabaz|Girafabot|Go\!Zilla|Go!Zilla|Go-Ahead-Got-It|GornKer|gotit|Grabber|GrabNet|Grafula|Green\ Research|grub-client|Harvest|hhjhj@yahoo|hloader|HMView|HomePageSearch|http\ generic|HTTrack|httpdown|httrack|ia_archiver|IBM_Planetwide|Image\ Stripper|Image\ Sucker|imagefetch|IncyWincy|Indy*Library|Indy\ Library|informant|Ingelin|InterGET|Internet\ Ninja|InternetLinkagent|Internet\ Ninja|InternetSeer\.com|Iria|Irvine|JBH*agent|JetCar|JOC|JOC\ Web\ Spider|JustView|KWebGet|Lachesis|larbin|Leacher|LeechFTP|LexiBot|lftp|libwww|likse|Link|Link*Sleuth|LINKS\ ARoMATIZED|LinkWalker|LWP|lwp-trivial|Mag-Net|Magnet|Mac\ Finder|Mag-Net|Mass\ Downloader|MCspider|MJ12bot/v1\.0\.8|Memo|Microsoft.URL|MIDown\ tool|Mirror|Missigua\ Locator|Mister\ PiX|MMMtoCrawl\/UrlDispatcherLLL|^Mozilla$|Mozilla.*Indy|Mozilla.*NEWT|Mozilla*MSIECrawler|MS\ FrontPage*|MSFrontPage|MSIECrawler|MSProxy|multithreaddb|nationaldirectory|Navroad|NearSite|NetAnts|NetCarta|NetMechanic|netprospector|NetResearchServer|NetSpider|Net\ Vampire|NetZIP|NetZip\ Downloader|NetZippy|NEWT|NICErsPRO|Ninja|NPBot|Octopus|Offline\ Explorer|Offline\ Navigator|OpaL|Openfind|OpenTextSiteCrawler|OrangeBot|PageGrabber|Papa\ Foto|PackRat|pavuk|pcBrowser|PersonaPilot|Ping|PingALink|Pockey|Proxy|psbot|PSurf|psycheclone|puf|Pump|PushSite|QRVA|RealDownload|Reaper|Recorder|ReGet|replacer|RepoMonkey|Robozilla|Rover|RPT-HTTPClient|Rsync|Scooter|SearchExpress|searchhippo|searchterms\.it|Second\ Street\ Research|Seeker|Shai|Siphon|sitecheck|sitecheck.internetseer.com|SiteSnagger|SlySearch|SmartDownload|snagger|Snake|SpaceBison|Spegla|SpiderBot|sproose|SqWorm|Stripper|Sucker|SuperBot|SuperHTTP|Surfbot|SurfWalker|Szukacz|tAkeOut|tarspider|Teleport\ Pro|Templeton|TrueRobot|TV33_Mercator|UIowaCrawler|UtilMind|URLSpiderPro|URL_Spider_Pro|Vacuum|vagabondo|vayala|visibilitygap|VoidEYE|vspider|Web\ Downloader|w3mir|Web\ Data\ Extractor|Web\ Image\ Collector|Web\ Sucker|Wweb|WebAuto|WebBandit|web\.by\.mail|Webclipping|webcollage|webcollector|WebCopier|webcraft@bea|webdevil|webdownloader|Webdup|WebEMailExtrac|WebFetch|WebGo\ IS|WebHook|Webinator|WebLeacher|WEBMASTERS|WebMiner|WebMirror|webmole|WebReaper|WebSauger|Website|Website\ eXtractor|Website\ Quester|WebSnake|Webster|WebStripper|websucker|webvac|webwalk|webweasel|WebWhacker|WebZIP|Wget|Whacker|whizbang|WhosTalking|Widow|WISEbot|WWWOFFLE|x-Tractor|^Xaldon\ WebSpider|WUMPUS|Xenu|XGET|Zeus.*Webster|Zeus [NC]
RewriteRule ^.* - [F,L]
and

Code:
SetEnvIfNoCase User-Agent "8484 Boston Project v 1.0" bad_bot
SetEnvIfNoCase User-Agent "charlotte/" bad_bot
SetEnvIfNoCase User-Agent "curl/7.15.5 (i686-redhat-linux-gnu) libcurl/7.15.5 OpenSSL/0.9.8b zlib/1.2.3 libidn/0.6.5" bad_bot
SetEnvifNoCase User-Agent "ISC Systems iRc Search 2.1" bad_bot
SetEnvIfNoCase User-Agent "^Jakarta\ Commons-HttpClient/" bad_bot
SetEnvIfNoCase User-Agent "Java 1.5 / IBM HTML Commons" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.1_01" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.1_04" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_01" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_02" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_03" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_04" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_05" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_07" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_08" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_09" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_10" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_12" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_13" bad_bot
SetEnvIfNoCase User-Agent "Java/1.4.2_16" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0-p3" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_01" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_02" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_03" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_04" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_05" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_06" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_07" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_08" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_09" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_10" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_11" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_12" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_13" bad_bot
SetEnvIfNoCase User-Agent "Java/1.5.0_14" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0-beta" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0-beta2" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0-beta" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0-beta2" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0-dp" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0-oem" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0-rc" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0_01" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0_01-ea" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0_02" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0_03" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0_04" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0_05" bad_bot
SetEnvIfNoCase User-Agent "Java/1.6.0_06" bad_bot
SetEnvIfNoCase User-Agent "Java1.1.8" bad_bot
SetEnvIfNoCase User-Agent "Java1.2.2" bad_bot
SetEnvIfNoCase User-Agent "Java1.3.0" bad_bot
SetEnvIfNoCase User-Agent "Java1.3.1" bad_bot
SetEnvIfNoCase User-Agent "Java1.3.1_07" bad_bot
SetEnvIfNoCase User-Agent "Java1.3.1_18" bad_bot
SetEnvIfNoCase User-Agent "Java1.4.0" bad_bot
SetEnvIfNoCase User-Agent "Java1.4.0_01" bad_bot
SetEnvIfNoCase User-Agent "libwww-perl/" bad_bot
SetEnvIfNoCase User-Agent "^libcurl-agent/" bad_bot
SetEnvIfNoCase User-Agent "^Microsoft\ URL\ Control.*$" bad_bot
SetEnvIfNoCase User-Agent "MJ12bot/v1.0.8" bad_bot
SetEnvIfNoCase User-Agent "^Missigua" bad_bot
SetEnvIfNoCase User-Agent "^Mozilla/4\.0\ .*Win\ 9x\ 4\.90.*$" bad_bot
SetEnvIfNoCase User-Agent "Nutch" bad_bot
SetEnvIfNoCase User-Agent "phpversion" bad_bot
SetEnvIfNoCase User-Agent "TencentTraveler" bad_bot
SetEnvIfNoCase User-Agent "^Web Downloader" bad_bot
<FilesMatch "(.*)">
Order Allow,Deny
Allow from all
Deny from env=bad_bot
</FilesMatch>
In our robots.txt we are blocking also some bots, including the bots attempting articles theft:

Code:
User-agent: nutch
Disallow: / 

User-Agent: OmniExplorer_Bot
Disallow: /

User-agent: MJ12bot
Disallow: /

User-agent: Bitacle bot/1.1
Disallow: /

User-agent: Bitacle bot
Disallow: /

User-agent: Bitacle *
Disallow: /

User-agent: Bitacle*
Disallow: /

User-agent: Bitacle
Disallow: /
If you know any missing, please be kind a let us know so we can update.

The ones we could not block so far, we let Distributed Spam Harvester Tracking Network | Project Honey Pot to catch them.

Any further problems please?

Thanks for trying to support SEO Workers. Great promotion attempt!!!
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 05-29-2008 at 12:19 PM.
Reply With Quote
  #4 (permalink)  
Old 05-29-2008, 10:47 AM
Burf.com's Avatar
WebProWorld Veteran
 
Join Date: Dec 2003
Location: UK, Hampshire
Posts: 541
Burf.com RepRank 1
Default Re: www.seoworkers.com showing error

Who else bans bots like this

Webnauts implemented this for me and its made quite a difference on traffic / bw etc
Reply With Quote
  #5 (permalink)  
Old 05-29-2008, 11:56 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

I just wanted to bring to your consideration that I just found some bots I did not manage to exclude so far.

I updated my files and the same time I updated my post above too.

Stay tuned.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #6 (permalink)  
Old 05-29-2008, 12:02 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9
Default Re: www.seoworkers.com showing error

I like it. Going to have to try that method out on my site.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #7 (permalink)  
Old 05-29-2008, 12:22 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by wige View Post
I like it. Going to have to try that method out on my site.
I updated this moment again. I added all Java bots I could find, and others too now.

I can't express with words how much bandwidth it saved us so far. And how much spam is reduced. Here you can follow up the most recent results: Statistics for seoworkers.com (2008-05)

Try it and you will be amazed too buddy.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #8 (permalink)  
Old 05-29-2008, 12:54 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9
Default Re: www.seoworkers.com showing error

Here are a few others that I have blocked on mine:

Arachmo
BecomeBot
bot/1.0
Clushbot
Gigabot
kalooga
Kyluka crawl
MSR-ISRCCrawler
my-heritrix-crawler
NicheBot
noxtrumbot
Pingdom
POE-Component-Client-HTTP
Yeti
zermelo
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #9 (permalink)  
Old 05-29-2008, 01:00 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Wige thanks for the tip in your PM.

I just replaced the all SetEnvIfNoCase for the Java User-Agents with the one you advised me
Code:
BrowserMatch "^Java/?[1-9_\.]*" bad_bot
and tested with this simulator several versions and looks like it works: Simulate any User Agent or Bot
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #10 (permalink)  
Old 05-29-2008, 01:44 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by wige View Post
Here are a few others that I have blocked on mine:

Arachmo
BecomeBot
bot/1.0
Clushbot
Gigabot
kalooga
Kyluka crawl
MSR-ISRCCrawler
my-heritrix-crawler
NicheBot
noxtrumbot
Pingdom
POE-Component-Client-HTTP
Yeti
zermelo
Wige thanks a lot for the heads up. I added those bots too, except of "Kyluka Crawl", since I could not get it right somehow. Maybe you can help there.

So here are the security updated settings in my .htaccess so far:

Code:
### Protect against DOS attacks by limiting file upload size ###
LimitRequestBody 10240000
Code:
### Prevent .htaccess and .htpasswd files from being viewed by web clients ###
<Files "^\.ht">
  Order allow,deny
  Deny from all
</Files>
Code:
RewriteEngine on
RewriteBase /
RewriteCond %{HTTP_USER_AGENT} ADSARobot|ah-ha|almaden|aktuelles|Anarchie|amzn_assoc|Arachmo|ASPSeek|ASSORT|ATHENS|Atomz|attach|attache|autoemailspider|BackWeb|Bandit|BatchFTP|bdfetch|BecomeBot|big.brother|BlackWidow|bmclient|Boston\ Project|bot/1.0|BravoBrian\ SpiderEngine\ MarcoPolo|Bot\ mailto:craftbot@yahoo.com|Buddy|Bullseye|bumblebee|capture|CherryPicker|ChinaClaw|CICC|clipping|Clushbot|Collector|Copier|Crescent|Crescent\ Internet\ ToolPak|Custo|cyberalert|Deweb|diagem|Digger|Digimarc|DIIbot|DISCo|DISCo\ Pump|DISCoFinder|Download\ Demon|Download\ Wonder|Downloader|Drip|DSurf15a|DTS.Agent|EasyDL|eCatch|ecollector|efp@gmx\.net|Email\ Extractor|EirGrabber|email|EmailCollector|EmailSiphon|EmailWolf|Express\ WebPictures|ExtractorPro|EyeNetIE|FavOrg|fastlwspider|Favorites\ Sweeper|Fetch|FEZhead|FileHound|FlashGet\ WebWasher|FlickBot|fluffy|FrontPage|GalaxyBot|Generic|Getleft|GetRight|GetSmart|GetWeb!|GetWebPage|gigabaz|Gigabot|Girafabot|Go\!Zilla|Go!Zilla|Go-Ahead-Got-It|GornKer|gotit|Grabber|GrabNet|Grafula|Green\ Research|grub-client|Harvest|hhjhj@yahoo|hloader|HMView|HomePageSearch|http\ generic|HTTrack|httpdown|httrack|ia_archiver|IBM_Planetwide|Image\ Stripper|Image\ Sucker|imagefetch|IncyWincy|Indy*Library|Indy\ Library|informant|Ingelin|InterGET|Internet\ Ninja|InternetLinkagent|Internet\ Ninja|InternetSeer\.com|Iria|Irvine|JBH*agent|JetCar|JOC|JOC\ Web\ Spider|JustView|kalooga|KWebGet|Lachesis|larbin|Leacher|LeechFTP|LexiBot|lftp|libwww|likse|Link|Link*Sleuth|LINKS\ ARoMATIZED|LinkWalker|LWP|lwp-trivial|Mag-Net|Magnet|Mac\ Finder|Mag-Net|Mass\ Downloader|MCspider|MJ12bot/v1\.0\.8|Memo|Microsoft.URL|MIDown\ tool|Mirror|Missigua\ Locator|Mister\ PiX|MMMtoCrawl\/UrlDispatcherLLL|^Mozilla$|Mozilla.*Indy|Mozilla.*NEWT|Mozilla*MSIECrawler|MS\ FrontPage*|MSFrontPage|MSIECrawler|MSProxy|MSR-ISRCCrawler|multithreaddb|my-heritrix-crawler|nationaldirectory|Navroad|NearSite|NetAnts|NetCarta|NetMechanic|netprospector|NetResearchServer|NetSpider|Net\ Vampire|NetZIP|NetZip\ Downloader|NetZippy|NEWT|NICErsPRO|Ninja|NPBot|NicheBot|noxtrumbot|Octopus|Offline\ Explorer|Offline\ Navigator|OpaL|Openfind|OpenTextSiteCrawler|OrangeBot|PageGrabber|Papa\ Foto|PackRat|pavuk|pcBrowser|PersonaPilot|Ping|PingALink|Pingdom|Pockey|POE-Component-Client-HTTP|Proxy|psbot|PSurf|psycheclone|puf|Pump|PushSite|QRVA|RealDownload|Reaper|Recorder|ReGet|replacer|RepoMonkey|Robozilla|Rover|RPT-HTTPClient|Rsync|Scooter|SearchExpress|searchhippo|searchterms\.it|Second\ Street\ Research|Seeker|Shai|Siphon|sitecheck|sitecheck.internetseer.com|SiteSnagger|SlySearch|SmartDownload|snagger|Snake|SpaceBison|Spegla|SpiderBot|sproose|SqWorm|Stripper|Sucker|SuperBot|SuperHTTP|Surfbot|SurfWalker|Szukacz|tAkeOut|tarspider|Teleport\ Pro|Templeton|TrueRobot|TV33_Mercator|UIowaCrawler|UtilMind|URLSpiderPro|URL_Spider_Pro|Vacuum|vagabondo|vayala|visibilitygap|VoidEYE|vspider|Web\ Downloader|w3mir|Web\ Data\ Extractor|Web\ Image\ Collector|Web\ Sucker|Wweb|WebAuto|WebBandit|web\.by\.mail|Webclipping|webcollage|webcollector|WebCopier|webcraft@bea|webdevil|webdownloader|Webdup|WebEMailExtrac|WebFetch|WebGo\ IS|WebHook|Webinator|WebLeacher|WEBMASTERS|WebMiner|WebMirror|webmole|WebReaper|WebSauger|Website|Website\ eXtractor|Website\ Quester|WebSnake|Webster|WebStripper|websucker|webvac|webwalk|webweasel|WebWhacker|WebZIP|Wget|Whacker|whizbang|WhosTalking|Widow|WISEbot|WWWOFFLE|x-Tractor|^Xaldon\ WebSpider|WUMPUS|Xenu|XGET|Yeti|zermelo|Zeus.*Webster|Zeus [NC]
RewriteRule ^.* - [F,L]
Code:
### Deny Fake Bots ###
BrowserMatch "^Java/?[1-9_\.]*" bad_bot
SetEnvIfNoCase User-Agent "8484 Boston Project v 1.0" bad_bot
SetEnvIfNoCase User-Agent "charlotte/" bad_bot
SetEnvIfNoCase User-Agent "curl/7.15.5 (i686-redhat-linux-gnu) libcurl/7.15.5 OpenSSL/0.9.8b zlib/1.2.3 libidn/0.6.5" bad_bot
SetEnvifNoCase User-Agent "ISC Systems iRc Search 2.1" bad_bot
SetEnvIfNoCase User-Agent "^Jakarta\ Commons-HttpClient/" bad_bot
SetEnvIfNoCase User-Agent "libwww-perl/" bad_bot
SetEnvIfNoCase User-Agent "^libcurl-agent/" bad_bot
SetEnvIfNoCase User-Agent "^Microsoft\ URL\ Control.*$" bad_bot
SetEnvIfNoCase User-Agent "MJ12bot/v1.0.8" bad_bot
SetEnvIfNoCase User-Agent "^Missigua" bad_bot
SetEnvIfNoCase User-Agent "^Mozilla/4\.0\ .*Win\ 9x\ 4\.90.*$" bad_bot
SetEnvIfNoCase User-Agent "Nutch" bad_bot
SetEnvIfNoCase User-Agent "phpversion" bad_bot
SetEnvIfNoCase User-Agent "TencentTraveler" bad_bot
SetEnvIfNoCase User-Agent "^Web Downloader" bad_bot
<FilesMatch "(.*)">
Order Allow,Deny
Allow from all
Deny from env=bad_bot
</FilesMatch>
Any further ideas to optimize (shrink/compress) the above?

I was thinking also to block proxy servers, but I am not sure if that would have been a good idea.
What do you think about this?

Code:
### Block proxy servers from site access ###
RewriteCond %{HTTP:VIA}                 !^$ [OR]
RewriteCond %{HTTP:FORWARDED}           !^$ [OR]
RewriteCond %{HTTP:USERAGENT_VIA}       !^$ [OR]
RewriteCond %{HTTP:X_FORWARDED_FOR}     !^$ [OR]
RewriteCond %{HTTP:PROXY_CONNECTION}    !^$ [OR]
RewriteCond %{HTTP:XPROXY_CONNECTION}   !^$ [OR]
RewriteCond %{HTTP:HTTP_PC_REMOTE_ADDR} !^$ [OR]
RewriteCond %{HTTP:HTTP_CLIENT_IP}      !^$
RewriteRule ^(.*)$ - [F]
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 05-29-2008 at 01:49 PM.
Reply With Quote
  #11 (permalink)  
Old 05-29-2008, 01:54 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,678
kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by Burf.com View Post
Who else bans bots like this

Webnauts implemented this for me and its made quite a difference on traffic / bw etc
Try http://www.forumnorway.com/

Ideally it blocks all non-Norwegian Ip's.

Not a single attack in months.

Last edited by kgun; 05-29-2008 at 01:58 PM.
Reply With Quote
  #12 (permalink)  
Old 05-29-2008, 01:58 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by kgun View Post
Try http://www.forumnorway.com/

Ideally it blocks all non-Norwegian Ip's
That was a good one.

Is there also an ideal solution to block all India IPs?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #13 (permalink)  
Old 05-29-2008, 02:03 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by kgun View Post
Try http://www.forumnorway.com/

Ideally it blocks all non-Norwegian Ip's.

Not a single attack in months.
Are you claiming that your forums cannot be viewed through a Norvegian proxy?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #14 (permalink)  
Old 05-29-2008, 02:04 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by subhzash View Post
Hi All,

When i want to see the internal links of SEO Workers Search Engine Optimization Consulting Company through dead-links.com and also with XENU
It is giving following error

Dead-links.com shows this error

SEO Workers Search Engine Optimization Consulting Company 403 Forbidden

XENU shows that

No such host



Regards
Subhzash
Subhzash, where did you go? Are you still sure that something was wrong?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #15 (permalink)  
Old 05-29-2008, 02:09 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,678
kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by Webnauts View Post
Are you claiming that your forums cannot be viewed through a Norvegian proxy?
May be. John try.
Reply With Quote
  #16 (permalink)  
Old 05-29-2008, 02:12 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,678
kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by Webnauts View Post
That was a good one.

Is there also an ideal solution to block all India IPs?
Done in a few minutes block a country, block by ip address, deny access by country

Additional information here: DigitalStart.net: The starting point for English speaking surfers and webmasters
Reply With Quote
  #17 (permalink)  
Old 05-29-2008, 02:20 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by kgun View Post
May be. John try.
I do not need to try. If I come through a Norwegian proxy server to your forums, I will have access.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #18 (permalink)  
Old 05-29-2008, 02:22 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by kgun View Post
Well I am aware of that solution already. But my .htaccess is already huge.

Quote:
Originally Posted by kgun View Post
Congrats brother. I like the idea!!! I wish I had an IBL on the page.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #19 (permalink)  
Old 05-29-2008, 06:48 PM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9
Default Re: www.seoworkers.com showing error

Well, already I have seen a distinct drop in bandwidth usage today. I am using the BrowserMatch "whatever" bad_bot method, because I can set this in the server's configuration and apply it to all of the virtual hosts, which I could not get working with the RewriteRule for some reason. Probably had a conflict somewhere.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #20 (permalink)  
Old 05-29-2008, 08:36 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by wige View Post
Well, already I have seen a distinct drop in bandwidth usage today. I am using the BrowserMatch "whatever" bad_bot method, because I can set this in the server's configuration and apply it to all of the virtual hosts, which I could not get working with the RewriteRule for some reason. Probably had a conflict somewhere.
Great to hear positive news!

By the way I would like to ask you probably use Skype or IM? And do you speak Perl?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #21 (permalink)  
Old 05-30-2008, 12:52 AM
subhzash's Avatar
WebProWorld Veteran
 
Join Date: Apr 2007
Posts: 340
subhzash RepRank 1
Talking Re: www.seoworkers.com showing error

Quote:
Originally Posted by Webnauts View Post
Subhzash, where did you go? Are you still sure that something was wrong?

Not really. I dont think still some thing is wrong in your website. But my main intension is to prove my self by finding bugs in your site. Some times your words really worst and those hurt me.

Anyway not a problem...But still i do some efforts to find bugs in your site.



Regards
Subhzash
Reply With Quote
  #22 (permalink)  
Old 05-30-2008, 08:23 AM
full house's Avatar
WebProWorld Veteran
 
Join Date: Sep 2007
Posts: 522
full house RepRank 2
Default Re: www.seoworkers.com showing error

do you detect spammers as easy as that?
Reply With Quote
  #23 (permalink)  
Old 05-30-2008, 10:29 AM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by Webnauts View Post
Great to hear positive news!

By the way I would like to ask you probably use Skype or IM? And do you speak Perl?
I do use IM, just added them to my profile, although I am on MSN a lot more than Yahoo. As for Perl, I used to build my web sites entirely in Perl (Yes, I did once say "PHP? Bah! Merely a fad - real devs use Perl!" - sorta) but its been so long since I used it I have forgotten most of it.

Quote:
Originally Posted by subhzash View Post
Not really. I dont think still some thing is wrong in your website. But my main intension is to prove my self by finding bugs in your site. Some times your words really worst and those hurt me.

Anyway not a problem...But still i do some efforts to find bugs in your site.



Regards
Subhzash
There are a few. Need to be a hacker to spot them though.

Quote:
Originally Posted by full house View Post
do you detect spammers as easy as that?
If you mean spambots, I use a database to log visits so I can track how popular different products are, and to allow certain customization of the site for visitors. I tweaked the logging system to spot user agents that are only on a few IP addresses, or that visit on unusual rates, and from that I can spot both new browsers (for example, an upswing in traffic from mobile browsers) and spam bots for blocking. If you are referring to forum and form spam, a careful approach to security and input validation, combined with a system that tracks unexpected user input and blocks the source, can go a long way toward eliminating the problem.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #24 (permalink)  
Old 05-30-2008, 03:37 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by subhzash View Post
Not really. I dont think still some thing is wrong in your website. But my main intension is to prove my self by finding bugs in your site. Some times your words really worst and those hurt me.

Anyway not a problem...But still i do some efforts to find bugs in your site.



Regards
Subhzash
Hey sorry man. Now I understand entirely your intention. I honestly thank you for your kind efforts to support me perfect my site.

Can we be friends now?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #25 (permalink)  
Old 05-31-2008, 10:57 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by wige View Post
Here are a few others that I have blocked on mine:

Arachmo
BecomeBot
bot/1.0
Clushbot
Gigabot
kalooga
Kyluka crawl
MSR-ISRCCrawler
my-heritrix-crawler
NicheBot
noxtrumbot
Pingdom
POE-Component-Client-HTTP
Yeti
zermelo
Wige are you sure GigaBot is a spambot? I made a research and I found out that it is the bot of the search engine Gigablast.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #26 (permalink)  
Old 05-31-2008, 11:41 PM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,678
kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by Webnauts View Post
Wige are you sure GigaBot is a spambot? I made a research and I found out that it is the bot of the search engine Gigablast.
If you are a member of WMW, you may have noticed that some members think that nearly every bot is bad.

Even the old well established ex Norwegian SE FAST - Enterprise Search now owned by Microsoft was blocked by some members.

Last edited by kgun; 06-01-2008 at 12:03 AM.
Reply With Quote
  #27 (permalink)  
Old 06-01-2008, 12:22 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by kgun View Post

Even the old well established ex Norwegian SE FAST - Enterprise Search now owned by Microsoft was blocked by some members.
WHAAAT????

Off-topic: My online support is active at the moment.
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO

Last edited by Webnauts; 06-01-2008 at 12:27 AM.
Reply With Quote
  #28 (permalink)  
Old 06-01-2008, 12:07 PM
subhzash's Avatar
WebProWorld Veteran
 
Join Date: Apr 2007
Posts: 340
subhzash RepRank 1
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by Webnauts View Post
Hey sorry man. Now I understand entirely your intention. I honestly thank you for your kind efforts to support me perfect my site.

Can we be friends now?
Sure. Thanks webnauts. I feel happy to have friend like to you. I would like to add your yahoo and gmail ids to add my friends list. Please send me


Thanks a lot


Regards
Subhzash
Reply With Quote
  #29 (permalink)  
Old 06-01-2008, 12:15 PM
subhzash's Avatar
WebProWorld Veteran
 
Join Date: Apr 2007
Posts: 340
subhzash RepRank 1
Default Re: www.seoworkers.com showing error

I hope dead-links.com is not bad bot. I use to list out URLs of the website which have less than 250 pages.

Why do you stop dead-links.com to crawl the website??
Reply With Quote
  #30 (permalink)  
Old 06-01-2008, 10:36 PM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by subhzash View Post
I hope dead-links.com is not bad bot. I use to list out URLs of the website which have less than 250 pages.

Why do you stop dead-links.com to crawl the website??
We check if we have dead links ourselves. Every time we do changes on our site, we turn off temporary the rules that disallow our tools to check, and when things are fine we block everything again.
What is wrong with that?

By the way a great news! Where did the bad bots go?
Statistics for seoworkers.com (2008-06)
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #31 (permalink)  
Old 06-02-2008, 03:48 AM
subhzash's Avatar
WebProWorld Veteran
 
Join Date: Apr 2007
Posts: 340
subhzash RepRank 1
Smile Re: www.seoworkers.com showing error

Quote:
Originally Posted by Webnauts View Post
We check if we have dead links ourselves. Every time we do changes on our site, we turn off temporary the rules that disallow our tools to check, and when things are fine we block everything again.
What is wrong with that?

By the way a great news! Where did the bad bots go?
Statistics for seoworkers.com (2008-06)
Nothing wrong. Actually i had doubt that how do you check internal links if some internal site checking bots are blocked. Any way my doubt clarified.
Reply With Quote
  #32 (permalink)  
Old 06-02-2008, 10:31 AM
wige's Avatar
Moderator
WebProWorld Moderator
 
Join Date: Jun 2006
Location: United States
Posts: 2,648
wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9wige RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by Webnauts View Post
Wige are you sure GigaBot is a spambot? I made a research and I found out that it is the bot of the search engine Gigablast.
Ah. I will remove it from my block list.

Quote:
Originally Posted by kgun View Post
Even the old well established ex Norwegian SE FAST - Enterprise Search now owned by Microsoft was blocked by some members.
That one too.

Quote:
Originally Posted by kgun View Post
If you are a member of WMW, you may have noticed that some members think that nearly every bot is bad.
Yeah, I noticed some even recommend blocking Google's Mobile Transcoder, a human operated (thus not a bot) service that converts standard web sites for mobile users, because it doesn't obey robots.txt.
__________________
The best way to learn anything, is to question everything.
Reply With Quote
  #33 (permalink)  
Old 06-03-2008, 06:01 AM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,678
kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by wige View Post
That one too.
I don't kow if you use this Help : when you search that forum.

I think you find it in a thread with the following KW's

ban a country using htaccess

Here is the thread: WebmasterWorld Login

"i want to banned certain ip adresses all coming from:
UserAgent: FAST-WebCrawler/3.6 (atw-crawler at fast dot no; http://fast.no/support/crawler.asp)
IP address: 66.77.73.151


i get way to many visits from this spiderbot"

My bolding.

Note the answer (last post) from heini:

"Just a quick and offtopic addition: you are about to block the spider of one of the few big worldwide search engines".


Last edited by kgun; 06-03-2008 at 06:12 AM.
Reply With Quote
  #34 (permalink)  
Old 06-03-2008, 06:47 AM
Webnauts's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: Aug 2003
Location: Worldwide
Posts: 8,164
Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9Webnauts RepRank 9
Default Re: www.seoworkers.com showing error

Quote:
Originally Posted by kgun View Post
Note the answer (last post) from heini:

"Just a quick and offtopic addition: you are about to block the spider of one of the few big worldwide search engines".
Do you probably mean one of the best?
__________________
"Being an expert isn't telling other people what you know. It's understanding what questions to ask, and flexibly applying your knowledge to the specific situation at hand. Being an expert means providing sensible, highly contextual direction." Jeff Atwood
SEO Workers - Search Engine Optimization Consulting Company | SEO Analysis Tool | Webnauts Net SEO
Reply With Quote
  #35 (permalink)  
Old 06-03-2008, 07:14 AM
kgun's Avatar
WebProWorld 1,000+ Club
WebProWorld MVP
 
Join Date: May 2005
Location: Norway
Posts: 5,678
kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9kgun RepRank 9
Default Re: www.seoworkers.com showing error

What do you mean by best?

I started a new thread about this important topic here:

Google the best search engine?

But I noted an interesting difference while using this search form on the big three and the KW's

link selling

Google 10 200 hits

Yahoo 9 700 hits

MSN 20 200 hits

Is that a result of
  • Indexing
  • Reporting
  • New MSN search algorithms
Took a cup of coffee outside in the beautiful may weather and got back to the computer and tried these searches on Google:

link selling site:WebmasterWorld News and Discussion for the Web Professional (www-version) 10 100 hits

link selling site:webmasterworld.com 88 500

from webmasterworld.com OR webmasterworld.com for link selling.

So the reason was as I thought how WMW has programmed that form.

On MSN, the www and non-www version produces approximately the same number of hits.

link selling site:webmasterworld.com 20 000 hits

link selling site:WebmasterWorld News and Discussion for the Web Professional (www-version) 20 300 hits

Last edited by kgun; 06-03-2008 at 08:34 AM.
Reply With Quote
Reply

  WebProWorld > Search Engines > Search Engine Optimization Forum

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
This page is showing an error....why? watto Web Programming Discussion Forum 4 05-14-2007 05:33 PM
G. isn't showing my site locally, but is showing it globally writergrrrl48 Google Discussion Forum 11 05-17-2006 08:57 PM
Runtime Error - Line 1 - Error Syntex Error charms987 Graphics & Design Discussion Forum 6 07-29-2005 12:56 PM
Error: Syntax Error in IE web-content-king Web Programming Discussion Forum 3 03-20-2005 02:42 AM
Error Pages showing in stats kengeddes Graphics & Design Discussion Forum 1 06-06-2004 01:46 PM


All times are GMT -4. The time now is 08:28 PM.



Search Engine Optimization by vBSEO 3.3.0