Thanks people,
If you're trying to spot bad bots, you might use a more general RE
Not so much spotting bots and other critter activity as finding ways to avoid blocking IP's via htaccess. You could explain this (s*6) for me.
I did find the "Firefox user agent switcher" so got to test a bit. I'll look at Rex Swain (sounds familiar).
I'll tell you what I found after a few days of viewing logs:
Some bots with MSIE 6.0 in user agent string still got in. Some switched to MSIE 7 immediately. No real visitors got through so I toned it down and left deny for MSIE 4,3,2 etc.
I've also been using and watching results for this:
Deny from env=bad_bot
SetEnvIfNoCase User-Agent ^-?$ bad_bot
BrowserMatchNoCase Wget bad_bot
BrowserMatchNoCase Curl bad_bot
This seems to be working well and effects 0 real visitors. I'm also using some proxy and request directives which works for some bots. I can't use perl or php stuff yet. Just starting to understand htaccess. Since I'm asking questions, here are a couple more, which line after {HTTP_REFERER} is more dependable (better)?
Code:
RewriteCond %{HTTP_REFERER} domain\.com [NC,OR]
RewriteCond %{HTTP_REFERER} ^xxxx://xxx\.crap\.com [NC]
RewriteRule ^(.*)$ - [F]
Question #2: I have a 500 error problem which occurs on clipbucket install. Error occurs only with user agents and only on /file/videos...
I know this should be simple. I've been looking, off and on, for months and can't fix it. If anyone wants to take a crack at it (paid) email me.
Thanks much for the responses, Doug