Removing your site from the proxy duplicators will require a few steps, and may take some time.
The first step is to create a new page on your site that only shows the user agent of the visitor's browser. Use the proxy web site to access that page, and note what the user agent is. In most cases, it will not be the same as a regular browser user agent. Lets say the user agent is "bad proxy 2.0"
Next, you need to create a custom error page. The proxy server will change any links on your site to keep the user in the proxy system, so any link you put in the page would not work. The page should simply contain a message that the page is being accessed with an unauthorized proxy, and please visit the site directly at
www.domain.tld. This page should be located in a subdirectory of the site root, so that it can be made accessible even after everything else is blocked.
Next, since you have a dedicated server, open up your server configuration, and add the following line:
Code:
BrowserMatch "bad proxy 2" bad_browser
This should be added right before the virtual hosts section.
Finally, add the following to your .htaccess file:
Code:
<Directory />
Order Allow,Deny
Deny from env=bad_browser
Allow from All
</Directory>
<Directory /errordocs/>
Order Deny,Allow
Allow from All
</Directory>
The first directory node needs to point to the web root, and the second directory node needs to point to the directory wherein your error document is located. This will ensure that the proxy system can only see the error messages. Also, the error document you created needs to be specified for 403 (access denied) errors.
Note, you can expand on this a bit further by adding the user agents of spambots after the BrowserMatch directive in the main configuration I mentioned, like this:
Code:
BrowserMatch "spambot" bad_browser
This will cut down massively on your bandwidth, block many spambots once you identify them, and cut down on proxy duplication as you find it.
Note, if the proxy system does not use a distinguished user agent, there is a slightly different approach to try. Let me know how this works though.
PS: This will take a while to actually take effect, since the proxy system needs time to see the change and start removing your pages. Your URLs will still show up, but you should not have an issue with new pages being added, or any risk of duplicate content penalties once the changes propagate.