I'm working with a client many of whose product descriptions have been scraped by sites that sell similar products. What is the best way to identify new content as ours? Would you use the rel=author tag or something else?
Just before publishing your new article, go to myows . com (you have to create an account and pay a small monthly fee), upload your new text, and they will timestamp it. This is to prove you are the owner of the idea.
Right after you publish your new article, go to pingomatic . com and send pings to the major blog engines. This technique also creates a first timestamp and proves that the article was first seen on yourdomain.com
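If you would rather script the ping than use the web form, here is a minimal sketch. It assumes the service accepts the standard `weblogUpdates.ping` XML-RPC call. The endpoint URL, the site name "Example Shop" and the helper names are my own placeholders, so check them against the ping service's documentation before relying on this.

```python
import xmlrpc.client

# Assumed endpoint; verify it against the ping service's own documentation.
PING_ENDPOINT = "http://rpc.pingomatic.com/"


def build_ping_payload(site_name: str, site_url: str) -> str:
    """Return the XML-RPC request body for a weblogUpdates.ping call."""
    return xmlrpc.client.dumps((site_name, site_url), methodname="weblogUpdates.ping")


def send_ping(site_name: str, site_url: str, endpoint: str = PING_ENDPOINT):
    """Send the ping over the network and return whatever the server replies."""
    server = xmlrpc.client.ServerProxy(endpoint)
    return server.weblogUpdates.ping(site_name, site_url)


if __name__ == "__main__":
    # Only prints the request body; call send_ping() to actually notify the service.
    print(build_ping_payload("Example Shop", "http://yourdomain.com/"))
```

Keeping the payload builder separate from the network call lets you inspect what would be sent before anything leaves your machine.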
On product descriptions it is very difficult. The only way we have found with clients is to write them in an esoteric fashion (perhaps including the company name and URL in each description) so that they 'fit' with your site but stand out on someone else's. Groupon do a good job of this. Registering with timestamp services is only really any good if you are going to pursue those who copy. That is time-consuming, and you have to pursue everyone you find to be able to enforce anything.
When we have this discussion with clients we do pick over the 'is it really the problem you think it is' question.
---------------------------------------------------------------------
I-ntarsia(tm) - A Hosted CMS for web designers and marketing agencies
I thought of this when I saw this thread earlier today. Would work as long as the scraper doesn't remove your name and url.
You can set up Google Alerts for say:
-your company name
-your domain name
-and depending on how many products we are talking about, a blurb of each description
Google Alerts are free. You can set them up to notify you by email as Google finds the information in the alert you set up.
Even if this doesn't catch the smart scrapers, the information is still helpful to see where you are being mentioned (your name and domain name).
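With a lot of products, typing the alert phrases by hand gets tedious. Here is a rough sketch that builds the quoted search strings to paste into Google Alerts. The function name, the eight-word default and the idea of taking words from the middle of each description are my own assumptions, not anything Google prescribes.

```python
from typing import List


def alert_queries(company: str, domain: str, descriptions: List[str], words: int = 8) -> List[str]:
    """Build exact-phrase search strings for the company, the domain and each description."""
    queries = ['"' + company + '"', '"' + domain + '"']
    for text in descriptions:
        tokens = text.split()
        if len(tokens) >= words:
            # Take a run of words from the middle, where a scraper is less likely to trim.
            start = (len(tokens) - words) // 2
            queries.append('"' + " ".join(tokens[start:start + words]) + '"')
    return queries


if __name__ == "__main__":
    sample = ["Each saucepan comes with a close fitting glass lid which locks in flavour and nutrients"]
    for query in alert_queries("Example Shop", "yourdomain.com", sample):
        print(query)
```

Descriptions shorter than the chosen word count are skipped, since a very short phrase would match too many unrelated pages.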
There is Copyscape also.
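If you already have a suspect page in hand, you can also get a rough idea of how much was lifted without any service. The sketch below compares overlapping five-word runs (shingles) between your description and the suspect text. This is a generic technique and not a description of how Copyscape works; the function names and the five-word size are my own choices.

```python
import re


def shingles(text: str, size: int = 5) -> set:
    """Return the set of overlapping word n-grams (shingles) found in text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def copied_fraction(original: str, suspect: str, size: int = 5) -> float:
    """Fraction of the original's shingles that also appear in the suspect text."""
    ours = shingles(original, size)
    if not ours:
        return 0.0
    return len(ours & shingles(suspect, size)) / len(ours)


if __name__ == "__main__":
    mine = "Each saucepan comes with a close fitting glass lid which locks in flavour"
    theirs = "Great value set. " + mine + " Order today."
    print(copied_fraction(mine, theirs))
```

A value near 1 means almost all of your wording turns up on the other page; a value near 0 means little or none of it does.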
I don't really think the myows . com idea would work. If you find someone scraping the content, you can file a DMCA notice yourself for free (other than your time).
Accrete Web Solutions - Search engine friendly websites, ecommerce websites & blogs
Web Page Mistakes - Web page mistakes with solutions
HTML Basic Tutor - HTML help to learn HTML basics
That's why it is difficult, because removing obvious stuff that remains the same in each description is easy. So that's why I mentioned doing something esoteric as well. To illustrate, you could describe this product, randomly selected from Amazon, as they do:
The Prestige Cook n Look 4 piece stainless steel set is a stylish and economical way of meeting your cookware needs. Made up of some of the most used cookware pieces in the kitchen this durable stainless steel cookware will be stylish and practical for years to come. Featuring full capped friction bonded base for even heat distribution and rivetted stainless steel and silicone combination handles for a confident, stay cool grip. Each saucepan comes with a close fitting glass lid which locks in flavour and nutrients while still offering see through convenience. Suitable for all cooker types including induction and oven safe to 180C/350F and Gas Mark 4.
Which would work on pretty much any site. Or you could describe it like this:
This cook set is just the best! Stainless steel? OF COURSE!. And that's not all, we got a full capped friction bonded base here for totally even heat distribution. The handle won't burn you either, because it is riveted steel and silicone. The flavour and good stuff is all locked in while you cook with the close fitting SEE THROUGH lids. And the icing on the cake? ALL cooker types AND oven safe to 180C/350F and GM4.
OK, cheesy, but I'm doing this on the fly. The point is that if the scrapers are scraping the factual descriptions for most of their products, the ones they scrape from you will look so different that they might just not bother.
Defend your site against stealing your content with a free plagiarism warning banner!
You can give copyscape.com a try.