What is duplicate content?
According to Google’s own webmaster central blog:
Duplicate content generally refers to substantive blocks of content within or across domains that either completely match other content or are appreciably similar. Most of the time when we see this, it’s unintentional or at least not malicious in origin: forums that generate both regular and stripped-down mobile-targeted pages, store items shown (and — worse yet — linked) via multiple distinct URLs, and so on. In some cases, content is duplicated across domains in an attempt to manipulate search engine rankings or garner more traffic via popular or long-tail queries.
Common situations
Company “ACME” wants to have several entry points for the same website. So they register the following domains: acme.com, acme.be, acme.org, acme.eu, … etc. Now they point all those domains to the same website (content). After a bit the page gets crawled by the Search Engines. Guess what happens?
The search engine crawls acme.be and finds an article about product “acme generic”. It indexes this and continues. After a while it crawls acme.com and finds (the same) article about “acme generic”. Then it flags the article at acme.com as a duplicate penalizing it in the search results.
Why do the Search Engines do this?
To have decent search results… A lot of (blackhat?!?) webmasters will use RSS feeds of other sites to publish their own content. If someone browses the net, and wants to find information about “acme generic”, they should only get the unique & relevant search results.
What should I do then?
Continue reading “The DO’s and DON’T’s of duplicate content.”