Duplicate Content & URL Canonicalization

Posted on February 16, 2007 · Posted in Search Engine Optimization, SEO Strategies

Before we get into this week’s tip, let me provide a definition for the term Canonicalization.

“It is the process of converting data that has more than one possible representation into a “standard” canonical representation.”

For SEO purposes this means it creates a definitive and unique page to represent more than one possible page on your site. You want to avoid duplicate pages on your site.

Sometimes duplicate content is a result of two pages sharing the same information in two different files. Sometimes you have duplicate content because of the technology causing two different urls point to the same page. Both of these problems have simple solutions. It is important to correct this because search engines want to show only unique web pages and text.

If your site has multiple pages with the same content possibly through a Content Management System (CMS) or through duplicate navigation, or because it actually exists in multiple versions, you are could be hurting your search engine ranking results. We all know how important linking is to any SEO campaign and if links pointing to different urls of the same information your link value will be diluted because 2 incoming links may point to one version and 3 links to another. It would be much better for all 5 links to point to one url.

301 Re-Direct to the Rescue
The solution is to take any current duplicate pages and use a 301 re-direct to point all versions to a single, “canonical” version of the content or web page.

Most often this problem can be found on a site’s homepage. For example: Search engines view your home page as having more than one version. How? Take a look at the following urls. All point to the same page, but to the search engines they are different: http://www.yoursite.com, http://yoursite.com, http://yoursite.com/index.html and http://www.yoursite.com/index.html. The search engines may find up to four home pages that have the same content.

While this may not cause your site to be unranked it is certainly not helping and can easily cause poor rankings. That is a shame for something that is so easily corrected. Most often this is caused by links pointing to different versions of your site. You can’t change all the links coming into your site, but you can use the 301 re-direct to solve this by pointing all versions of your home page to the full url (http://www.url.com). You can read more on how this is accomplished on our 301 re-direct blog post.