Is reported duplication on the pages or their canonical pages?
-
There are several sections getting flagged for duplication on one of our sites:
http://mysite.com/section-1/?something=X&confirmed=true
http://mysite.com/section-2/?something=X&confirmed=true
http://mysite.com/section-3/?something=X&confirmed=trueEach of the above are showing as having duplicates of the other sections. Indeed, these pages are exactly the same (it's just an SMS confirmation page you enter your code in), however, they all have canonical links back to the section (without the query string), i.e. section-1, section-2 and section-3 respectively.
These three sections have unique content and aren't flagged up for duplications themselves, so my questions are:
Are the pages with the query strings the duplicates, and if so why are the canonical links being ignored?
or
Are the canonical pages without the query strings the duplicates, and if so why don't they appear as URLs in their own right in the duplicate content report?
I am guessing it's the former, but I can't figure out why it would ignore the canonical links. Any ideas?
Thanks
-
This is good news sugar-coating bad news
Thanks!
-
Hi,
The URLs that are reported by the crawl as being duplicates are the duplicate pages. Unfortunately the way the crawl from SEOMoz works, it does not factor the rel=canonical tag when reporting duplicates. In other words, even with the tag implemented, it will still report these pages as duplicates. Don't worry though, as long as the tag is implemented, the search engines should treat the canonical like a 301 redirect and not penalise you for duplicate content.
So to answer your question:
Are the pages with the query strings the duplicates? - Yes.
Hope that helps,
Adam
-
Hey,
It's kind of tricky to answer this without seeing at least two of the category pages but I am guessing that the duplication is in the category pages themselves and if they are simply very thin pages with little to differentiate category A from category B then there is your problem.
Rather than look at the web tool, if you export the spreadsheet this is a lot easier to understand and for each page there is a duplication column which has a comma separated list of the pages that are being flagged as possible duplicates so this should answer your question.
What to do though?
I may be telling you how to suck eggs but this is always a good read when it comes to thin content problems and solutions:
http://www.seomoz.org/blog/fat-pandas-and-thin-contentIf it was me, and these pages are thin, but that is the way they are supposed to be, and they are not really search landing pages then there is a good argument to noindex them and remove the possibility of them causing you any problems. If you do this, next time the campaign tool crawls your site they will be ignored and will not show up as a possible duplicate.
Obviously, from a Panda perspective, if these pages are listed as thin, they could be damaging other pages on the site so it is certainly an issue worth addressing.
Hope this helps!
Marcus
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Shifting target keyword to a new page, how do we rank the internal page?
I have been targeting one keyword for home page that was ranking between the postilion 6-7 but was never ranking on 1st as there were 2 highly competitive keywords targeted on the same page, I changed the keyword to an internal service page to rank it on 1st, I have optimized the content as well but the home page is still ranking on 11th, how do I get the internal page rank on that keyword
On-Page Optimization | | GOMO-Gabriel0 -
My site on desktop browser: page 2 /mobile browser: page 0
Using my two most pertinent keywords in Chome my site shows up page two. Using the same keywords on my iPhone does not show my site at all (I clicked on to page 15). I have a mobile ranking of 84 on Google PageSpeed Insights. Could be a bit higher but not enough to totally ignore my site. What am I missing?
On-Page Optimization | | artsp0 -
How to optimize WordPress Pages with Duplicate Page Content?
I found the non WWW ans WWW duplicate pages URL only, more than thousand pages.
On-Page Optimization | | eigital0 -
To create extra pages, or not to create extra pages?
I'm responsible for a site where we cater for all kinds of medical & legal problems. I recently conducted keyword research that shows a lot of questions being 'asked' in relation to the conditions we cater for. Naturally, I want to create content to answer these questions. We have a page for 'Cancer compensation' - the 'possible content' that answers questions won't necessarily help someone claiming compensation for cancer mistreatment, BUT someone who asks a question relating to cancer, answered in the 'possible content' may find the 'cancer compensation' page useful. SO! Do I: Add this content to the existing 'cancer compensation' page? Create individual pages of content answering each question, linking to the 'cancer compensation' page? or do I amalgamate all the answers into one heafty 'resource' page that sits elsewhere on the site? What do you think? Thanks in advance. John King
On-Page Optimization | | Muhammad-Isap0 -
Duplicate Page Content
Hey Moz Community, Newbie here. On my second week of Moz and I love it but have a couple questions regarding crawl errors. I have two questions: 1. I have a few pages with duplicate content but it say 0 duplicate URL's. How do I know what is duplicated in this instance? 2. I'm not sure if anyone here is familiar with an IDX for a real estate website. But I have this setup on my site and it seems as though all the links it generates for different homes for sale show up as duplicate pages. For instance, http://www.handyrealtysa.com/idx/mls...tonio_tx_78258 is listed as having duplicate page content compared with 7 duplicate URLS: http://www.handyrealtysa.com/idx/mls...tonio_tx_78247
On-Page Optimization | | HandyRealtySA
http://www.handyrealtysa.com/idx/mls...tonio_tx_78253
http://www.handyrealtysa.com/idx/mls...tonio_tx_78245
http://www.handyrealtysa.com/idx/mls...tonio_tx_78261
http://www.handyrealtysa.com/idx/mls...tonio_tx_78258
http://www.handyrealtysa.com/idx/mls...tonio_tx_78260
http://www.handyrealtysa.com/idx/mls...tonio_tx_78260 I've attached a screenshot that shows 2 of the pages that state duplicate page content but have 0 duplicate URLs. Also you can see somewhat about the idx duplicate pages. rel="canonical" is functioning on these pages, or so it seems when I view the source code from the page. Any help is greatly appreciated. skitch.png0 -
How about this "onpage overoptimisation" everybody is talking about? Are the on-page optimisation reports still to be used?
Are the on-page optimisation reports still to be used? If we do check all factors we risk penalization because of latest Panda update?
On-Page Optimization | | MugurCosminFrunzetti0 -
Do product pages need unique content or does having duplcate content hurt on those pages?
We are adding product rapidly to our website but this requires allowing duplicate to exist on our product pages of furniture-online.com. From an SEO standpoint do we need to make this content unique for each product. Since we aren't link building to specific product pages and we don't anticipate product pages being found in a search result, are we ok leaving the duplicate content in place and spending our dollars elsewhere?
On-Page Optimization | | gallreddy0 -
How woud you deal with Blog TAGS & CATEGORY listings that are marked a 'duplicate content' in SEOmoz campaign reports?
We're seeing "Duplicate Content" warnings / errors in some of our clients' sites for blog / event calendar tags and category listings. For example the link to http://www.aavawhistlerhotel.com/news/?category=1098 provides all event listings tagged to the category "Whistler Events". The Meta Title and Meta Description for the "Whistler Events" category is the same as another other category listing. We use Umbraco, a .NET CMS, and we're working on adding some custom programming within Umbraco to develop a unique Meta Title and Meta Description for each page using the tag and/or category and post date in each Meta field to make it more "unique". But my question is .... in the REAL WORLD will taking the time to create this programming really positively impact our overall site performance? I understand that while Google, BING, etc are constantly tweaking their algorithms as of now having duplicate content primarily means that this content won't get indexed and there won't be any really 'fatal' penalties for having this content on our site. If we don't find a way to generate unique Meta Titles and Meta Descriptions we could 'no-follow' these links (for tag and category pages) or just not use these within our blogs. I am confused about this. Any insight others have about this and recommendations on what action you would take is greatly appreciated.
On-Page Optimization | | RoyMcClean0