How to Detect if Google Ignores Your Canonicals Using Google Search Console Data

May 18, 2024 at 4:53:09 AM

How to Detect if Google Ignores Your Canonicals Using Google Search Console Data

To check if Google is ignoring your canonicals and serving canonical child URLs, follow these steps:

  1. Crawl your website using a crawler like Screaming Frog that outputs URL, canonical status, and canonical path.

  2. Export your crawl and upload it to a Google Sheet. Name the tab 'Canonicalised URLs'.

  3. Create another tab in the same sheet called 'GSC Click Data'.

  4. Export the last 16 months of search console data. This can be done in the same sheet by clicking 'EXTENSIONS > ADD ONS > GET ADD ONS', then search for and install 'Search analytics for sheets'.

  5. Run 'Search Analytics for Sheets' from 'EXTENSIONS'.

  6. Find your URL under 'verified site'.

  7. Select 'GROUP BY:', choose 'PAGE' and click 'REQUEST DATA'. Adjust the number of 'ROWS' returned if your site has more than 25k URLs.

  8. Name the tab where data is dumped as 'GSC CLICK DATA'.

  9. In your 'Canonicalised URLs' sheet, insert columns in 'COLUMN B / C' and title them 'GSC Clicks' and 'GSC Impressions'.

  10. Paste =IFERROR(VLOOKUP(A2,'GSC Click Data'!$A$1:$ZZ$999999,2,FALSE),"-") in cell B2.

  11. Paste =IFERROR(VLOOKUP(A2,'GSC Click Data'!$A$1:$ZZ$999999,3,FALSE),"-") in cell C2.

  12. Cascade the formulas down the entire columns B / C.

  13. Freeze ROW 1 by clicking 'VIEW > FREEZE > 1 ROW'.

  14. Create a filter in ROW 1 by clicking 'DATA > CREATE A FILTER'.

  15. Apply a filter on cell C1 by selecting 'FILTER BY CONDITION', choosing 'greater than', entering '0', and clicking 'OK'.

Now, you have canonicalised URLs with GSC data. These URLs are crucial to rule out cannibalisation and ensure the canonical parent is the lead URL. In most cases, serving canonical child URLs is not desirable. If the data is based on the last 16 months, it might be a legacy issue. You can repeat the process but change the timeframe on the 'Search analytics for sheets' export to the last 3 months.

Q&A

Have more questions on this topic? Ask our AI assistant for in-depth insights.

Want Personalized Digital Marketing Insights at Your Preferred Time?

Our Smart Newsletter brings you the latest insights on the topics you love, delivered at your preferred time and frequency.

Discover More

Cloudflare Launches Free Tool to Block AI Bots from Scraping Websites

Cloudflare Launches Free Tool to Block AI Bots from Scraping Websites

Cloudflare
Cloudflare

Official Source

Official Source

Cloudflare is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Explains Why Soft 404s Are Bad for SEO

Google Explains Why Soft 404s Are Bad for SEO

Gary Illyes
Gary Illyes

Official Source

Official Source

Gary Illyes is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Screaming Frog SEO Spider 20.2 Update: Bug Fixes, Schema.org v27, PSI 12.0.0, Gemini 1.0

Screaming Frog SEO Spider 20.2 Update: Bug Fixes, Schema.org v27, PSI 12.0.0, Gemini 1.0

Screaming Frog
Screaming Frog

Official Source

Official Source

Screaming Frog is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Explains How to Mark Up Large Category Lists for Structured Data Carousels

Google Explains How to Mark Up Large Category Lists for Structured Data Carousels

Google Search Central
Google Search Central

Official Source

Official Source

Google Search Central is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Treats Uppercase and Lowercase URLs as Different Pages, Avoid Duplicate Content

Google Treats Uppercase and Lowercase URLs as Different Pages, Avoid Duplicate Content

Chris Long
Chris Long

Top Creator

Top SEO Creator

Chris Long is a Top SEO Creator. Part of Swipe Insight Select, a curated list of top creators.

Top SEO Creator
Applebot-Extended: Web Publishers Can Opt Out of AI Training via Robots.txt

Applebot-Extended: Web Publishers Can Opt Out of AI Training via Robots.txt