Apple updates Applebot documentation clarifying crawler differences and data usage

April 29, 2025 at 5:09:30 AM

TL;DR Apple updated its Applebot documentation to clarify the differences between Applebot and Applebot-Extended. Blocking Applebot-Extended in the robots.txt file prevents content from being used to train generative AI models. Applebot crawls data for features like Spotlight and Siri, and web publishers can control access through robots.txt. Applebot-Extended allows further control over data usage, while Applebot follows standard crawling rules.

Apple updates Applebot documentation clarifying crawler differences and data usage

Apple has updated its Applebot documentation to clarify the distinction between the standard Applebot crawler and the Applebot-Extended crawler. Although Applebot-Extended was introduced a year ago, the update emphasizes how blocking it affects Applebot's functionality. Applebot collects data that aids in training Apple foundation models for generative AI features across various Apple products.

To prevent content from being used for training generative models, web publishers can block Applebot-Extended by adding a directive in the robots.txt file. Allowing Applebot in robots.txt ensures that website content is discoverable through Apple services like Spotlight, Siri, and Safari.

Identifying Applebot

Traffic from Applebot can be identified using reverse DNS in the *.applebot.apple.com domain or by matching IP addresses with CIDR prefixes from the Applebot IP CIDRs JSON file. The host command can verify if an IP belongs to Applebot.

User Agents

Applebot utilizes various user agents, including:

  • Search: Identified by a user-agent string containing "Applebot."
  • Apple Podcasts: Identified by the user-agent "iTMS," which does not follow robots.txt as it only crawls registered content.

Customizing robots.txt Rules

Applebot respects standard robots.txt directives. For example, it will not crawl documents under /private/ or /not-allowed/ if specified. If robots.txt does not mention Applebot but includes Googlebot, Applebot will adhere to Googlebot's instructions.

Rendering and Indexing

Applebot may render website content, so blocking resources like JavaScript and CSS in robots.txt can hinder proper rendering. To ensure optimal indexing, all necessary resources should be accessible to Applebot.

Customizing Indexing Rules

Applebot supports robots meta tags in HTML documents. Key directives include:

  • noindex: Prevents indexing and visibility in Spotlight or Siri.
  • nosnippet: Disallows generation of descriptions.
  • nofollow: Prevents following links.
  • none: Combines all restrictions.
  • all: Allows indexing and snippet generation.

Multiple directives can be combined in a single meta tag.

Applebot-Extended and Data Usage Control

Applebot-Extended offers web publishers additional control over their content's usage in training AI models. By disallowing Applebot-Extended in robots.txt, publishers can opt out of their content being used for this purpose. However, disallowing it does not prevent the content from appearing in search results.

Search Rankings

Apple Search rankings may consider factors such as user engagement, relevancy of search terms, link quality, user location signals, and webpage design characteristics, without predetermined importance for each factor. Users are subject to the privacy policy governing Siri Suggestions and Search.

Have more questions on this topic? Ask our AI assistant for in-depth insights.

Read more from sources πŸ‘‡

The Only Digital Marketing Feed You'll Ever Need.

Stay informed your way. Tailored updates when and how you want them. 100% Free.

10,000+ Users

500+ Sources

1000+ Tools

Or

Related Posts

Google updates documentation for Google-Extended user agent and product token description

Google updates documentation for Google-Extended user agent and product token description

Google for Developers
Google for Developers

Official Source

Official Source

Google for Developers is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google updates Merchant Center documentation on energy efficiency and certification

Google updates Merchant Center documentation on energy efficiency and certification

Google for Developers
Google for Developers

Official Source

Official Source

Google for Developers is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Marketing Workflows Powered by AI

Marketing Workflows Powered by AI

Featured
Markifact
Markifact

Verified Sponsor

Verified Sponsor

Markifact is a Verified Sponsor. Want to get featured here? Contact us.

Verified Sponsor
Google to Drop Support for Special Announcement Structured Data by July 2025

Google to Drop Support for Special Announcement Structured Data by July 2025

Google for Developers
Google for Developers

Official Source

Official Source

Google for Developers is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google to Redirect Country Code Domains to Google.com Trending ️‍πŸ”₯

Google to Redirect Country Code Domains to Google.com

Google
Google

Official Source

Official Source

Google is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Expands Availability for Structured Data Carousels in EEA Countries

Google Expands Availability for Structured Data Carousels in EEA Countries

Google for Developers
Google for Developers

Official Source

Official Source

Google for Developers is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Search Console API Adds Support for Hourly Data Trending ️‍πŸ”₯

Google Search Console API Adds Support for Hourly Data

Google Search Central
Google Search Central

Official Source

Official Source

Google Search Central is a Official Source. The source has been verified by Swipe Insight team.

Official Source
Google Discover will expand to desktop

Google Discover will expand to desktop

Related Tools

Markifact logo

Markifact

Verified Tool

Verified Tool

Markifact is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Marketing Workflows Powered by AI

Featured
Marketing Auditor logo

Marketing Auditor

Verified Tool

Verified Tool

Marketing Auditor is a Verified Tool. Want to get this badge? Contact us.

Verified Tool

Automated audits for Google Ads and Analytics.

Get Featured Here

Showcase your tool in this list.

Contact Us
Ahrefs logo

Ahrefs

SEO tools to boost traffic and rank higher

SEO
Surfer SEO logo

Surfer SEO

SEO content creation and optimization made easy

SEO
Sitebulb logo

Sitebulb

Efficient website crawler for better SEO audits

SEO
Screpy logo

Screpy

AI-Powered SEO and Web Analysis Simplified

SEO
Blogify logo

Blogify

Convert multimedia to SEO-optimized blogs fast

SEO
Answer the Public logo

Answer the Public

Unlock Consumer Insights for Content Creation

SEO
SEO Writing AI logo

SEO Writing AI

AI-powered SEO content in 1 click

SEO
SEO Stuff logo

SEO Stuff

Affordable SEO tools without monthly fees

SEO

Get Featured Here

Showcase your tool in this list.

Contact Us