Change Tracking with Crawl
Track changes across your entire website, including new, removed, and hidden pages
Change tracking becomes even more powerful when combined with crawling. Change tracking on an individual page shows you content changes; using it with crawl lets you monitor your entire website structure, including new pages, removed pages, and pages that have become hidden.
Basic Usage
To enable change tracking during a crawl, include it in the formats array of your scrapeOptions.
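As a rough illustration, a crawl request body with change tracking enabled might be assembled as in the sketch below. The format name "changeTracking", the "markdown" format, and the "url" and "limit" fields are assumptions made for this example; only the formats array inside scrapeOptions comes from the text above, so check the API reference for the exact names before relying on them.

```python
import json


def build_crawl_payload(url: str, limit: int = 50) -> dict:
    """Build a crawl request body that asks for change tracking on each page.

    All key names except "scrapeOptions" and "formats" are assumptions
    made for illustration.
    """
    return {
        "url": url,      # assumed field name for the crawl start URL
        "limit": limit,  # assumed field name for a page cap
        "scrapeOptions": {
            # Change tracking is requested as one entry in the formats array.
            # The literal string "changeTracking" is an assumption.
            "formats": ["markdown", "changeTracking"],
        },
    }


payload = build_crawl_payload("https://example.com")
print(json.dumps(payload, indent=2))
```

The payload is built as a plain dictionary so it can be passed to whichever HTTP client or SDK you already use.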
Understanding Change Status
When using change tracking with crawl, the changeStatus field becomes especially valuable:
- new: A page that didn’t exist in your previous crawl
- same: A page that exists and hasn’t changed since your last crawl
- changed: A page that exists but has been modified since your last crawl
- removed: A page that existed in your previous crawl but is no longer found
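The four statuses above lend themselves to a simple per-crawl summary. The sketch below groups page URLs by changeStatus. The record shape (a dict with a "url" key and a nested "changeTracking" object) is hypothetical and chosen only for illustration; adapt the lookups to the actual structure of your crawl results.

```python
from collections import defaultdict


def group_by_change_status(pages: list[dict]) -> dict[str, list[str]]:
    """Group page URLs by their changeStatus value.

    Assumes each page is a dict with a "url" key and a nested
    "changeTracking" dict; this shape is hypothetical.
    """
    groups: dict[str, list[str]] = defaultdict(list)
    for page in pages:
        tracking = page.get("changeTracking", {})
        # Pages without a status are collected under "unknown".
        status = tracking.get("changeStatus", "unknown")
        groups[status].append(page["url"])
    return dict(groups)


# Made-up sample data, not real crawl output.
sample_pages = [
    {"url": "https://example.com/", "changeTracking": {"changeStatus": "same"}},
    {"url": "https://example.com/blog", "changeTracking": {"changeStatus": "changed"}},
    {"url": "https://example.com/new-post", "changeTracking": {"changeStatus": "new"}},
    {"url": "https://example.com/old-post", "changeTracking": {"changeStatus": "removed"}},
]

summary = group_by_change_status(sample_pages)
for status in ("new", "same", "changed", "removed"):
    print(status, summary.get(status, []))
```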
Page Visibility
The visibility field helps you understand how pages are discovered:
- visible: The page is discoverable through links or the sitemap
- hidden: The page still exists but is no longer linked or in the sitemap
This is particularly useful for:
- Detecting orphaned content
- Finding pages accidentally removed from navigation
- Monitoring site structure changes
- Identifying content that should be re-linked or removed
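For example, orphaned content could be detected by filtering crawl results on the visibility field. This is a minimal sketch under the same hypothetical record shape (a "url" key plus a nested "changeTracking" object holding "visibility"); the real response structure may differ.

```python
def find_hidden_pages(pages: list[dict]) -> list[str]:
    """Return URLs of pages whose visibility is "hidden".

    Assumes a hypothetical record shape: {"url": ..., "changeTracking": {...}}.
    """
    hidden = []
    for page in pages:
        tracking = page.get("changeTracking", {})
        if tracking.get("visibility") == "hidden":
            hidden.append(page["url"])
    return hidden


# Made-up sample data, not real crawl output.
crawl_results = [
    {"url": "https://example.com/", "changeTracking": {"visibility": "visible"}},
    {"url": "https://example.com/legacy-pricing", "changeTracking": {"visibility": "hidden"}},
    {"url": "https://example.com/about", "changeTracking": {"visibility": "visible"}},
]

orphaned = find_hidden_pages(crawl_results)
print(orphaned)
```

Each URL in the returned list is a candidate for re-linking from navigation or for removal.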
Full Diff Support
For detailed change tracking with diffs, you can use the same options as described in the Change Tracking for Scrape documentation.