Sitemap URL Extractor
Paste a sitemap.xml URL and instantly get every page URL as a clean, copyable list. Sitemap indexes are followed automatically.
Free forever
No account, no paywall, no usage limits.
Built for speed
Runs instantly in your browser, no server round-trips.
Privacy first
Your data never leaves your device.
What is a sitemap URL extractor?
Common uses
- ✓Get a full URL inventory for a content audit
- ✓Feed a list into a crawler or analytics tool
- ✓Check competitor coverage by topic or section
- ✓Spot stale URLs that no longer belong in a sitemap
How to extract sitemap URLs
- 1
Paste your sitemap.xml URL.
- 2
Click Extract — nested indexes are followed automatically.
- 3
Filter or search the results.
- 4
Copy the list or download as CSV.
Why a sitemap is the cleanest URL inventory you'll ever get
Crawlers guess, sitemaps declare. A web crawler infers your site's URL list by following links, which means it misses orphan pages, picks up junk URLs that no canonical points to, and respects robots.txt rules that hide what you actually want to inventory. A sitemap.xml is the URL list as the site owner intended it — the pages they want indexed, with metadata about freshness and priority baked in.
That's why content audits, search-console submissions, AI training corpus assembly, and migration plans all start with the sitemap rather than the crawl. The extractor below flattens the XML into a plain list you can paste into anything: a spreadsheet, an HTTP status checker, a content brief, an internal slide. Sitemap indexes — the parent files that group multiple sitemaps — are followed automatically so you never have to chase nested URLs by hand.
What sitemap metadata actually means
loc is the URL. It's the only required field and the one this tool exports.
lastmod is the page's last meaningful update date. Google uses it as a hint to prioritise recrawls, so populating it accurately accelerates indexation of fresh content. Sites that set lastmod to "today" on every page sabotage their own credibility — Google learns to ignore the field.
changefreq and priority are largely ignored by modern crawlers. Don't waste time tuning them; the URL list and accurate lastmod values are what matter.
Sitemap workflows for serious sites
Quarterly URL audits. Extract the sitemap, diff it against the previous quarter, and you have a precise list of pages added, removed or renamed. Surface stale URLs that should be unpublished or redirected.
Migration planning. Before a domain change or platform swap, the sitemap is the canonical list of redirects you need to author. Export, map old to new, ship as a 301 list.
Index gap analysis. Cross-reference the sitemap with Google Search Console's indexed-pages report. Anything in the sitemap but not indexed is a coverage issue worth diagnosing.
Section health checks. Filter the URL list by directory and you can spot under-published categories that need editorial investment or over-thin categories that should be consolidated.
Frequently asked questions
Does it support nested sitemap indexes?+
Yes. The extractor follows nested sitemapindex files and aggregates URLs from every child sitemap.
Can I export the URL list?+
Yes — copy as plain text or download as CSV.
Are large sitemaps supported?+
Yes. The tool streams XML so even sitemaps with tens of thousands of URLs work fine.
More free tools by Bulk Slug Generator
A growing library of focused, browser-based tools for SEO, content and developer workflows.