Free SEO Tool

Sitemap URL Extractor

Paste a sitemap.xml URL and instantly get every page URL as a clean, copyable list. Sitemap indexes are followed automatically.

100% private — everything runs in your browser. No upload, no signup.

Free forever

No account, no paywall, no usage limits.

Built for speed

Runs instantly in your browser, no server round-trips.

Privacy first

Your data never leaves your device.

What is a sitemap URL extractor?

It downloads a sitemap.xml file, follows any nested sitemap indexes, and returns the full flat list of <loc> URLs ready to copy or export.

Common uses

  • Get a full URL inventory for a content audit
  • Feed a list into a crawler or analytics tool
  • Check competitor coverage by topic or section
  • Spot stale URLs that no longer belong in a sitemap

How to extract sitemap URLs

  1. 1

    Paste your sitemap.xml URL.

  2. 2

    Click Extract — nested indexes are followed automatically.

  3. 3

    Filter or search the results.

  4. 4

    Copy the list or download as CSV.

Why a sitemap is the cleanest URL inventory you'll ever get

Crawlers guess, sitemaps declare. A web crawler infers your site's URL list by following links, which means it misses orphan pages, picks up junk URLs that no canonical points to, and respects robots.txt rules that hide what you actually want to inventory. A sitemap.xml is the URL list as the site owner intended it — the pages they want indexed, with metadata about freshness and priority baked in.

That's why content audits, search-console submissions, AI training corpus assembly, and migration plans all start with the sitemap rather than the crawl. The extractor below flattens the XML into a plain list you can paste into anything: a spreadsheet, an HTTP status checker, a content brief, an internal slide. Sitemap indexes — the parent files that group multiple sitemaps — are followed automatically so you never have to chase nested URLs by hand.

What sitemap metadata actually means

loc is the URL. It's the only required field and the one this tool exports.

lastmod is the page's last meaningful update date. Google uses it as a hint to prioritise recrawls, so populating it accurately accelerates indexation of fresh content. Sites that set lastmod to "today" on every page sabotage their own credibility — Google learns to ignore the field.

changefreq and priority are largely ignored by modern crawlers. Don't waste time tuning them; the URL list and accurate lastmod values are what matter.

Sitemap workflows for serious sites

Quarterly URL audits. Extract the sitemap, diff it against the previous quarter, and you have a precise list of pages added, removed or renamed. Surface stale URLs that should be unpublished or redirected.

Migration planning. Before a domain change or platform swap, the sitemap is the canonical list of redirects you need to author. Export, map old to new, ship as a 301 list.

Index gap analysis. Cross-reference the sitemap with Google Search Console's indexed-pages report. Anything in the sitemap but not indexed is a coverage issue worth diagnosing.

Section health checks. Filter the URL list by directory and you can spot under-published categories that need editorial investment or over-thin categories that should be consolidated.

Frequently asked questions

Does it support nested sitemap indexes?+

Yes. The extractor follows nested sitemapindex files and aggregates URLs from every child sitemap.

Can I export the URL list?+

Yes — copy as plain text or download as CSV.

Are large sitemaps supported?+

Yes. The tool streams XML so even sitemaps with tens of thousands of URLs work fine.

More free tools by Bulk Slug Generator

A growing library of focused, browser-based tools for SEO, content and developer workflows.

Browse all tools →