Sitemap URL Extractor

Extract all URLs from any XML sitemap. Find duplicate entries, analyze folder depth for crawl budget optimization, and export all links to CSV.

Loading your experience...

Please wait a moment

Technical Audit

This utility is a high-performance node optimized for modern browser environments. All data processing is executed client-side, ensuring zero knowledge transfer to external servers.

File upload or raw XML string parsing
Automatic duplicate URL detection
Path depth analysis (folder level counting)
100% Client-side — no server uploads
Bulk CSV export of all extracted URLs

System FAQ

How do I extract all URLs from my XML sitemap?

Paste your sitemap URL or upload the XML file. The tool instantly parses all URLs, showing you the full list with their path depth and any duplicate entries. Export everything to CSV with one click.

Why should I check my sitemap for duplicate URLs?

Duplicate URLs in a sitemap waste your crawl budget — Google has a limit on how many pages it crawls per day. Duplicate entries also signal poor site structure and can split your ranking signals between the same content.

What is path depth in a sitemap?

Path depth is the number of folder levels in a URL (counted by slashes after the domain). /blog/post = depth 2. Shallower URLs (depth 1-2) are generally prioritized by search engines and rank more easily.

How many URLs should be in my sitemap?

A single sitemap file can contain up to 50,000 URLs with a maximum file size of 50MB. For larger sites, use a sitemap index file pointing to multiple sitemap files. Only include indexable, canonical URLs.

Should I include every page in my sitemap?

No. Only include pages you want Google to index: main content pages, blog posts, product pages, category pages. Exclude: admin pages, login pages, thank-you pages, duplicate content, and pages with noindex tags.