Question 1

How do I extract all URLs from my XML sitemap?

Accepted Answer

Paste your sitemap URL or upload the XML file. The tool instantly parses all URLs, showing you the full list with their path depth and any duplicate entries. Export everything to CSV with one click.

Question 2

Why should I check my sitemap for duplicate URLs?

Accepted Answer

Duplicate URLs in a sitemap waste your crawl budget — Google has a limit on how many pages it crawls per day. Duplicate entries also signal poor site structure and can split your ranking signals between the same content.

Question 3

What is path depth in a sitemap?

Accepted Answer

Path depth is the number of folder levels in a URL (counted by slashes after the domain). /blog/post = depth 2. Shallower URLs (depth 1-2) are generally prioritized by search engines and rank more easily.

Question 4

How many URLs should be in my sitemap?

Accepted Answer

A single sitemap file can contain up to 50,000 URLs with a maximum file size of 50MB. For larger sites, use a sitemap index file pointing to multiple sitemap files. Only include indexable, canonical URLs.

Question 5

Should I include every page in my sitemap?

Accepted Answer

No. Only include pages you want Google to index: main content pages, blog posts, product pages, category pages. Exclude: admin pages, login pages, thank-you pages, duplicate content, and pages with noindex tags.

Sitemap URL Extractor

Technical Audit

System FAQ

How do I extract all URLs from my XML sitemap?

Why should I check my sitemap for duplicate URLs?

What is path depth in a sitemap?

How many URLs should be in my sitemap?

Should I include every page in my sitemap?