All posts
How-To Guides

How to Find and Fix 404 Errors in Your Sitemap

404 errors in your sitemap waste crawl budget and signal low-quality content to search engines. Here's how to find them systematically and fix each type.

June 22, 2026·4 min read

A 404 error means a page that used to exist no longer does — or never did. When 404s appear in your sitemap, you're telling search engines to crawl pages that return nothing, which wastes crawl budget and erodes trust. If enough dead pages accumulate, Google may reduce how often it crawls your site.

The good news is 404s are fixable, and fixing them is one of the highest-leverage technical SEO tasks you can do. This guide walks through how to find them, how to prioritise which to fix first, and the three ways to resolve each one.

How 404s End Up in Your Sitemap

Sitemaps are usually generated from the links a crawler discovers on your site. If a link points to a URL that returns 404, the crawler still logs it. This happens more than you'd think, for a few reasons:

  • Content was deleted — a blog post, product page, or landing page was removed but links to it still exist in navigation, footers, or other content.
  • URLs were changed — a site migration, CMS switch, or URL structure change left old URLs intact in navigation while the pages moved to new addresses.
  • Typos in links — a mistyped href in a nav link or internal article link points to a page that doesn't exist.
  • Third-party links — external sites linking to old URLs that have since changed.

Step 1 — Find All the 404s

Run a full crawl of your site. After the crawl completes, open the health report and filter by status code to show only 4xx errors. This gives you a list of every URL your crawler visited that returned a 404 (or 403, 410, etc.).

Export or screenshot this list — you'll need it for the next steps. Sort by URL path so that similar pages (e.g. /products/... or /blog/...) appear together — this makes it easier to spot patterns.

Step 2 — Decide How to Handle Each 404

Every 404 falls into one of three categories, and each category has a different fix.

Option A — Restore the page

If the page was deleted by mistake, or if it was a page that still has value (old blog posts, product pages for items you still sell), restore it. This is the cleanest fix — you get the page back, any backlinks pointing to it pass full value, and the 404 disappears from your report.

Option B — Redirect to the best matching page

If the page is gone for good but there's a relevant replacement, set up a 301 (permanent) redirect from the dead URL to the most relevant live page. For example:

  • A deleted product page → /products (the category page)
  • A renamed blog post → /blog/new-post-slug
  • An old landing page → the most relevant current service page

A 301 passes link equity (ranking signals) from the old URL to the new one, which protects any SEO value the original page had. Don't redirect to the homepage unless there's genuinely no better option — search engines treat "redirect everything to the homepage" as a soft 404.

Option C — Remove the broken link

If the page is gone and there's no suitable replacement, the right fix is to find where the link to that dead page lives on your site and remove it. Check your navigation menus, footer links, internal blog posts, and any curated link lists. Once nothing on your site points to the dead URL, the crawler won't discover it, and it drops out of your sitemap on the next crawl.

Step 3 — Prioritise What to Fix First

If you have dozens or hundreds of 404s, don't try to fix them all at once. Work in this order:

  1. Pages with inbound backlinks. If external sites are linking to a dead page, fixing the 404 with a 301 redirect recovers that link equity. Check Google Search Console under Links to find which of your 404s have backlinks.
  2. Pages linked from navigation or footers. These are crawled on every visit and hurt your crawl budget the most.
  3. High-traffic pages from your analytics history. If a page used to drive significant organic traffic, restoring or redirecting it can recover those rankings.
  4. Everything else — work through the remainder systematically, batching pages of the same type together.

Step 4 — Verify and Re-crawl

After making changes, trigger a new crawl. The health report will update to reflect the current state of your site. A URL that previously returned 404 should now show 200 (if restored) or 301 (if redirected). If it still shows 404, the redirect wasn't applied correctly — double-check your server config or CMS redirect settings.

Once your health score improves, resubmit your sitemap to Google Search Console so Googlebot picks up the changes promptly.

Preventing Future 404s

The best fix is prevention. A few habits that keep 404s from building up:

  • Before deleting or renaming any page, check whether anything links to it and either update those links or set up a redirect.
  • Enable auto-refresh on your sitemap tool so you get an updated health report after every scheduled crawl — catching new 404s before they compound.
  • When migrating platforms or restructuring URLs, create a redirect map before going live.

Try it free

Generate and check your sitemap in minutes

Crawl up to 2,000 URLs for free — no credit card required. See your health score and download your sitemap XML instantly.

Generate your sitemap →