Malta SEO ProMalta SEO Pro
Technical SEO

Your Sitemap Is Broken and Google Won't Tell You: 7 Mistakes Killing Your Rankings

Daniel Vella24 March 20266 min read

You submitted your sitemap to Google Search Console. You see "Success" in green. You assume everything is fine.

It's not.

After auditing over 200 websites, we've found that more than 80% of XML sitemaps contain at least one critical error that silently prevents pages from being indexed. Google doesn't send you an email about it. Search Console sometimes shows "Couldn't fetch" with zero explanation.

Here are the 7 sitemap mistakes we see constantly, why they matter, and exactly how to fix them.

1. Missing the XML Schema Declaration

This is the single most common sitemap error we encounter. Your sitemap opens with <urlset> but is missing the namespace declaration that tells Google it's a valid sitemap.

Wrong:

<urlset>
  <url><loc>https://example.com/</loc></url>
</urlset>

Correct:

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
        xsi:schemaLocation="http://www.sitemaps.org/schemas/sitemap/0.9
        http://www.sitemaps.org/schemas/sitemap/0.9/sitemap.xsd">
  <url><loc>https://example.com/</loc></url>
</urlset>

Without the xmlns declaration, Google may accept your sitemap but fail to parse individual URLs. We've seen Search Console report "Sitemap could not be read" purely because of this missing attribute.

2. Submitting HTTP Instead of HTTPS

Check the URL in your Search Console sitemap submission right now. Is it http:// or https://?

If your site runs on HTTPS (and it should), submitting http://yoursite.com/sitemap.xml will fail. Google treats HTTP and HTTPS as different properties. The sitemap URL must match your canonical protocol.

Fix: Delete the HTTP sitemap submission and resubmit using https://yoursite.com/sitemap.xml.

3. Including Pages That Return Non-200 Status Codes

Your sitemap should only contain URLs that return a 200 OK response. We regularly find sitemaps listing:

  • 301 redirected pages - Google has to follow the redirect, wastes crawl budget
  • 404 pages - Tells Google your sitemap is unreliable
  • 500 error pages - Server errors that prevent indexing entirely
  • Soft 404s - Pages that return 200 but show "page not found" content

Every non-200 URL in your sitemap erodes Google's trust in the entire file. If enough URLs are broken, Google may stop processing your sitemap altogether.

Fix: Audit every URL in your sitemap. Remove anything that doesn't return a clean 200 response. Set up monitoring to catch new broken URLs before they pollute your sitemap.

4. Missing lastmod Dates (Or Using Fake Ones)

The <lastmod> tag tells Google when a page was last meaningfully updated. Most sitemaps either omit it entirely or worse, set every page to today's date.

Why it matters: Google uses lastmod to prioritise which pages to recrawl. If every page shows today's date, Google learns your lastmod is unreliable and ignores it completely. You lose the ability to signal "hey, I just updated this important page."

Best practice:

  • Only include lastmod if you track actual modification dates
  • Use the date format YYYY-MM-DD (ISO 8601)
  • Update lastmod only when the page content genuinely changes
  • Never auto-generate it with the current timestamp on every build

Google's John Mueller has confirmed: "If you're going to use lastmod, make sure it's accurate. We do use it, but only when we can trust it."

5. Listing URLs That Differ from Your Canonical Tags

This one is subtle and devastating. Your sitemap says:

<loc>https://example.com/services</loc>

But the page's <link rel="canonical"> tag says:

<link rel="canonical" href="https://example.com/services/" />

Notice the trailing slash difference? Google sees these as two different URLs. Your sitemap points to one, your canonical points to another. Google gets confused, and neither version may get indexed properly.

Fix: Ensure every <loc> URL in your sitemap exactly matches the canonical URL on that page. Check trailing slashes, HTTP vs HTTPS, and www vs non-www.

6. Stuffing Your Sitemap with Low-Value Pages

A sitemap is not a list of every URL on your site. It's a prioritised guide for Google. Including thousands of low-value pages dilutes the signal.

Pages that should NOT be in your sitemap:

  • Tag archive pages with little unique content
  • Paginated pages (/blog/page/2/, /blog/page/3/)
  • Search results pages
  • Thank-you and confirmation pages
  • Login, register, and account pages
  • Filtered or sorted product listing variations
  • PDF files and media attachments

Pages that SHOULD be in your sitemap:

  • Your homepage
  • Core service and product pages
  • Blog posts and articles
  • Location pages
  • Category landing pages
  • Any page you want Google to index and rank

Rule of thumb: If you wouldn't want the page to appear in Google search results, don't put it in your sitemap.

7. Not Connecting Your Sitemap to robots.txt

Your robots.txt file should explicitly tell Google where to find your sitemap. Many sites create a sitemap but never reference it in robots.txt.

User-agent: *
Allow: /
Disallow: /api/

Sitemap: https://yoursite.com/sitemap.xml

This is especially important because Google discovers sitemaps in three ways:

  1. The Sitemap: directive in robots.txt
  2. Manual submission in Search Console
  3. Following links (unreliable)

If you only rely on Search Console submission and something goes wrong with your account, Google may lose track of your sitemap entirely. The robots.txt directive is your safety net.

Bonus: Do You Actually Need a Sitemap?

Yes. Google's own documentation says sitemaps are "especially useful" when:

  • Your site is large (500+ pages)
  • Your site is new with few external links
  • Your site has rich media content
  • Your pages aren't well-linked internally

But even for small sites, a properly formatted sitemap signals professionalism and gives you explicit control over what Google crawls. There's no downside to having one, only downsides to having a broken one.

How to Validate Your Sitemap Right Now

  1. Open your sitemap in a browser: https://yoursite.com/sitemap.xml
  2. Check the XML declaration - Does it have xmlns and xsi:schemaLocation?
  3. Submit to Google Search Console using the HTTPS URL
  4. Click each URL in the sitemap - Do they all return 200?
  5. Compare URLs with canonical tags on each page
  6. Check robots.txt - Does it include Sitemap: https://...?

If any of these checks fail, you've found the reason Google isn't indexing your pages.

Need Help Fixing Your Sitemap?

At Malta SEO Pro, sitemap audits are part of every SEO engagement. We've fixed sitemap issues for over 50 businesses across Malta, often recovering pages that hadn't been indexed for months.

Get a free SEO audit and we'll check your sitemap as part of the assessment.

Ready to Boost Your Online Presence?

Get a free SEO audit for your Malta business and discover opportunities to rank higher on Google.

Get Your Free SEO Audit

We use cookies to improve your experience and analyze site traffic. By clicking “Accept”, you consent to our use of cookies. Read our Privacy Policy and Cookie Policy.