30x-seo-sitemap

Installation

$npx skills add norahe0304-art/30x-seo --skill 30x-seo-sitemap

Summary

The agent can audit existing XML sitemaps for technical compliance and content quality issues, then generate new sitemaps with industry-specific templates and structural guidelines. Invoke this when a user needs sitemap validation, generation, or to flag crawlability and indexing problems.

SKILL.MD

Sitemap Analysis & Generation

Mode 1: Analyze Existing Sitemap

Validation Checks

  • Valid XML format
  • URL count <50,000 per file (protocol limit)
  • All URLs return HTTP 200
  • <lastmod> dates are accurate (not all identical)
  • No deprecated tags: <priority> and <changefreq> are ignored by Google
  • Sitemap referenced in robots.txt
  • Compare crawled pages vs sitemap — flag missing pages

Quality Signals

  • Sitemap index file if >50k URLs
  • Split by content type (pages, posts, images, videos)
  • No non-canonical URLs in sitemap
  • No noindexed URLs in sitemap
  • No redirected URLs in sitemap
  • HTTPS URLs only (no HTTP)

Common Issues

IssueSeverityFix
>50k URLs in single fileCriticalSplit with sitemap index
Non-200 URLsHighRemove or fix broken URLs
Noindexed URLs includedHighRemove from sitemap
Redirected URLs includedMediumUpdate to final URLs
All identical lastmodLowUse actual modification dates
Priority/changefreq usedInfoCan remove (ignored by Google)

Mode 2: Generate New Sitemap

Process

  1. Ask for business type (or auto-detect from existing site)
  2. Load industry template from assets/ directory
  3. Interactive structure planning with user
  4. Apply quality gates:
    • āš ļø WARNING at 30+ location pages (require 60%+ unique content)
    • šŸ›‘ HARD STOP at 50+ location pages (require justification)
  5. Generate valid XML output
  6. Split at 50k URLs with sitemap index
  7. Generate STRUCTURE.md documentation

Safe Programmatic Pages (OK at scale)

āœ… Integration pages (with real setup docs) āœ… Template/tool pages (with downloadable content) āœ… Glossary pages (200+ word definitions) āœ… Product pages (unique specs, reviews) āœ… User profile pages (user-generated content)

Penalty Risk (avoid at scale)

āŒ Location pages with only city name swapped āŒ "Best [tool] for [industry]" without industry-specific value āŒ "[Competitor] alternative" without real comparison data āŒ AI-generated pages without human review and unique value

Sitemap Format

Standard Sitemap

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/page</loc>
    <lastmod>2026-02-07</lastmod>
  </url>
</urlset>

Sitemap Index (for >50k URLs)

<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>https://example.com/sitemap-pages.xml</loc>
    <lastmod>2026-02-07</lastmod>
  </sitemap>
  <sitemap>
    <loc>https://example.com/sitemap-posts.xml</loc>
    <lastmod>2026-02-07</lastmod>
  </sitemap>
</sitemapindex>

Output

For Analysis

  • VALIDATION-REPORT.md — analysis results
  • Issues list with severity
  • Recommendations

For Generation

  • sitemap.xml (or split files with index)
  • STRUCTURE.md — site architecture documentation
  • URL count and organization summary

[PROTOCOL]: Update this header on changes, then check CLAUDE.md