Web and SEO terms made simple
I give clear definitions, map related terms, and link to reliable resources.
I update it as I learn.
Crawl hygiene – Glossary
Further reading
Overview of crawling and indexing topics, Google Search Central
Central hub for crawl and index control, includes sitemaps, robots.txt, canonicalization, faceted navigation management, status codes, and site moves.
https://developers.google.com/search/docs/crawling-indexing
How HTTP status codes affect Google Search, Google Search Central
Clear effects of 2xx, 3xx, 4xx, 5xx, soft 404s, and network/DNS errors on crawling and indexing, practical for small sites to prevent crawl waste.
https://developers.google.com/search/docs/crawling-indexing/http-network-errors
Crawl Stats report, Search Console Help
Explains Crawl Stats data, request volumes, response types, host status, example URLs, and how redirect chains are counted, useful for measuring crawl activity coverage.
https://support.google.com/webmasters/answer/9679690?hl=en
Large site owner's guide to managing your crawl budget, Google Search Central
Practical guidance for big sites, covers blocking low value URLs, handling soft 404s and duplicates, keeping sitemaps clean, and monitoring crawl behavior.
https://developers.google.com/search/docs/crawling-indexing/large-site-managing-crawl-budget
Related terms
Last updated:
About this glossary
I’m building this glossary to become a better developer and website strategist, in my ongoing quest to build clearer, better websites. It maps how terms relate, adds practical examples, and highlights best practices so I can communicate more clearly with clients. If it helps others too, great.
It’s a living resource, I aim to cite sources clearly. If you spot errors or omissions, please contact me so I can update it. Drafts may be assisted by AI, I review and fact check before publishing. For details on how I use AI, see AI Use at Webmarks.