
robots.txt

A plain-text file at the site root telling crawlers which paths they may or may not request, following the Robots Exclusion Protocol.

robots.txt is the file served at /robots.txt that controls which URLs search-engine crawlers (and other bots) are permitted to fetch. It uses a simple line-based format: a User-agent: line selects which bots a group of rules applies to, followed by Disallow: and Allow: rules for that group. Under RFC 9309, when multiple rules match a URL, the most specific (longest) rule wins, with Allow taking precedence on ties.

A typical robots.txt:

User-agent: *
Allow: /
Disallow: /admin/
Disallow: /search?

Sitemap: https://example.com/sitemap.xml
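Rules like these can be exercised locally with Python's standard-library urllib.robotparser; a minimal sketch (the crawler name MyBot and the URLs are illustrative). One caveat: Python's parser returns the first rule that matches, rather than the longest match described above, so the catch-all Allow is listed after the specific Disallow lines here.

```python
from urllib import robotparser

# Rules mirroring the example file above. Python's parser applies the
# first matching rule, so the specific Disallow lines come before the
# catch-all Allow.
rules = """\
User-agent: *
Disallow: /admin/
Disallow: /search?
Allow: /
"""

rp = robotparser.RobotFileParser()
rp.parse(rules.splitlines())

# "MyBot" is an illustrative crawler name; the * group matches any agent.
print(rp.can_fetch("MyBot", "https://example.com/"))             # True
print(rp.can_fetch("MyBot", "https://example.com/admin/users"))  # False
print(rp.can_fetch("MyBot", "https://example.com/search?q=x"))   # False
```

The same parser can also load a live file with set_url() and read(), which is how a well-behaved crawler would check permissions before fetching.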

robots.txt blocks crawling, not indexing: if a blocked URL is linked from elsewhere, it can still appear in search results without a snippet. To prevent indexing, use <meta name="robots" content="noindex"> on the page itself, and keep the page crawlable so the crawler can actually read the meta tag.

Test changes via Google Search Console's URL Inspection tool, which reports how the live robots.txt evaluates against any URL.
