Web4Agents

Make the web agent-ready. Open source.

Crawl4AI extracts data FROM websites for your agents.
Web4Agents makes websites ready FOR any agent to use.
Complementary tools. Same mission.

⭐ Star on GitHub See Demo

📄

llms.txt

AI content guide — tells agents what matters on your site

15 points

📋

llms-full.txt

Full clean markdown of site content for direct AI consumption

10 points

🤖

robots.txt AI Policy

Are AI bots (GPTBot, ClaudeBot, etc.) allowed or blocked?

15 points

🗺️

sitemap.xml

Discoverable page index for agents to navigate

5 points

🏷️

Schema.org / JSON-LD

Structured data agents can parse without guessing

15 points

🧩

Semantic HTML

Proper labels, ARIA, nav structure for agent navigation

10 points

🔌

WebMCP / Tool Contract

Structured actions agents can call directly

15 points

🛡️

Bot/CAPTCHA Blocking

Does the site actively block AI agents?

10 points

📖

Content Extractability

Can agents extract clean, meaningful content?

5 points

Roadmap

v0.1 — Now

Scanner

Check any site's Agent Readiness Score

v0.2 — Next

Generator

Auto-create llms.txt, Schema.org, WebMCP

v0.3 — Future

Dashboard

Web UI, batch scanning, trend tracking

Web4Agents

llms.txt

llms-full.txt

robots.txt AI Policy

sitemap.xml

Schema.org / JSON-LD

Semantic HTML

WebMCP / Tool Contract

Bot/CAPTCHA Blocking

Content Extractability

🔭 Watch It Scan

Roadmap

Scanner

Generator

Dashboard