╔═══════╗ ║ L 2 ║ ╚═══════╝
> Scrapfly provides web scraping and data extraction APIs for developers. It offers APIs for web scraping, screenshot generation, and data parsing. The service uses AI and LLMs for automated data extraction from web pages. Developers can use the APIs to programmatically retrieve and process data from websites.
> verified.
> sf_entry_point: none
> criteria breakdown
Level 1 - Basic Accessibility
[*] Uses semantic HTML elements (header, nav, main, article, section, footer) instead of just divs
[*] Has proper meta tags (title, description, og:tags, twitter:card)
[*] Includes Schema.org JSON-LD structured data
[*] Does not block automated access with CAPTCHAs on public content
[*] Content is server-side rendered (visible in HTML source, not just JS-rendered)
[*] Uses clean, readable URLs (no excessive query params or hash fragments)
Level 2 - Discoverability
[*] Has a robots.txt that allows legitimate bot access
[*] Provides an XML sitemap
[ ] Has a /llms.txt file describing the site for LLMs
[ ] Publishes an OpenAPI/Swagger specification for its API
[*] Has comprehensive, machine-readable documentation
[*] Primary content is text-based (not locked in images/videos/PDFs)
Level 3 - Structured Interaction
[*] Provides a structured REST or GraphQL API
[*] API returns JSON responses with consistent schema
[*] API supports search and filtering parameters
[ ] Has an A2A agent card at /.well-known/agent.json
[ ] Rate limits are documented and return proper 429 responses with Retry-After
[ ] API returns structured error responses with error codes and messages
Level 4 - Agent Integration
[*] Provides an MCP (Model Context Protocol) server
[ ] Supports WebMCP for browser-based agent interaction
[*] API supports write operations (POST/PUT/PATCH/DELETE), not just reads
[ ] Supports agent-friendly authentication (API keys, OAuth client credentials)
[*] Supports webhooks for event notifications
[ ] Write operations support idempotency keys
Level 5 - Autonomous Operation
[*] Supports event streaming (SSE, WebSockets) for real-time updates
[ ] Supports agent-to-agent capability negotiation
[ ] Has a subscription/management API for agents
[*] Supports multi-step workflow orchestration
[ ] Can proactively notify agents of relevant changes
[ ] Supports cross-service handoff between agents