Send us a URL. We return clean, validated schema.org JSON-LD, not raw HTML or noisy markdown: machine-readable facts your AI agent can use immediately. 200+ entity types identified and extracted.
Every URL is a target. We deploy the optimal extraction strategy automatically — API integration, meta tag parsing, browser rendering, or AI analysis.
Automatically detects the entity type on any page — Restaurant, Product, Event, Person, Article, MedicalCondition, SoftwareApplication, and 200+ more schema.org types.
Every response is specification-compliant JSON-LD with proper @context, @type, and validated property names. Drop it directly into your knowledge graph or downstream pipeline.
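As a rough illustration of what "proper @context and @type" means for a consumer, the sketch below checks a JSON-LD object before it is passed downstream. The sample payload and the helper name `looks_like_schema_org` are made up for this example; they are not taken from the service's actual responses.

```python
import json


def looks_like_schema_org(doc: dict) -> bool:
    """Return True if doc has a schema.org @context and a non-empty @type."""
    context = doc.get("@context")
    entity_type = doc.get("@type")
    # JSON-LD allows @context to be a single value or a list of contexts.
    contexts = context if isinstance(context, list) else [context]
    has_schema_context = any(
        isinstance(c, str) and "schema.org" in c for c in contexts
    )
    return has_schema_context and bool(entity_type)


# Hypothetical response body; every value here is invented for illustration.
sample = json.loads("""
{
  "@context": "https://schema.org",
  "@type": "Restaurant",
  "name": "Example Bistro",
  "servesCuisine": "French"
}
""")

if __name__ == "__main__":
    print(looks_like_schema_org(sample))
```

A check like this is only a shape test. It does not validate property names against the schema.org vocabulary.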
Three-tier acquisition system: stealth browser rendering for bot-protected sites, fast headless rendering for standard pages, and direct HTTP for lightweight targets.
Instead of feeding your AI agent 50 KB of raw HTML, we deliver a compact JSON object with only the facts that matter. Save tokens, reduce latency, increase accuracy.
Every extraction includes a confidence rating — high, medium, or low — based on extraction source and data quality. Know exactly how much to trust the intel.
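One way an agent might act on the three-level rating is to treat it as an ordered scale and apply a minimum threshold. The sketch below assumes the rating arrives as one of the strings "high", "medium", or "low"; the function name and the default threshold are illustrative choices, not part of the service.

```python
# Ordered scale for the three ratings named above.
CONFIDENCE_ORDER = {"low": 0, "medium": 1, "high": 2}


def meets_threshold(rating: str, minimum: str = "medium") -> bool:
    """Return True if rating is at least as strong as minimum."""
    return CONFIDENCE_ORDER[rating.lower()] >= CONFIDENCE_ORDER[minimum.lower()]


if __name__ == "__main__":
    for rating in ("high", "medium", "low"):
        print(rating, meets_threshold(rating))
```

An unknown rating raises `KeyError`, which is a deliberate choice here so that unexpected values are not silently trusted.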
Free difficulty check before paying. Know the expected schema type, scraping difficulty, and known blockers for any URL before committing funds.
For high-value domains, we bypass generic extraction entirely. Purpose-built modules deliver native-quality data at zero LLM cost.
Go beyond generic page types and drill down into rich organizational entities. We reliably extract nested contact details, founding data, geolocation, and organizational hierarchies into clean JSON-LD.
Extracts patent numbers, inventors, assignees, filing dates, citations, related patents, and direct PDF links from 40+ citation meta tags — no AI required.
Universal citation meta tag parser covering 30+ academic publishers. Extracts authors with affiliations, DOI, journal/volume/issue hierarchy, PDF links, references, and keywords.
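To show why citation meta tags can be read without AI, here is a minimal sketch that collects `citation_*` meta tags using only Python's standard library. The tag names in the sample (`citation_title`, `citation_author`, `citation_doi`) follow the widely used Highwire Press convention; the class and function names, and the sample HTML, are hypothetical and do not describe the service's own parser.

```python
from collections import defaultdict
from html.parser import HTMLParser


class CitationMetaParser(HTMLParser):
    """Collect <meta name="citation_*" content="..."> tags from a page."""

    def __init__(self):
        super().__init__()
        # A tag such as citation_author can repeat, so keep a list per name.
        self.tags = defaultdict(list)

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = (attr.get("name") or "").lower()
        content = attr.get("content")
        if name.startswith("citation_") and content:
            self.tags[name].append(content)


def parse_citation_tags(html: str) -> dict:
    """Return a plain dict mapping each citation tag name to its values."""
    parser = CitationMetaParser()
    parser.feed(html)
    return dict(parser.tags)


# Invented sample page head for illustration only.
SAMPLE_HTML = """
<html><head>
<meta name="citation_title" content="An Example Paper">
<meta name="citation_author" content="Doe, Jane">
<meta name="citation_author" content="Roe, Richard">
<meta name="citation_doi" content="10.1234/example.5678">
<meta name="description" content="ignored">
</head></html>
"""

if __name__ == "__main__":
    for name, values in parse_citation_tags(SAMPLE_HTML).items():
        print(name, values)
```

Because repeated tags are kept in document order, author order is preserved, which matters when mapping the result onto an ordered author list.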
Every URL runs through an intelligent pipeline that selects the fastest, most accurate extraction strategy.
URL validated, DNS checked, domain identified. Specialized fast-paths engaged if available.
Optimal scraping tier deployed — API call, stealth browser, or direct HTTP based on target defenses.
Existing JSON-LD analyzed. If insufficient, AI identifies entity types and extracts structured facts.
Validated schema.org JSON-LD with confidence scoring. Cached for rapid subsequent retrieval.
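The first two pipeline stages (validate the URL, then pick an acquisition tier) can be sketched as a small decision function. Everything below is an assumption made for illustration: the `Tier` names, the order of the checks, and the inputs `bot_protected` and `needs_javascript` are hypothetical, and how the real pipeline detects target defenses is not described here.

```python
from enum import Enum
from urllib.parse import urlparse


class Tier(Enum):
    FAST_PATH = "specialized fast-path"
    STEALTH_BROWSER = "stealth browser"
    HEADLESS_BROWSER = "headless browser"
    DIRECT_HTTP = "direct HTTP"


def validate_url(url: str) -> str:
    """Check the URL shape and return its lowercase hostname."""
    parts = urlparse(url)
    if parts.scheme not in ("http", "https") or not parts.hostname:
        raise ValueError(f"not a fetchable http(s) URL: {url!r}")
    return parts.hostname.lower()


def choose_tier(domain: str, fast_path_domains: set,
                bot_protected: bool, needs_javascript: bool) -> Tier:
    """Pick the cheapest tier that is expected to work (assumed ordering)."""
    if domain in fast_path_domains:
        return Tier.FAST_PATH
    if bot_protected:
        return Tier.STEALTH_BROWSER
    if needs_javascript:
        return Tier.HEADLESS_BROWSER
    return Tier.DIRECT_HTTP


if __name__ == "__main__":
    domain = validate_url("https://example.com/menu")
    print(choose_tier(domain, set(), bot_protected=False, needs_javascript=True))
```

The ordering reflects a cost assumption: a purpose-built module is tried first, and heavier browser rendering is used only when the target requires it.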
JSON Recon is accessible via the x402 payment protocol — the open standard for machine-to-machine payments over HTTP. AI agents discover our service at /.well-known/x402 and pay per-request using cryptocurrency on Base. No API keys, no subscriptions, no human sign-up required. Our endpoints are also listed in the x402 Bazaar, the protocol's machine-readable service catalog for automated discovery.
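For an agent author, the discovery step amounts to resolving the well-known path against the service origin and recognizing an HTTP 402 (Payment Required) response. The sketch below covers only those two pieces; the base URL is a placeholder, the helper names are hypothetical, and the payment payload and headers are left out because they are defined by the x402 specification rather than by this page.

```python
from http import HTTPStatus
from urllib.parse import urljoin


def discovery_url(service_base: str) -> str:
    """Resolve the well-known x402 discovery path against a service origin."""
    return urljoin(service_base, "/.well-known/x402")


def needs_payment(status_code: int) -> bool:
    """True when the server answered 402 Payment Required."""
    return status_code == HTTPStatus.PAYMENT_REQUIRED


if __name__ == "__main__":
    # Placeholder origin; substitute the real service host.
    print(discovery_url("https://api.example.com/v1/"))
    print(needs_payment(402))
```

Using `urljoin` with an absolute path means the discovery document is always looked up at the origin root, regardless of any path in the base URL.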