Analyzes your page content to find named entities (people, companies, products, places, technologies) and generates JSON-LD schema markup that makes these entities machine-readable.
Uses regex-based heuristic NER (no LLM calls required) to identify capitalized multi-word phrases, known patterns for organizations, product names, place names, and technical terms. This approach is fast, deterministic, and works offline.
Each extracted entity is searched against the Wikidata API to find a matching knowledge graph entry. If found, the entity is enriched with its Wikidata ID, description, and type classification (Person, Organization, Place, etc.).
Generates <script type="application/ld+json"> blocks you can paste directly into your page. The output includes:
{
"@context": "https://schema.org",
"@type": "Organization",
"name": "Anthropic",
"sameAs": "https://www.wikidata.org/wiki/Q107432851",
"description": "American AI safety company"
}
GET /api/entity-schema?url=https://example.com/page