What is Extractability?

Extractability measures whether AI systems can reliably pull the primary content from your pages.

Definition

Extractability is a derived score (0–100) that combines four signals to predict whether an AI system will successfully extract your primary content — or get confused by noise, empty shells, or unstructured text.

How SEODiff computes it

Extractability is a weighted composite of four components:

Extractability = 0.30 × Structure + 0.25 × Schema + 0.25 × Rendering + 0.20 × BloatEfficiency

Where BloatEfficiency is derived from the Token Bloat Ratio:

BloatEfficiency = clamp(100 / TokenBloatRatio × 5, 0, 100)

This means a page with 20× token bloat gets a BloatEfficiency of 25, while a page with 5× bloat or less gets 100.

Why it matters

A page can be accessible (bots aren't blocked) but still have terrible extractability. Common scenarios:

Score interpretation