Two or more pages produce the exact same text content after stripping boilerplate. This is especially common in programmatic SEO setups where a template fails to inject unique data for certain entries. The JSON field is exact_duplicates.
Triggered when SimilarityKind == "exact". SEODiff computes text similarity using SimHash and flags 100% matches. The similarity threshold for near-duplicates is configurable (default 92%), but exact duplicates always count as an issue regardless of information gain score.
Severity weight: 12. Deductions: −15 on Indexability, −18 on Content score, plus an additional −10 per duplicate cluster. Search engines may choose to index only one version, effectively wasting all other duplicate pages.
/page, /page/, /Page without canonicalization.rel="canonical" to point parameter URLs back to the clean URL.