arstechnica.com 64 C
🛡️ SEO 63 🤖 GEO 71 ⚡ Perf 39 🏗️ Arch 83

arstechnica.com — Global SEODiff Score 64/100

arstechnica.com
📊

At 77/100, the ACRI for arstechnica.com indicates strong fundamentals in AI extractability, surpassing the majority of indexed sites. Within the infrastructure vertical, this places arstechnica.com above the industry average of 58, suggesting strong competitive positioning in AI search. Server-side rendering keeps the ghost ratio near zero, giving AI systems direct access to all visible content. The 11.9× token bloat ratio falls within the normal range, though there is room to trim navigation, footer, and script overhead. Only 1 schema block is present; adding Organization, WebSite, and Breadcrumb schemas would significantly improve structured data coverage. The site maintains an open-door policy for AI crawlers: GPTBot, ClaudeBot, and other major agents are all allowed.

64
C — Global SEODiff Score
Comprehensive search visibility assessment
Strong foundations, but Performance (39) is your bottleneck.
🎯 Top Fix: Fix title tag length → +3 pts
🔬 Automated SEODiff Assessment · Snapshot: Feb 28, 2026 · 📋 API
🧮 Score Transparency — How is this calculated?
🛡️ Traditional SEO (25% weight): 63 × 0.25 = 15.8
🤖 AI Readiness / GEO (40% weight): 71 × 0.40 = 28.4
⚡ Performance (20% weight): 39 × 0.20 = 7.8
🏗️ Architecture & Trust (15% weight): 83 × 0.15 = 12.4
Weighted sum = 15.8 + 28.4 + 7.8 + 12.4 = 64.4
Global SEODiff Score = 64 (C)
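
The weighted sum above can be reproduced with a few lines of Python. This is only a sketch of the arithmetic shown in this section: the pillar scores and weights are copied from the report, while the function name `global_score` and the final rounding to a whole number are assumptions made for illustration.

```python
# Pillar scores and weights as listed in the Score Transparency section.
PILLARS = {
    "Traditional SEO": (63, 0.25),
    "AI Readiness / GEO": (71, 0.40),
    "Performance": (39, 0.20),
    "Architecture & Trust": (83, 0.15),
}


def global_score(pillars):
    """Return the weighted sum of pillar scores (weights are expected to sum to 1)."""
    return sum(score * weight for score, weight in pillars.values())


weighted = global_score(PILLARS)
print(f"Weighted sum: {weighted:.1f}")      # Weighted sum: 64.4
print(f"Rounded score: {round(weighted)}")  # Rounded score: 64
```

The per-pillar products in the report are displayed to one decimal place, so individual lines may differ slightly from the unrounded products (for example 63 × 0.25 is 15.75).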
📊 ACRI Sub-Scores (AI Readiness Detail)
Bot Access: 100 (avg 92)
Rendering: 100 (avg 93)
Structure: 35 (avg 36)
Schema: 42 (avg 9)
Tech Stack: 85 (avg 63)
🔀 Visibility Delta: Google vs AI
Google (Tranco): Top 0.2% (Rank #1890)
AI (ACRI): Top 14% (Score 77/100)
Gap: +14 pts

arstechnica.com shows stronger AI visibility than traditional SEO ranking. Great AI foundation to build on. ACRI measures technical crawler readiness. Read the methodology →

Why arstechnica.com ranks here

Tech stack: WordPress
Rendering: SSR
Schema coverage: 1 block
Token bloat: 11.9×

Fastest improvements

  • Reduce token bloat (navigation/footer/code) so agents reach your main content faster (see Token Bloat).
  • Create an llms.txt file so AI crawlers can discover your content structure without heavy crawling.
  • Run a full entropy audit to find which DOM regions waste the most tokens.
🧪

JavaScript Rendering Check

We check what AI crawlers miss when they skip JavaScript execution.

🛡️

Traditional SEO

63/100 · 25% of Global Score · 🟢 High Confidence

📝 Title Tag

80 chars
Too long

Optimal range: 30–60 characters for SERP display.

📋 Meta Description

144 chars
Good length

Optimal range: 120–160 characters for snippet control.
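
The two length checks above amount to a simple range test. The sketch below uses the ranges stated in this report (30 to 60 characters for the title, 120 to 160 for the meta description); the function name `length_verdict` and the "Too short" label are assumptions, since the report only shows the "Too long" and "Good length" verdicts.

```python
# Optimal character ranges quoted in the report.
TITLE_RANGE = (30, 60)
META_DESCRIPTION_RANGE = (120, 160)


def length_verdict(text_length, optimal_range):
    """Classify a character count against an inclusive (low, high) range."""
    low, high = optimal_range
    if text_length < low:
        return "Too short"  # hypothetical label, not shown in the report
    if text_length > high:
        return "Too long"
    return "Good length"


print(length_verdict(80, TITLE_RANGE))              # Too long
print(length_verdict(144, META_DESCRIPTION_RANGE))  # Good length
```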

🔤 Heading Hierarchy

  • ✓ Exactly 1 <h1> tag — found 1
  • ✓ Has <h2> headings — found 40
  • ✓ <h2> not before <h1>

🔍 Indexability

  • ✓ Canonical tag present → https://arstechnica.com/
  • ✓ No noindex directive
  • ✓ Meta viewport set
  • ✓ HTML lang attribute → en
  • ✗ Hreflang tags
  • ✓ Googlebot allowed by robots.txt

🌐 Social / OpenGraph

  • ✓ og:title — Ars Technica
  • ✓ og:description — News and reviews, covering IT, AI, science, space, health, gaming, cybersecurity, tech policy, computers, mobile devices, and operating systems.
  • ✓ og:image — preview
  • ✓ twitter:card — summary_large_image
📐 How the SEO Pillar score is calculated

SEO Pillar = Title (20 pts) + Meta Desc (20 pts) + Heading Hierarchy (20 pts) + Indexability (20 pts) + Social/OG (20 pts)

Each sub-score is derived from the checks above. Canonical tag, lang attribute, og:image, and a single H1 are the highest-impact items.

🤖

AI Readiness / GEO

71/100 · 40% of Global Score · 🟢 High Confidence

This pillar aggregates citation share, hallucination risk, bot access, schema health, and content extractability. The individual diagnostic sections below contribute to this score.

🔗

Citation Alternatives

💡 Insight: In the infrastructure sector, safely.co.jp (ACRI: 90) currently has stronger AI extractability. AI models tend to prefer sources with higher semantic structure and schema coverage. Domains with ACRI < 40 see 3.5× more hallucinations.
Your ACRI Score (arstechnica.com): 55
Industry Peer ACRI: 90
AI models prioritize pages with strong semantic structure and schema coverage. safely.co.jp has schema coverage of 3 blocks and uses WordPress. Improve your score by implementing the remediation patches below.
🚨

Hallucination Risk

Research

Is AI lying about your brand? This panel measures how likely LLMs are to hallucinate facts when extracting information from your page.


🤖 Bot Access Matrix

GPTBot (OpenAI): Allowed
ClaudeBot (Anthropic): Allowed
CCBot (Common Crawl): Allowed
Google-Extended: Allowed
Googlebot: Allowed

👻 Rendering (Ghost Ratio)

Ghost Ratio: 0% (scale: 0% = Safe, 100% = Risk)
Status: Server-Side Rendered (Safe)
Rendering Type: SSR

📊 Structure & Information Density

Structure Grade: 35/100 (Low)
Structured Elements: 64 elements (64 lists, 0 rows, 0 headers)
Total Words: 1710
Raw Density: 3.7%
💡 Low structure score (35/100). Your content appears as a wall of text with few structured HTML elements. You have 64 list items, 0 table rows, 0 table headers. Convert features into <ul> lists and data into <table> elements to help AI models extract structured information.

🏷️ Schema Health

Organization Schema: ✅ Present
Product / Service Schema: ⚠️ Not Found
Total Schema Blocks: 1 block (Basic, low value for AI)

Schema Coverage Map

2/7 schema types detected
✅ Organization
❌ Product/Service
❌ Breadcrumb
❌ FAQ
❌ Article
✅ WebSite
💡 Product / Service schema missing. AI models don't know this is a SaaS product. Add Product or SoftwareApplication schema so AI understands what you offer and can surface pricing/features.
💡 BreadcrumbList schema missing. AI cannot understand your site hierarchy or how pages relate to each other.
💡 FAQ schema missing. Adding FAQPage schema lets AI models directly extract Q&A pairs for Featured Snippets and chatbot answers.

📐 AI Efficiency Metrics

AI Extractability: 54/100 (AI models can partially extract answers from this page)
Crawl Cost: Medium (50/100), moderate for AI crawlers to process
Blocklist Risk: None (0 of 5 AI crawlers blocked)

Token Bloat

Useful Content: 8% (33.1 KB)
Bloat: 92% (360.9 KB)
Token Bloat Ratio: 11.9× (Normal)
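
The token bloat figures are consistent with a simple ratio of raw HTML size to visible text size. The sketch below assumes that definition (the report does not state the formula) and uses the sizes reported for this page: 394.0 KB of raw HTML and 33.1 KB of visible text. The function name `token_bloat` is illustrative only.

```python
def token_bloat(raw_html_kb, visible_text_kb):
    """Derive bloat metrics from raw HTML size and visible text size (both in KB).

    Assumption: bloat ratio = raw HTML size / visible text size.
    """
    return {
        "ratio": raw_html_kb / visible_text_kb,
        "useful_share": visible_text_kb / raw_html_kb,
        "bloat_kb": raw_html_kb - visible_text_kb,
    }


metrics = token_bloat(raw_html_kb=394.0, visible_text_kb=33.1)
print(f"Bloat ratio: {metrics['ratio']:.1f}x")            # Bloat ratio: 11.9x
print(f"Useful content: {metrics['useful_share']:.0%}")   # Useful content: 8%
print(f"Bloat: {metrics['bloat_kb']:.1f} KB")             # Bloat: 360.9 KB
```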

Multimodal Readiness

Visual Context: 57% Optimized for Vision
Image Alt Coverage: 35 / 61 images have alt text

TDM Rights

TDM-Reservation Header: Not set
X-Robots-Tag: noai (Not set)

🔥 Structural Entropy Check

Entropy: 0 (Poor)
Token Bloat: High
Noise Ratio: 91.6% · SNR: 0.09 · Signal: 8464 / Noise: 92396 tokens
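
The noise ratio and SNR on the line above follow from the two token counts. This sketch assumes noise ratio = noise / (signal + noise) and SNR = signal / noise, which reproduces the reported values; the report itself does not spell out the formulas, and the function name is hypothetical.

```python
def entropy_metrics(signal_tokens, noise_tokens):
    """Return (noise_ratio, snr) for a page's signal and noise token counts."""
    total = signal_tokens + noise_tokens
    noise_ratio = noise_tokens / total   # share of all tokens that are noise
    snr = signal_tokens / noise_tokens   # signal-to-noise ratio
    return noise_ratio, snr


noise_ratio, snr = entropy_metrics(signal_tokens=8464, noise_tokens=92396)
print(f"Noise ratio: {noise_ratio:.1%}")  # Noise ratio: 91.6%
print(f"SNR: {snr:.2f}")                  # SNR: 0.09
```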

🧠

The LLM Interpretation

AI-VERIFIED

A local LLM (mlx-community/Qwen2.5-7B-Instruct-4bit) analyzed the extracted content of arstechnica.com and produced this structured business intelligence. Fields marked SEMANTIC VOID indicate information the AI could not find — a critical gap in your site’s machine-readability.

Core Offering: Measure gravitational redshift with quantum superposition
Target Audience: Academic researchers, theoretical physicists, gravitational scientists
Pricing Model: ⚠ SEMANTIC VOID
🏆 Competitive Moat: More accurate measurement of gravitational redshift by 10,000 times
📊 Content Depth: 5/10
⚡ Key Pain Points:
• No structured FAQ schema
• Thin landing pages for features
Model: mlx-community/Qwen2.5-7B-Instruct-4bit · Analyzed: 2026-02-27 · Data extracted from the site’s main content via strict JSON prompting.

🔧 Tech Stack

Framework: WordPress
AI-Readiness Score: 85/100
Server: (no value reported)
CDN: (no value reported)
HTTP Status: 200
Load Time: 1123 ms
Raw HTML Size: 394.0 KB
Visible Text Size: 33.1 KB

Performance & Speed

39/100 · 20% of Global Score · 🟢 High Confidence

⏱️ Time to First Byte

1123 ms
Slow — bots may time out or deprioritise

Google considers <200 ms "good". AI crawlers may have even shorter timeouts.

📦 Page Weight

DOM nodes: 2448
HTML payload: 394 KB
Heavy page: consider reducing DOM complexity

🗄️ Cache & CDN

  • ✗ Cache-Control header
  • ✗ CDN cache status
  • ✗ CDN detected

🔬 Tracker Tax

Tracker scripts: 0
Third-party domains: 0
Token overhead: 0.0%
Minimal tracker load: clean signal for bots
📐 How the Performance Pillar score is calculated

Perf Pillar = TTFB (35 pts) + Page Weight (25 pts) + Cache/CDN (20 pts) + Tracker Tax (20 pts)

TTFB <200 ms = full marks. DOM >3000 or payload >300 KB incurs heavy penalties. Tracker scripts beyond 5 reduce score.

🏗️

Architecture & Trust

83/100 · 15% of Global Score · 🟢 High Confidence

🗺️ Sitemap & Robots

  • ✓ Sitemap declared in robots.txt → https://arstechnica.com/sitemap.xml
  • ✓ Googlebot allowed
  • ✓ GPTBot allowed
  • ✓ ClaudeBot allowed

🔗 Linking

Internal links: 191
External links: 8
Good internal linking: helps crawlers discover content

🔒 Security & Trust

  • ✗ HSTS header (Strict-Transport-Security)
  • ✓ Content-Security-Policy header
  • ✓ HTTP status 200 OK (got 200)

♿ Accessibility Signals

  • ✓ HTML lang attribute → en
  • ✓ Meta viewport for mobile
  • ✓ Single H1 for screen readers
📐 How the Architecture Pillar score is calculated

Arch Pillar = Sitemap & Robots (30 pts) + Linking (25 pts) + Security (25 pts) + Accessibility (20 pts)

Having a valid sitemap, allowing AI bots, HSTS, and a good internal link count are the highest-impact items.

🏅 AI-Verified Trust Badge

Your site scores 55/100. Reach 80+ to unlock the green "AI-Verified" badge. Fix the issues below to improve your score.

AI-Verified badge for arstechnica.com
Pending Audit — score below 80 threshold
<a href="https://seodiff.io/radar/domains/arstechnica.com" rel="noopener"><img src="https://seodiff.io/api/v1/badge?domain=arstechnica.com" alt="AI-Verified by SEODiff" width="280" height="52"></a>

💡 Paste in your site footer, GitHub README, or email signature. Badge updates automatically as your score changes.

Deep Crawl Analysis · 19 pages · Deep-10

Homepage ACRI: 55 (single-page score)
Δ delta: -6 (moderate hidden bloat)
Site-Wide ACRI: 49 (avg across 19 pages · range 41–64)
Topical Cohesion: 15% (Topical Drift; TF-IDF cosine similarity)
Total Words: 8366
Avg Bloat: 134.7×
RAG Fractures: 1
⚠️ 1 RAG-Chunking Fracture Detected

Poorly formatted tables or pricing grids on 1 page will be split incorrectly during RAG chunking, causing AI models to hallucinate prices and features.

Page | Title | Type | ACRI | Token Bloat | Words | Status
https://arstechnica.com/faq | New precision for relativity's gravitational redshift - Ars Technica | pricing | 64 | 31.7× | 881 | 💰 Pricing
https://arstechnica.com/features | Category: Features - Ars Technica | pricing | 61 | 45.8× | 1210 | 💰 Pricing
https://arstechnica.com/tag/api/ | Tag: API - Ars Technica | pricing | 61 | 49.4× | 1112 | ⚠️ RAG Fracture
https://arstechnica.com/tag/developer/ | Tag: developer - Ars Technica | pricing | 61 | 46.3× | 1117 | 💰 Pricing
https://arstechnica.com/tag/demo/ | Tag: demo - Ars Technica | pricing | 61 | 44.4× | 1125 | 💰 Pricing
https://arstechnica.com/tag/gdpr/ | Tag: gdpr - Ars Technica | pricing | 61 | 48.1× | 1107 | 💰 Pricing
https://arstechnica.com/tag/documentation/ | Tag: documentation - Ars Technica | pricing | 51 | 103.9× | 250 | 💰 Pricing
https://arstechnica.com/tag/case-study/ | Tag: case study - Ars Technica | pricing | 51 | 108.0× | 229 | 💰 Pricing
https://arstechnica.com/tag/faq/ | Tag: FAQ - Ars Technica | pricing | 51 | 98.0× | 259 | 💰 Pricing
https://arstechnica.com/tag/doc/ | Tag: DOC - Ars Technica | pricing | 41 | 197.5× | 106 | 💰 Pricing
https://arstechnica.com/tag/docs/ | Tag: docs - Ars Technica | pricing | 41 | 171.2× | 127 | 💰 Pricing
https://arstechnica.com/tag/articles/ | Tag: Articles - Ars Technica | pricing | 41 | 204.0× | 103 | 💰 Pricing
https://arstechnica.com/tag/article/ | Tag: article - Ars Technica | pricing | 41 | 209.6× | 100 | 💰 Pricing
https://arstechnica.com/tag/company/ | Tag: company - Ars Technica | pricing | 41 | 165.1× | 132 | 💰 Pricing
https://arstechnica.com/tag/customers/ | Tag: customers - Ars Technica | pricing | 41 | 210.8× | 100 | 💰 Pricing
https://arstechnica.com/tag/contact/ | Tag: contact - Ars Technica | pricing | 41 | 204.6× | 103 | 💰 Pricing
https://arstechnica.com/tag/compliance/ | Tag: compliance - Ars Technica | pricing | 41 | 199.2× | 106 | 💰 Pricing
https://arstechnica.com/tag/career/ | Tag: career - Ars Technica | pricing | 41 | 205.4× | 102 | 💰 Pricing
https://arstechnica.com/tag/careers/ | Tag: careers - Ars Technica | pricing | 41 | 217.2× | 97 | 💰 Pricing
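
The site-wide figures in the summary tiles can be recomputed from the per-page rows above. The sketch below copies the ACRI, token bloat, and word count of each of the 19 pages from the table and assumes the aggregates are plain arithmetic means and sums; the variable names are illustrative.

```python
from statistics import mean

# (ACRI, token bloat ratio, word count) for the 19 crawled pages, in table order.
PAGES = [
    (64, 31.7, 881), (61, 45.8, 1210), (61, 49.4, 1112), (61, 46.3, 1117),
    (61, 44.4, 1125), (61, 48.1, 1107), (51, 103.9, 250), (51, 108.0, 229),
    (51, 98.0, 259), (41, 197.5, 106), (41, 171.2, 127), (41, 204.0, 103),
    (41, 209.6, 100), (41, 165.1, 132), (41, 210.8, 100), (41, 204.6, 103),
    (41, 199.2, 106), (41, 205.4, 102), (41, 217.2, 97),
]

acri_scores = [acri for acri, _, _ in PAGES]
site_wide_acri = mean(acri_scores)
avg_bloat = mean(bloat for _, bloat, _ in PAGES)
total_words = sum(words for _, _, words in PAGES)

print(f"Pages: {len(PAGES)}")                                 # Pages: 19
print(f"Site-wide ACRI: {round(site_wide_acri)}")             # Site-wide ACRI: 49
print(f"Range: {min(acri_scores)}-{max(acri_scores)}")        # Range: 41-64
print(f"Avg bloat: {avg_bloat:.1f}x")                         # Avg bloat: 134.7x
print(f"Total words: {total_words}")                          # Total words: 8366
```

Under the same assumption, the reported delta of -6 is the site-wide average (49) minus the homepage ACRI (55).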
📂
Health by Sub-Directory
Average ACRI and top issues aggregated by URL path prefix
Path Pages Avg ACRI Ghost % Bloat Top Issue
/tag/ 17 48 1% 146.0× Bot Blocked
/faq/ 1 64 0% 31.7× High JS Bloat
/features/ 1 61 1% 45.8× Bot Blocked
🔗
Outbound External Citations
0 unique external domains cited across 19 pages
aboutads.info ×19
youtube.com ×19
bsky.app ×19
mastodon.social ×19
condenast.com ×19
facebook.com ×19
instagram.com ×19
dx.doi.org ×1

Scores update automatically each month. Create a free account for on-demand re-crawls (3/month free).

🔌 API Access

Pull this data programmatically. All sub-page metrics are available via our public API.

curl https://seodiff.io/api/v1/deep10/domain/arstechnica.com

Get your free API key — 100 requests/month included.
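
For scripted use, the same endpoint can be called from Python's standard library. This is a minimal sketch under several assumptions: that the endpoint returns JSON, and that no extra headers are required. The report mentions an API key but does not say how it is passed, so authentication is deliberately left out; the helper names `deep10_url` and `fetch_deep10` are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Endpoint prefix taken from the curl example in this section.
API_BASE = "https://seodiff.io/api/v1/deep10/domain/"


def deep10_url(domain):
    """Build the Deep-10 endpoint URL for a domain."""
    return API_BASE + quote(domain, safe="")


def fetch_deep10(domain, timeout=10):
    """Fetch and decode the response (assumes a JSON body; not verified here)."""
    with urlopen(deep10_url(domain), timeout=timeout) as response:
        return json.load(response)


print(deep10_url("arstechnica.com"))
# https://seodiff.io/api/v1/deep10/domain/arstechnica.com
```

Calling `fetch_deep10("arstechnica.com")` performs the network request; it is not invoked here so the snippet runs offline.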

🔗 Similar infrastructure Sites

Domains with a similar tech stack, industry, and AI readiness profile to arstechnica.com. Compare side-by-side.

Domain | ACRI | AI Score | Tech Stack | Token Bloat | Schema
arstechnica.com (this site) | 55 | 77 | WordPress | 11.9× | 1
truemfg.com | 80 | 90 | WordPress | 8.7× | 1
editors.ca | 80 | 90 | WordPress | 6.1× | 1
rubankov.ru | 80 | 90 | WordPress | 9.3× | 3
me-q.jp | 80 | 89 | WordPress | 3.7× | 1
qjweb.jp | 80 | 90 | WordPress | 4.1× | 2
👻

Rendering Drop-Off by Depth

Depth penalty · 19 pages
Homepage Ghost: 40%
Deep Pages Avg Ghost: 60%
Homepage Bloat: 32%
Deep Pages Avg Bloat: 140%
Ghost Ratio by Crawl Depth
Depth 0: 40% (1 pg · ACRI 64)
Orphan: 60% (18 pg · ACRI 48)
⚠️ Worst Rendering Pages
60% https://arstechnica.com/tag/careers/ orphan
60% https://arstechnica.com/tag/customers/ orphan
60% https://arstechnica.com/tag/article/ orphan
60% https://arstechnica.com/tag/career/ orphan
60% https://arstechnica.com/tag/contact/ orphan
Deeper pages lose rendering fidelity. Pages at depth 4+ often show 2-3× higher ghost ratios than the homepage, meaning AI crawlers see progressively less of your actual content as they navigate deeper into the site architecture.
🏗️

Deep Architecture Health & Equity Distribution

19 pages
Gini Inequality: 80.5% (Extreme hoarding)
Top-3 PR Share: 87% (equity held by 3 pages)
Orphan Pages: 18 (94.7% of site)
Avg Click Depth: 0.0 (Max: 0 clicks)
Internal Links: 0 (18 disconnected)
Semantic Cohesion: 4% (Topical Drift)
5% Healthy (receive equity)
0% Diluted hubs (>50 outbound)
95% Orphans (zero inbound)
🔗 Top Hub Pages (Highest PageRank)
https://arstechnica.com/faq
https://arstechnica.com/tag/gdpr/
https://arstechnica.com/tag/developer/
https://arstechnica.com/tag/demo/
https://arstechnica.com/tag/api/
⚠️
18 orphan pages receive zero internal link equity. With a Gini coefficient of 0.81, link equity is highly concentrated: the top 3 pages hold 87% of all internal PageRank, so AI crawlers may never discover the rest.
💡 Quick Win: Optimal Link Injection
To rescue the orphan page gdpr, add a link to it from your high-authority Hub Page faq. (Semantic Match: 45%)
📥 Orphan: https://arstechnica.com/tag/gdpr/
🔗 Donor: https://arstechnica.com/faq (PR: 0.8579)
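The Gini coefficient measures link equity inequality across your site (0 = perfectly even, 1 = fully concentrated). A Gini of 0.81 indicates severe equity hoarding: a handful of pages dominate PageRank while the remaining pages receive little or none. Crawlers that discover content by following internal links are less likely to reach those deep pages.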
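
To make the two inequality measures concrete, here is a sketch of the standard Gini coefficient and a top-k share over a list of per-page PageRank values. The report does not publish the per-page values or its exact implementation, so the PageRank list below is purely hypothetical and is not expected to reproduce the 0.81 and 87% figures.

```python
def gini(values):
    """Gini coefficient of non-negative values (0 = equal, near 1 = concentrated)."""
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    # Standard rank-weighted formula over values sorted in ascending order.
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


def top_k_share(values, k):
    """Fraction of the total held by the k largest values."""
    total = sum(values)
    if total == 0:
        return 0.0
    return sum(sorted(values, reverse=True)[:k]) / total


# Hypothetical PageRank distribution for a 19-page crawl (illustrative only).
hypothetical_pagerank = [0.86, 0.05, 0.03] + [0.06 / 16] * 16
print(f"Gini: {gini(hypothetical_pagerank):.2f}")
print(f"Top-3 share: {top_k_share(hypothetical_pagerank, 3):.0%}")
```

A perfectly even distribution gives a Gini of 0, and a single page holding everything among n pages gives (n - 1) / n.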
🧲

Semantic Density

Thin · 19 topics
Topic Clusters: 19 (via TF-IDF cosine, ε=.60)
High-Density Pairs: 0 (>92% cosine similarity)
Contextual Depth: Thin (≤5 pairs)
Semantic Density measures how deeply your site covers each topic. High-density pairs (>92% cosine similarity) mean multiple pages reinforce the same subject, which helps RAG/AI systems that need corroborating sources. Thin coverage (5 or fewer high-density pairs) means AI models have less evidence to cite you.
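
The pairwise comparison behind this metric can be sketched with TF-IDF vectors and cosine similarity. The report does not specify its tokenizer or TF-IDF weighting, so this example assumes whitespace tokenization and a smoothed IDF; only the 0.92 threshold comes from the report. The sample documents and function names are hypothetical, and the ε=.60 clustering step is not covered.

```python
import math
from collections import Counter
from itertools import combinations


def tfidf_vectors(documents):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            # Term frequency times a smoothed inverse document frequency (assumed variant).
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def high_density_pairs(vectors, threshold=0.92):
    """Index pairs of documents whose similarity exceeds the threshold."""
    return [
        (i, j)
        for i, j in combinations(range(len(vectors)), 2)
        if cosine(vectors[i], vectors[j]) > threshold
    ]


docs = [
    "gdpr compliance policy",
    "gdpr compliance policy",
    "rocket launch orbit",
]
vectors = tfidf_vectors(docs)
print(high_density_pairs(vectors))  # [(0, 1)]
```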
🧬

Entity Knowledge Gap Analysis

7 MISSING 10% COVERAGE
⚠️ 7 entities mentioned in content but missing from structured data:
OpenAI Go AWS GDPR REST Anthropic Rust
Each missing entity is a lost opportunity for rich snippet inclusion and AI knowledge graph integration.
Schema Coverage: 1 declared vs 10 extracted (10% coverage). Entities with <100% coverage create knowledge graph blind spots for AI systems.
🧭

Semantic Nearest Neighbors

9to5mac.com (Overlap: 30%, ACRI: 60)
9to5Mac delivers breaking tech news and in-depth commentary, focusing on original reporting and providing a reader-friendly experience through high-quality media and sponsored content.
akismet.com (Overlap: 29%, ACRI: 31)
Detects and removes spam from websites and forms.
(domain not shown) (Overlap: 29%, ACRI: 53)
appdirect.com (Overlap: 28%, ACRI: 78)
B2B subscription commerce platform for technology services
achievement.org (Overlap: 27%, ACRI: 80)
The Academy of Achievement provides audio podcasts featuring intimate conversations with influential leaders across diverse fields, focusing on leadership principles and experiences.
🎭

Bait & Switch Delta

B 15 PAGES

Compares your homepage rendering quality with inner pages. A high drift score means AI crawlers see a polished homepage but degraded inner content — the "bait & switch" that erodes trust.

Homepage ACRI: 61
Inner Avg ACRI: 49
ACRI Delta: +12
Homepage Ghost: 60%
Inner Avg Ghost: 59%
Drift Score: 23
Worst Inner Pages
61 60% pricing https://arstechnica.com/tag/developer/
41 60% pricing https://arstechnica.com/tag/company/
51 60% pricing https://arstechnica.com/tag/documentation/
🛡️

E-E-A-T Trust Signals

D 30/100

Trust indicators extracted from surface pages. These signals help AI systems verify your site's Experience, Expertise, Authoritativeness, and Trustworthiness.

Physical Address
Phone Number
Email Contact
About Page
Contact Page
Privacy Policy
Terms of Service
Named Leadership
🔗

Citation Profile

8 DOMAINS

Outbound citation patterns across surface-crawled pages. Sites that cite diverse, authoritative sources signal higher E-E-A-T to AI systems.

Total Links: 106
Unique Domains: 8
Avg/Page: 7.1
Diversity: 8%
Domains: aboutads.info, bsky.app, mastodon.social, facebook.com, youtube.com, instagram.com, condenast.com, dx.doi.org
🏘️ Outbound Neighborhood Trust Avg Trust: 48.1

AI trust scores for the domains arstechnica.com links to. Citing high-trust sources lifts your own credibility signal.

🩹

Remediation Patches

COPY-PASTE

Auto-generated code fixes tailored to arstechnica.com. Copy and paste these into your codebase to improve AI visibility. These patches are intended to increase extraction accuracy.

Reduce Token Bloat
Medium Impact ⏱ 1–2 hrs
Only 8% of your HTML is useful content. AI crawlers waste context window tokens on bloat.
html
<!-- Move inline CSS to external stylesheets -->
<link rel="stylesheet" href="/css/main.css">

<!-- Move inline scripts to external files with defer -->
<script src="/js/app.js" defer></script>

<!-- Remove duplicate navigation blocks -->
<!-- Keep only ONE <nav> in the <header> -->

<!-- Ensure <main> wraps your primary content -->
<main>
  <!-- Your content here — this is what AI sees first -->
</main>
Add FAQ Schema
Medium Impact ⏱ 10 min
FAQ schema lets AI models directly extract Q&A pairs. This is the easiest way to get featured in AI responses.
html
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What is Ars Technica?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Add your answer here: describe what Ars Technica does in 1-2 sentences."
      }
    },
    {
      "@type": "Question",
      "name": "How does Ars Technica work?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Explain the key features and how users interact with Ars Technica."
      }
    }
  ]
}
</script>
📈

Projected Impact

ROI EST.

If you apply the patches above, here's the estimated improvement for arstechnica.com:

Current Score: 77
Projected Score: 85
Improvement: +8 pts
Reduce token bloat: +5 pts
Add FAQ schema: +3 pts

*Estimates based on SEODiff's scoring model. Actual results depend on implementation quality.

📋 Data Export

Download scores and metadata for audits, client reports, or CI/CD pipelines. All data is generated automatically and updated with each crawl. Exports, including JSON, contain computed scores and metadata only (no copyrighted content).

