What is ca.gov's AI readiness score?

ca.gov scores 66/100 on SEODiff's AI-Readiness Index, placing it in the Moderate range. The ACRI (AI-Crawler Reality Index) score is 42/100 (Grade D).

What tech stack does ca.gov use?

ca.gov uses Custom / Proprietary as its primary framework with a 5% ghost ratio (SSR rendering).

Is ca.gov visible to AI crawlers like ChatGPT?

GPTBot access: Allowed. ClaudeBot access: Allowed. ca.gov has 1 structured data block and a token bloat ratio of 4.5x.

ca.gov SEO & AI Readiness Score: 70/100 (B) — Full Technical Audit

📊

The AI-Readiness profile for ca.gov falls in the Moderate range: an AI-Readiness score of 66/100 places it ahead of 56% of domains in the index. Compared to other government sites (avg score: 58), ca.gov performs above the benchmark. The low ghost ratio (5%) indicates that what crawlers see matches what users see, a hallmark of strong SSR implementation. The 4.5× token bloat ratio is rated Lean, although roughly 78% of the HTML payload is markup rather than visible content. Minimal structured data (1 block) limits the site's ability to communicate entity relationships to AI systems. The site maintains an open-door policy for AI crawlers: GPTBot, ClaudeBot, and other major agents are all allowed.

B — Global SEODiff Score

Comprehensive search visibility assessment

Strong foundations, but Traditional SEO (50) is your bottleneck.

🎯 Top Fix: Monitor weekly to catch regressions early

🔬 Automated SEODiff Assessment · Snapshot: Feb 26, 2026 · 📋 API

📈 ACRI Trend: 25 snapshots, Feb 21 to Feb 26 (chart not reproduced)

🔔 Recent AI Indexing Activity

No recent changes detected by adaptive crawler.


🧮 Score Transparency — How is this calculated?

🛡️ Traditional SEO (25% weight): 50 × 0.25 = 12.5

🤖 AI Readiness / GEO (40% weight): 72 × 0.40 = 28.8

⚡ Performance (20% weight): 68 × 0.20 = 13.6

🏗️ Architecture & Trust (15% weight): 100 × 0.15 = 15.0

Weighted sum = 12.5 + 28.8 + 13.6 + 15.0 = 69.9

Global SEODiff Score = 70 (B)
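The weighted sum above can be reproduced in a few lines. This is a minimal sketch using only the pillar scores and weights listed in this section; rounding to the nearest integer is an assumption that happens to match the reported 70.

```python
# Pillar scores and weights copied from the score breakdown above.
PILLARS = {
    "Traditional SEO": (50, 0.25),
    "AI Readiness / GEO": (72, 0.40),
    "Performance": (68, 0.20),
    "Architecture & Trust": (100, 0.15),
}


def global_score(pillars):
    """Return the weighted sum of pillar scores."""
    return sum(score * weight for score, weight in pillars.values())


weighted = global_score(PILLARS)
print(f"Weighted sum: {weighted:.1f}")
# Rounding to the nearest integer is an assumption, not documented behavior.
print(f"Global score: {round(weighted)}")
```

Because Traditional SEO carries a 25% weight, each 4-point gain in that pillar adds 1 point to the global score.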

📊 ACRI Sub-Scores (AI Readiness Detail)

Sub-score	ca.gov	Avg
Bot Access	100	92
Rendering	n/a	93
Structure	n/a	36
Schema	n/a	9
Tech Stack	n/a	63

🔀 Visibility Delta: Google vs AI

Google (Tranco): Top 0.1%, Rank #576

AI (ACRI): Top 44%, Score 66/100

Gap: +44 pts

ca.gov ranks far higher in Google (top 0.1%) than in AI readiness (top 44%), so its AI visibility lags its search authority by about 44 percentile points. ACRI measures technical crawler readiness. Read the methodology →

Why ca.gov ranks here

Tech stack: Custom / Proprietary

Industry: government

Rendering: SSR

Schema coverage: 1 block

Token bloat: 4.5×

Fastest improvements

Reduce token bloat (navigation/footer/code) so agents reach your main content faster.
Create an llms.txt file so AI crawlers can discover your content structure without heavy crawling.
Run a full entropy audit to find which DOM regions waste the most tokens.


🛡️

Traditional SEO

50/100 25 % of Global Score 🟢 High Confidence

📝 Title Tag

32 chars

Good length

Optimal range: 30–60 characters for SERP display.

📋 Meta Description

125 chars

Good length

Optimal range: 120–160 characters for snippet control.
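The two length checks can be sketched as a single range test. The strings below are the og:title and og:description values reported later in this audit; treating them as the page's actual title tag and meta description is an assumption, and `length_status` is a hypothetical helper name.

```python
def length_status(text, low, high):
    """Classify a string's length against an inclusive [low, high] range."""
    n = len(text)
    if n < low:
        return "too short"
    if n > high:
        return "too long"
    return "good"


# Assumed to match the <title> and meta description (taken from the OpenGraph values).
title = "California State Portal | CA.gov"
description = (
    "CA.gov is the official website for the State of California. "
    "You can find and access California services, resources, and more."
)

print(len(title), length_status(title, 30, 60))
print(len(description), length_status(description, 120, 160))
```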

🔤 Heading Hierarchy

✓ Exactly 1 <h1> tag — found 1
✓ Has <h2> headings — found 6
✓ <h2> not before <h1>
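The three heading checks can be approximated with the standard-library HTML parser. This is a sketch of one way to run them, not SEODiff's implementation; the sample markup is hypothetical.

```python
from html.parser import HTMLParser


class HeadingCollector(HTMLParser):
    """Collect h1-h6 tag names in document order."""

    def __init__(self):
        super().__init__()
        self.headings = []

    def handle_starttag(self, tag, attrs):
        if tag in {"h1", "h2", "h3", "h4", "h5", "h6"}:
            self.headings.append(tag)


def heading_checks(html):
    """Run the three heading-hierarchy checks listed above."""
    collector = HeadingCollector()
    collector.feed(html)
    tags = collector.headings
    first_h1 = tags.index("h1") if "h1" in tags else None
    first_h2 = tags.index("h2") if "h2" in tags else None
    return {
        "exactly_one_h1": tags.count("h1") == 1,
        "has_h2": first_h2 is not None,
        # Passes when there is no h2, or when an h1 appears before the first h2.
        "h2_not_before_h1": first_h2 is None
        or (first_h1 is not None and first_h1 < first_h2),
    }


sample = "<h1>Portal</h1><h2>Services</h2><h2>News</h2>"  # hypothetical markup
print(heading_checks(sample))
```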

🔍 Indexability

✓ Canonical tag present → https://www.ca.gov/
✓ No noindex directive
✓ Meta viewport set
✓ HTML lang attribute → en-US
✗ Hreflang tags
✓ Googlebot allowed by robots.txt

🌐 Social / OpenGraph

✓ og:title — California State Portal | CA.gov
✓ og:description — CA.gov is the official website for the State of California. You can find and access California services, resources, and more.
✓ og:image — preview
✓ twitter:card — summary_large_image

📐 How the SEO Pillar score is calculated

SEO Pillar = Title (20 pts) + Meta Desc (20 pts) + Heading Hierarchy (20 pts) + Indexability (20 pts) + Social/OG (20 pts)

Each sub-score is derived from the checks above. Canonical tag, lang attribute, og:image, and a single H1 are the highest-impact items.

🤖

AI Readiness / GEO

72/100 40 % of Global Score 🟢 High Confidence

This pillar aggregates citation share, hallucination risk, bot access, schema health, and content extractability. The individual diagnostic sections below contribute to this score.

🔗

Citation Alternatives

Research

💡

Insight: In the government sector, lancom-systems.com (ACRI: 83) currently has stronger AI extractability. AI models tend to prefer sources with higher semantic structure and schema coverage. Domains with ACRI < 40 see 3.5× more hallucinations. Read the research →

ca.gov (your ACRI score) → lancom-systems.com (industry peer ACRI: 83)

AI models prioritize pages with strong semantic structure and schema coverage. lancom-systems.com has schema coverage of 1 block and uses TYPO3. Improve your score by implementing the remediation patches below.



🤖 Bot Access Matrix

✅

GPTBot (OpenAI)

Allowed

✅

ClaudeBot (Anthropic)

Allowed

✅

CCBot (Common Crawl)

Allowed

✅

Google-Extended

Allowed

✅

Googlebot

Allowed
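A bot access matrix like the one above can be derived from a robots.txt file with Python's `urllib.robotparser`. The robots.txt content below is hypothetical (the audit does not reproduce the real file); only the agent names come from this report.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt; the real ca.gov file is not shown in this report.
ROBOTS_TXT = """\
User-agent: *
Allow: /
"""

AI_AGENTS = ["GPTBot", "ClaudeBot", "CCBot", "Google-Extended", "Googlebot"]


def bot_access(robots_txt, url, agents=AI_AGENTS):
    """Return {agent: allowed} for one URL under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return {agent: parser.can_fetch(agent, url) for agent in agents}


for agent, allowed in bot_access(ROBOTS_TXT, "https://www.ca.gov/").items():
    print(f"{agent}: {'Allowed' if allowed else 'Blocked'}")
```

Parsing a local string keeps the check offline; fetching the live file would require `set_url` and `read` instead.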

👻 Rendering (Ghost Ratio)

Ghost Ratio: 5% (scale: 0% is safe, 100% is high risk)

Status: Server-Side Rendered (Safe)

Rendering Type: SSR

📊 Structure & Information Density

Structure Grade: 28/100 (Low)

Structured Elements: 8 elements (8 lists, 0 rows, 0 headers)

Total Words: 341

Raw Density: 2.3%

💡Low structure score (28/100). Your content appears as a wall of text with few structured HTML elements. You have 8 list items, 0 table rows, 0 table headers. Convert features into <ul> lists and data into <table> elements to help AI models extract structured information.
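The reported raw density is consistent with dividing structured elements by total words (8 / 341). The report does not state the formula, so the definition in this sketch is inferred from the figures rather than documented.

```python
def raw_density(structured_elements, total_words):
    """Structured elements per word, as a percentage (inferred definition)."""
    if total_words == 0:
        return 0.0
    return 100.0 * structured_elements / total_words


# 8 structured elements and 341 words, as reported above.
print(f"Raw density: {raw_density(8, 341):.1f}%")
```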

🏷️ Schema Health

Organization Schema: ❌ Missing

Product / Service Schema: ⚠️ Not Found

Total Schema Blocks: 1 block (Basic, low value for AI)

Schema Coverage Map

0/7 schema types detected

❌ Organization

❌ Product/Service

❌ Breadcrumb

❌ FAQ

❌ Article

❌ WebSite

💡Organization schema missing. AI models cannot identify your brand entity. Without it, your brand won't appear in Knowledge Panels or be associated with your content.

💡Product / Service schema missing. AI models cannot tell what services the site offers. Add Service or GovernmentService schema (or Product / SoftwareApplication for a commercial product) so AI understands what is offered.

💡BreadcrumbList schema missing. AI cannot understand your site hierarchy or how pages relate to each other.

💡FAQ schema missing. Adding FAQPage schema lets AI models directly extract Q&A pairs for Featured Snippets and chatbot answers.

💡WebSite schema missing. Add WebSite + SearchAction so Google can generate a Sitelinks Search Box for your brand in AI results.
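A coverage map like the one above can be approximated by extracting JSON-LD blocks and collecting their `@type` values. This is a sketch under stated assumptions: the expected type names are my mapping of the labels in this section to schema.org types, nested `@graph` structures are not handled, and the sample markup is hypothetical.

```python
import json
from html.parser import HTMLParser

# Assumed mapping of the coverage-map labels to schema.org type names.
EXPECTED_TYPES = ["Organization", "Product", "BreadcrumbList", "FAQPage", "Article", "WebSite"]


class JsonLdExtractor(HTMLParser):
    """Collect parsed JSON-LD blocks from <script type="application/ld+json">."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self._in_jsonld = False
            try:
                self.blocks.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # skip malformed blocks


def schema_coverage(html):
    """Return {type_name: detected} for each expected schema type."""
    extractor = JsonLdExtractor()
    extractor.feed(html)
    found = set()
    for block in extractor.blocks:
        for item in block if isinstance(block, list) else [block]:
            if isinstance(item, dict):
                types = item.get("@type", [])
                found.update([types] if isinstance(types, str) else types)
    return {name: name in found for name in EXPECTED_TYPES}


# Hypothetical page with one block whose type is outside the expected list.
sample = '<script type="application/ld+json">{"@context": "https://schema.org", "@type": "WebPage"}</script>'
print(schema_coverage(sample))
```

A page can therefore carry one schema block and still show zero detected types, which matches the pattern reported here.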

📐 AI Efficiency Metrics

Extractability: 54/100. AI models can partially extract answers from this page.

Crawl Cost: Low (10/100). Efficient for AI crawlers to process.

Blocklist Risk: None. 0 of 5 AI crawlers blocked.

Token Bloat

Useful Content: 6.4 KB (22%)

Bloat: 22.4 KB (78%)

Token Bloat Ratio: 4.5× (Lean)
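The token bloat figures are consistent with a simple size ratio: raw HTML size divided by visible text size (28.8 KB / 6.4 KB, both reported in the Tech Stack section). The sketch below assumes that definition; the report measures kilobytes rather than actual tokens.

```python
def token_bloat(raw_html_kb, visible_text_kb):
    """Return (bloat ratio, useful share, bloat size in KB) under the assumed definition."""
    ratio = raw_html_kb / visible_text_kb
    useful_share = visible_text_kb / raw_html_kb
    bloat_kb = raw_html_kb - visible_text_kb
    return ratio, useful_share, bloat_kb


ratio, useful_share, bloat_kb = token_bloat(28.8, 6.4)
print(f"Ratio: {ratio:.1f}x")
print(f"Useful: {useful_share:.0%}, bloat: {1 - useful_share:.0%} ({bloat_kb:.1f} KB)")
```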

Multimodal Readiness

Visual Context: 100% (Optimized for Vision)

Image Alt Coverage: 10 / 10 images have alt text

TDM Rights

TDM-Reservation Header: Not set

X-Robots-Tag noai directive: Not set

🔥 Structural Entropy Check

Entropy: 12 (Poor)

Token Bloat: High

Noise Ratio: 77.8% · SNR: 0.29 · Signal: 1636 tokens / Noise: 5724 tokens
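The noise ratio and SNR follow directly from the two token counts. A short sketch, assuming noise ratio is noise over total tokens and SNR is signal over noise, which reproduces the reported values:

```python
def entropy_metrics(signal_tokens, noise_tokens):
    """Return (noise ratio, signal-to-noise ratio) from token counts."""
    total = signal_tokens + noise_tokens
    noise_ratio = noise_tokens / total
    snr = signal_tokens / noise_tokens
    return noise_ratio, snr


noise_ratio, snr = entropy_metrics(1636, 5724)
print(f"Noise ratio: {noise_ratio:.1%}")
print(f"SNR: {snr:.2f}")
```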


🧠

The LLM Interpretation

AI-VERIFIED

A local LLM (mlx-community/Qwen2.5-7B-Instruct-4bit) analyzed the extracted content of ca.gov and produced this structured business intelligence. Fields marked SEMANTIC VOID indicate information the AI could not find — a critical gap in your site’s machine-readability.

Core Offering

⚠ SEMANTIC VOID

Target Audience

⚠ SEMANTIC VOID

Pricing Model

⚠ SEMANTIC VOID

🛡️ Compliance Standards

SOC 2, GDPR, HIPAA, ISO 27001

🏆 Competitive Moat

SEMANTIC VOID

📊 Content Depth

2/10

🔄 Programmatic SEO Signals

directory listings

⚡ Key Pain Points

• No clear description of the product/service

• No target audience defined

• No pricing model mentioned

Model: mlx-community/Qwen2.5-7B-Instruct-4bit · Analyzed: 2026-02-27 · Data extracted from the site’s main content via strict JSON prompting.

🔧 Tech Stack

Framework: Custom / Proprietary

AI-Readiness Score: 50/100

Server: not reported

CDN: not reported

HTTP Status: 200

Load Time: 1106 ms

Raw HTML Size: 28.8 KB

Visible Text Size: 6.4 KB

⚡

Performance & Speed

68/100 20 % of Global Score 🟢 High Confidence

⏱️ Time to First Byte

1106 ms

Slow — bots may time out or deprioritise

Google considers <200 ms "good". AI crawlers may have even shorter timeouts.

📦 Page Weight

315 DOM nodes

29 KB HTML payload

Lean page — fast for bots and users

🗄️ Cache & CDN

✓ Cache-Control header → public, max-age=1800
✗ CDN cache status
✗ CDN detected
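The Cache-Control value reported above can be read with a small parser. This sketch handles only simple comma-separated directives and is not a full HTTP header parser; `parse_cache_control` is a hypothetical helper.

```python
def parse_cache_control(value):
    """Split a Cache-Control header into {directive: argument or True}."""
    directives = {}
    for part in value.split(","):
        part = part.strip()
        if not part:
            continue
        name, _, arg = part.partition("=")
        # Directives without an argument (e.g. "public") are stored as True.
        directives[name.strip().lower()] = arg.strip().strip('"') or True
    return directives


parsed = parse_cache_control("public, max-age=1800")
max_age = int(parsed.get("max-age", 0))
print(parsed)
print(f"Cacheable for {max_age // 60} minutes")
```

A `max-age` of 1800 seconds lets shared caches reuse the response for 30 minutes, which softens the effect of the slow first byte for repeat fetches.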

🔬 Tracker Tax

0 tracker scripts

0 third-party domains

0.0% token overhead

Minimal tracker load — clean signal for bots

📐 How the Performance Pillar score is calculated

Perf Pillar = TTFB (35 pts) + Page Weight (25 pts) + Cache/CDN (20 pts) + Tracker Tax (20 pts)

TTFB <200 ms = full marks. DOM >3000 or payload >300 KB incurs heavy penalties. Tracker scripts beyond 5 reduce score.

🏗️

Architecture & Trust

100/100 15 % of Global Score 🟢 High Confidence

🗺️ Sitemap & Robots

✓ Sitemap declared in robots.txt → https://www.ca.gov/sitemaps/sitemapindex.xml
✓ Googlebot allowed
✓ GPTBot allowed
✓ ClaudeBot allowed

🔗 Linking

45 internal links

5 external links

Good internal linking — helps crawlers discover content

🔒 Security & Trust

✓ HSTS header (Strict-Transport-Security)
✓ Content-Security-Policy header
✓ HTTP status 200 OK (got 200)

♿ Accessibility Signals

✓ HTML lang attribute → en-US
✓ Meta viewport for mobile
✓ Single H1 for screen readers

📐 How the Architecture Pillar score is calculated

Arch Pillar = Sitemap & Robots (30 pts) + Linking (25 pts) + Security (25 pts) + Accessibility (20 pts)

Having a valid sitemap, allowing AI bots, HSTS, and a good internal link count are the highest-impact items.

🏅 AI-Verified Trust Badge

Your site scores 42/100. Reach 80+ to unlock the green "AI-Verified" badge. Fix the issues below to improve your score.

Pending Audit — score below 80 threshold

<a href="https://seodiff.io/radar/domains/ca.gov" rel="noopener"><img src="https://seodiff.io/api/v1/badge?domain=ca.gov" alt="AI-Verified by SEODiff" width="280" height="52"></a>

[![AI-Verified by SEODiff](https://seodiff.io/api/v1/badge?domain=ca.gov)](https://seodiff.io/radar/domains/ca.gov)

💡 Paste in your site footer, GitHub README, or email signature. Badge updates automatically as your score changes.

Deep Crawl Analysis: 4 pages · Deep-10

Homepage ACRI (single-page score): value not captured

Delta: +21 (subpages outperform homepage)

Site-Wide ACRI: avg across 4 pages, range 39–82

Topical Cohesion and Topical Drift (TF-IDF cosine similarity): values not captured

Total Words: 4772

Avg Bloat: 13.1×

Ext. Citations: value not captured

Page	Title	Type	ACRI	Token Bloat	Words	Status
https://ca.gov/privacy	Privacy policy | CA.gov	legal	82	3.0×	2064	✓
https://ca.gov/careers	Jobs and unemployment | CA.gov	pricing	77	5.5×	2346	💰 Pricing
https://ca.gov/help	Technical help | CA.gov	support	57	14.3×	248	✓
https://ca.gov/contact	Contact | CA.gov	support	39	29.4×	114	✓
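The site-wide figures can be re-derived from the four rows above. A minimal sketch; the tuple layout is my own, and the values are copied from the table.

```python
# (path, ACRI, token bloat ratio, words) copied from the table above.
PAGES = [
    ("/privacy", 82, 3.0, 2064),
    ("/careers", 77, 5.5, 2346),
    ("/help", 57, 14.3, 248),
    ("/contact", 39, 29.4, 114),
]


def aggregate(pages):
    """Compute site-wide averages and totals across crawled pages."""
    n = len(pages)
    acri = [page[1] for page in pages]
    return {
        "avg_acri": sum(acri) / n,
        "acri_range": (min(acri), max(acri)),
        "avg_bloat": sum(page[2] for page in pages) / n,
        "total_words": sum(page[3] for page in pages),
    }


summary = aggregate(PAGES)
print(summary)
```

The average ACRI works out to 63.75, which is about 21 points above the homepage ACRI of 42 reported elsewhere in this audit and is consistent with the +21 delta.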

📂

Health by Sub-Directory

Average ACRI and top issues aggregated by URL path prefix

Path	Pages	Avg ACRI	Ghost %	Bloat	Top Issue
/careers/	1	77	0%	5.5×	High JS Bloat
/contact/	1	39	0%	29.4×	High JS Bloat
/help/	1	57	0%	14.3×	High JS Bloat
/privacy/	1	82	0%	3.0×	Healthy

🔗

Outbound External Citations

30 unique external domains cited across 4 pages

saveourwater.com ×4

calcareers.ca.gov ×4

myhazards.caloes.ca.gov ×4

data.ca.gov ×4

calalerts.org ×4

sco.ca.gov ×4

caiso.com ×4

registertovote.ca.gov ×4


🔌 API Access

Pull this data programmatically. All sub-page metrics are available via our public API.

curl https://seodiff.io/api/v1/deep10/domain/ca.gov

Get your free API key — 100 requests/month included.
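The same request can be made from Python with the standard library. This is a sketch only: the endpoint path comes from the curl example above, but the JSON response format and the absence of an API key header are assumptions, since the report does not document either.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen

API_BASE = "https://seodiff.io/api/v1/deep10/domain/"


def deep10_url(domain):
    """Build the Deep-10 endpoint URL for a domain."""
    return API_BASE + quote(domain, safe="")


def fetch_deep10(domain, timeout=10):
    """Fetch the Deep-10 data; assumes the endpoint returns JSON."""
    request = Request(deep10_url(domain), headers={"Accept": "application/json"})
    with urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


print(deep10_url("ca.gov"))
```

Calling `fetch_deep10("ca.gov")` performs the network request; if the API requires a key, the header name would need to be taken from the API documentation.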

🔗 Similar government Sites

Domains with a similar tech stack, industry, and AI readiness profile to ca.gov. Compare side-by-side.

Domain	ACRI	AI Score	Tech Stack	Token Bloat	Schema
ca.gov (this site)	42	66	Custom / Proprietary	4.5×	1
rucu.ac.tz	43	63	Custom / Proprietary	4.3×	1
dissansdigital.com	44	70	Custom / Proprietary	5.2×	1
armenian-genocide.org	42	66	Custom / Proprietary	4.2×	0
zoomgovdev.com	42	15	Custom / Proprietary	5.0×	0
securitybrief.co.nz	38	60	Custom / Proprietary	4.3×	1

🎭

Bait & Switch Delta

A 4 PAGES

Compares your homepage rendering quality with inner pages. A high drift score means AI crawlers see a polished homepage but degraded inner content — the "bait & switch" that erodes trust.

Homepage ACRI

Inner Avg ACRI

-33

ACRI Delta

20%

Homepage Ghost

Inner Avg Ghost

Drift Score

Worst Inner Pages

ACRI	Ghost %	Type	URL
57	0%	support	https://ca.gov/help
77	0%	pricing	https://ca.gov/careers
82	0%	legal	https://ca.gov/privacy

🛡️

E-E-A-T Trust Signals

D 35/100

Trust indicators extracted from surface pages. These signals help AI systems verify your site's Experience, Expertise, Authoritativeness, and Trustworthiness.

❌ Physical Address

❌ Phone Number

✅ Email Contact

❌ About Page

✅ Contact Page

✅ Privacy Policy

❌ Terms of Service

❌ Named Leadership

🔗

Citation Profile

30 DOMAINS

Outbound citation patterns across surface-crawled pages. Sites that cite diverse, authoritative sources signal higher E-E-A-T to AI systems.

Total Links: value not captured

Unique Domains: 30

Avg/Page: 14.5

Diversity: 52%

caiso.com

saveourwater.com

sco.ca.gov

data.ca.gov

calalerts.org

myhazards.caloes.ca.gov

calcareers.ca.gov

registertovote.ca.gov

chp.ca.gov

edd.ca.gov
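The diversity figure appears to be unique domains divided by total outbound links. The report does not show the total, so the sketch below infers it from the per-page average and the 4 crawled pages; both the inferred total and the formula are assumptions that happen to reproduce the reported 52%.

```python
def citation_profile(unique_domains, avg_links_per_page, pages):
    """Return (inferred total links, diversity) under the assumed definition."""
    total_links = avg_links_per_page * pages  # inferred, not reported
    diversity = unique_domains / total_links if total_links else 0.0
    return total_links, diversity


total_links, diversity = citation_profile(30, 14.5, 4)
print(f"Inferred total links: {total_links:.0f}")
print(f"Diversity: {diversity:.0%}")
```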

🏘️ Outbound Neighborhood Trust Avg Trust: 0.0

AI trust scores for the domains ca.gov links to. Citing high-trust sources lifts your own credibility signal.

caiso.com 🛡 0

saveourwater.com 🛡 0

ca.gov 🛡 0

calalerts.org 🛡 0

ca.gov 🛡 0

🩹

Remediation Patches

COPY-PASTE

Auto-generated code fixes tailored to ca.gov. Copy and paste these into your codebase to improve AI visibility. SEODiff reports that these patches increase extraction accuracy.

Add Organization JSON-LD

High Impact ⏱ 5 min

AI models cannot identify your brand entity without Organization schema. This is the #1 fix for AI visibility.

html

<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "Organization",
  "name": "State of California",
  "url": "https://www.ca.gov/",
  "logo": "https://ca.gov/images/apple-touch-icon-192x192.png",
  "sameAs": []
}
</script>

Add WebSite + SearchAction JSON-LD

High Impact ⏱ 5 min

Enables the Sitelinks Search Box in Google and allows AI to understand your site structure.

html

<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "WebSite",
  "name": "CA.gov",
  "url": "https://www.ca.gov/",
  "potentialAction": {
    "@type": "SearchAction",
    "target": "https://ca.gov/search?q={search_term_string}",
    "query-input": "required name=search_term_string"
  }
}
</script>

Add FAQ Schema

Medium Impact ⏱ 10 min

FAQ schema lets AI models directly extract Q&A pairs. This is the easiest way to get featured in AI responses.

html

<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [
    {
      "@type": "Question",
      "name": "What is CA.gov?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Add your answer here: describe what CA.gov does in 1-2 sentences."
      }
    },
    {
      "@type": "Question",
      "name": "How does CA.gov work?",
      "acceptedAnswer": {
        "@type": "Answer",
        "text": "Explain the key features and how users interact with CA.gov."
      }
      }
    }
  ]
}
</script>

📈

Projected Impact

ROI EST.

If you apply the patches above, here's the estimated improvement for ca.gov:

Current Score

Projected Score

Improvement

+16 pts

Add Organization schema +6 pts

Add WebSite schema +4 pts

Reduce token bloat +3 pts

Add FAQ schema +3 pts

*Estimates based on SEODiff's scoring model. Actual results depend on implementation quality.
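The +16 pts total is the sum of the four per-patch estimates. In the sketch below, the starting score of 42 is the ACRI quoted in the badge section of this audit, used here as an assumption because the Current Score card's value was not captured; the cap at 100 is also an assumption.

```python
# Per-patch estimates copied from the list above.
GAINS = {
    "Add Organization schema": 6,
    "Add WebSite schema": 4,
    "Reduce token bloat": 3,
    "Add FAQ schema": 3,
}


def projected_score(current, gains, cap=100):
    """Add estimated gains to the current score, capped at the maximum (assumed 100)."""
    return min(cap, current + sum(gains.values()))


print(f"Total improvement: +{sum(GAINS.values())} pts")
# 42 is assumed to be the starting score (ACRI from the badge section).
print(f"Projected score: {projected_score(42, GAINS)}")
```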

📋 Data Export

Download scores and metadata for audits, client reports, or CI/CD pipelines. Exports contain computed metrics only (no copyrighted content).


