mapou Visibility Index · Financial services → Health insurance · May 2026
Which health insurers does AI cite most?
When buyers ask AI assistants questions like “What are the best health insurance options to consider for individuals and families in 2026?” or “Best affordable health insurance plans under $300 a month for individuals and families?”, a small set of health insurersget cited every time. Most don't. This report measures which.
How we measured. MVI is a 0–100 score per brand: 0 means AI never cites you in health insurers, 100 means it cites you in every prompt. We tested 20 brands across 5 AI assistants (ChatGPT, Perplexity, Gemini, Claude, Grok) using 20 fixed prompts, reused every monthly run for replicability.
How we describe AI visibility
- Stage 1, First encounter. The brand is discovered and cited occasionally in AI answers for buyer-intent prompts.
- Stage 2, Repeat use. The brand is cited regularly enough that it feels familiar and reliably present across prompts and engines.
- Stage 3, Default choice. The brand is the go-to recommendation in AI answers within its segment, often appearing first or most consistently.
Bottom line
Blue Cross Blue Shield leads health insurers on AI search visibility with MVI 77, sitting firmly in the default-choice tier, in a tight race with UnitedHealthcare (MVI 70).
Who AI cites most
Blue Cross Blue Shield is cited in 67 of 100 prompt-engine pairs (67%). 95% confidence interval 67-84.
Concentration
The top 3 brands (Blue Cross Blue Shield, UnitedHealthcare, Aetna) capture 49% of all citations in this segment. 12 of 20 tracked brands are cited in fewer than 1 in 10 prompt-engine pairs.
Where the field sits
Of 20 brands tested: 1 in default-choice, 2 in repeat-use, 4 in first-encounter, 13 not yet cited. Overall, AI cites a brand from this segment in 18% of buyer-intent prompt-engine pairs.
Engine asymmetry
Aetna is cited in 75% of ChatGPT prompts but only 15% on Perplexity, visibility is engine-specific, not universal.
Phase flip
UnitedHealthcare actually outperforms Blue Cross Blue Shield on Discovery prompts, overall MVI hides this category-specific strength.
Notable absence
Devoted Health is not yet cited, 0 prompts across all 100 prompt-engine pairs. A recognizable brand AI is not yet surfacing.
Analyst note
Blue Cross Blue Shield leads with an MVI of 77, 7 points ahead of UnitedHealthcare.
Blue Cross Blue Shield is the default choice with a strong presence across all engines, especially ChatGPT at 88. UnitedHealthcare follows but lags in Perplexity with only 57. Aetna's visibility drops sharply in Perplexity, scoring just 15. Thirteen brands are invisible, including Oscar Health and Molina Healthcare, both under 12 MVI. This creates a competitive set where most brands struggle to be noticed.
Risk:Aetna risks losing ground due to its weak performance in the evaluation phase, scoring only 33 compared to its competitors.
Headline finding
Blue Cross Blue Shield leads health insurers on AI search visibility with MVI 77, sitting firmly in the default-choice tier.
Average MVI
20
Default choice
1
Of 20 brands
Repeat use
2
First encounter
4
Not yet cited
13
Citation rate per engine
How often each engine cites a brand from this category as a recommendation, averaged across all 20 brands tested.
ChatGPT
21%
10 / 20 brands cited at least once
Perplexity
18%
13 / 20 brands cited at least once
Gemini
17%
10 / 20 brands cited at least once
Claude
19%
11 / 20 brands cited at least once
Grok
16%
8 / 20 brands cited at least once
Phase strength across the category
Which buyer-intent phases are easiest vs hardest to win in health insurers. Citation rate averaged across all brands tested. Phase weights are part of the MVI formula.
Discovery · 30%
21%
Top: UnitedHealthcare
Filtered discovery · 25%
18%
Top: UnitedHealthcare
Comparison · 25%
20%
Top: Blue Cross Blue Shield
Evaluation · 20%
12%
Top: Blue Cross Blue Shield
Who wins which buyer phase
Top 12brands by MVI mapped against the four buyer-intent phases. Each cell shows the brand's citation rate for that phase, color-coded so the visual pattern tells the story: a brand strong across all four phases reads as a horizontal orange band; a brand strong only at Discovery but weak at Evaluation reads as a left-heavy gradient. This is the segment's findings against the panel.
| Brand | MVI | Discovery | Filtered | Comparison | Evaluation |
|---|---|---|---|---|---|
| Blue Cross Blue Shield | 77 | 77% | 72% | 80% | 80% |
| UnitedHealthcare | 70 | 78% | 75% | 63% | 63% |
| Aetna | 52 | 57% | 55% | 57% | 33% |
| Cigna | 46 | 58% | 47% | 43% | 30% |
| Kaiser Permanente | 44 | 57% | 57% | 38% | 18% |
| Humana | 30 | 42% | 17% | 45% | 13% |
| Anthem | 27 | 25% | 28% | 33% | 20% |
| Oscar Health | 21 | 28% | 13% | 35% | 0% |
| Molina Healthcare | 11 | 5% | 15% | 20% | 3% |
| Independence Blue Cross | 7 | 7% | 7% | 10% | 5% |
| Health Net | 6 | 8% | 2% | 3% | 10% |
| Centene | 5 | 7% | 3% | 10% | 0% |
How to read it. Strong horizontal band = durable brand, AI cites it across the entire funnel. Left-heavy gradient = brand with awareness (Discovery) but weak recommendation (Evaluation), the demand-leak pattern from Finding 03. Right-heavy gradient = brand AI considers in Comparison and Evaluation but does not surface in initial Discovery, the anti-leak pattern. Color steps: dark orange ≥75%, orange ≥50%, peach ≥30%, light ≥10%, beige >0%, neutral 0%.
For CMOs in health insurers
What this report means for your health insurers portfolio
Each bullet is a category-specific decision derived from this month's data, with the mapou service that operationalizes it.
Concentration risk
In health insurers, AI effectively recommends 8.4 of 20 tracked brands. The top brand captures 19% of citations. The next 2 capture another 17%. Visibility is concentrated but not winner-takes-all.
How mapou helps: GEO & Citation Architecture restructures your entity data so you can break into the top set.
Engine convergence
Unusually for our panel, health insurers engines mostly agree. Cross-engine correlation is 0.82 (1.0 = perfect). Improvements on one engine are likely to lift others. Lower-effort optimization than fragmented categories.
How mapou helps: AI Visibility Audit confirms your position is consistent across engines before you commit budget.
Persona-volatile category
Health insurers rankings shift meaningfully by buyer persona. Blue Cross Blue Shield is the baseline #1 brand, but loses the top spot under at least one buyer signal (budget, premium, professional, first-time, values-driven). Top-3 overlap with baseline is only 80% across personas. Your baseline visibility number is incomplete.
How mapou helps: Persona-Tuned MVI computes the visibility number for your actual buyer mix.
Visibility tier landscape
In health insurers, 1 of 20 tracked brands clear MVI 75 (default-choice tier). 13 are below MVI 25 (not yet cited). The strategy differs at each tier. If you are below 25, you need foundational visibility infrastructure before tactical optimization.
How mapou helps: AI Visibility Audit identifies your tier; GEO & Citation Architecture moves you up.
The mapou Visibility Index
What is MVI?
The mapou Visibility Index (MVI) is a 0-100 proprietary score combining four weighted dimensions: Discovery (open recommendations, 30%), Filtered Discovery (budget, persona, use-case, 25%), Comparison (head-to-head authority, 25%), and Evaluation (decision-criteria authority, 20%).
Citations count fully; mentions count at half weight. Engines are equally weighted (no market-share gymnastics). Wilson 95% confidence intervals are shown alongside every score. The same 20 prompts run every month so MVI deltas are paired comparisons, not noise.
How to read this ranking
- Default choice (MVI 75+). AI's go-to recommendation in health insurers. The tier other brands are competing into.
- Repeat use (50–74). Cited often enough to feel reliably present across prompts and engines. One signal away from default.
- First encounter (25–49). Discovered and cited occasionally, but visibility is inconsistent. The brand is real to AI, not yet trusted.
- Not yet cited (0–24). AI does not surface this brand for buyer-intent prompts in health insurers. Effectively invisible in AI-driven discovery.
Ranked by MVI score (Wilson 95% CI shown). The Spread column shows the gap between each brand's best and worst engine, under 15pp is durable, 50pp+ is engine-dependent. Per-engine columns show the count of prompts where each engine cited the brand as a recommendation (out of 20). Read each column as a signal: when ChatGPT cites you but Gemini doesn't, your gap is engine-specific. When all five miss you, the gap is foundational.
| # | Brand | MVI | 95% CI | Spread | Per-engine | ChatGPT | Perplexity | Gemini | Claude | Grok | Tier |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ▲ Comparison (14/20) · popular brands (5/5) | 77 | 67–84 | 45pp | 16 | 14 | 12 | 8 | 17 | Default choice | |
| 2 | ▲ Discovery (23/30) · top-brands lists (5/5) ▼ Comparison (10/20) · filtered values driven (0/5) | 70 | 61–79 | 25pp | 15 | 10 | 12 | 12 | 15 | Repeat use | |
| 3 | ▲ Comparison (11/20) · premium (5/5) ▼ Evaluation (6/20) · emerging brands (0/5) | 52 | 42–61 | 65pp | 15 | 2 | 9 | 11 | 11 | Repeat use | |
| 4 | ▲ Discovery (17/30) · popular brands (5/5) ▼ Evaluation (5/20) · emerging brands (0/5) | 46 | 37–56 | 45pp | 13 | 5 | 10 | 11 | 4 | First encounter | |
| 5 | ▲ Discovery (17/30) · current-year picks (5/5) ▼ Evaluation (3/20) · discovery recommendation request (0/5) | 44 | 36–55 | 35pp | 11 | 12 | 9 | 5 | 7 | First encounter | |
| 6 | ▲ Comparison (9/20) · head-to-head comparison (5/5) ▼ Evaluation (2/20) · budget-friendly (0/5) | 30 | 21–39 | 30pp | 7 | 4 | 3 | 9 | 5 | First encounter | |
| 7 | ▲ Comparison (6/20) · head-to-head comparison (3/5) ▼ Evaluation (4/20) · emerging brands (0/5) | 27 | 19–36 | 45pp | 2 | 8 | 6 | 9 | 0 | First encounter | |
| 8 | ▲ Comparison (7/20) · emerging brands (4/5) ▼ Evaluation (0/20) · open recommendation (0/5) | 21 | 13–28 | 20pp | 2 | 4 | 5 | 6 | 2 | Not yet cited | |
| 9 | ▲ Comparison (4/20) · budget-friendly (2/5) ▼ Evaluation (0/20) · open recommendation (0/5) | 11 | 6–18 | 20pp | 3 | 4 | 1 | 1 | 0 | Not yet cited | |
| 10 | ▲ Comparison (0/20) | 7 | 3–14 | 0pp | 0 | 0 | 0 | 0 | 0 | Not yet cited | |
| 11 | ▲ Evaluation (0/20) · current-year picks (1/5) | 6 | 2–12 | 5pp | 1 | 0 | 0 | 0 | 0 | Not yet cited | |
| 12 | ▲ Comparison (2/20) · top-brands lists (1/5) | 5 | 2–11 | 10pp | 0 | 2 | 2 | 1 | 0 | Not yet cited | |
| 13 | ▲ Discovery (1/30) · top-brands lists (1/5) | 4 | 2–10 | 15pp | 0 | 0 | 0 | 3 | 0 | Not yet cited | |
| 14 | ▲ Comparison (2/20) · current-year picks (1/5) | 4 | 1–8 | 15pp | 0 | 3 | 0 | 0 | 0 | Not yet cited | |
| 15 | ▲ Evaluation (1/20) · evaluation what to avoid (1/5) | 2 | 1–8 | 5pp | 0 | 1 | 0 | 0 | 0 | Not yet cited | |
| 16 | ▲ Filtered Discovery (1/30) · filtered values driven (1/5) | 1 | 0–6 | 5pp | 0 | 1 | 0 | 0 | 0 | Not yet cited | |
| 17 | ▲ Discovery (1/30) · emerging brands (1/5) | 1 | 0–5 | 5pp | 0 | 0 | 0 | 0 | 1 | Not yet cited | |
| 18 | 0 | 0–4 | 0pp | 0 | 0 | 0 | 0 | 0 | Not yet cited | ||
| 19 | 0 | 0–4 | 0pp | 0 | 0 | 0 | 0 | 0 | Not yet cited | ||
| 20 | 0 | 0–4 | 0pp | 0 | 0 | 0 | 0 | 0 | Not yet cited |
Strategic insights for health insurers
Five derived metrics computed from the same data, surfacing how this segment behaves on AI search. See the State of AI Search for cross-segment comparison.
Engine agreement
0.82
Broad consensus
Effective brands
8.4 / 20
Moderately concentrated · top 2 take 36%
Top demand-leak brand
Kaiser Permanente
+39pp Discovery vs Evaluation
Top mention-only brand
Independence Blue Cross
100% of visibility is mention-only
Kingmaker engine by funnel phase
discovery
ChatGPT
83pp spread
filtered
ChatGPT
100pp spread
comparison
Grok
100pp spread
evaluation
ChatGPT
100pp spread
For each phase, the engine where the gap between most-cited and least-cited brand is widest, i.e. where positioning matters most. Win that engine, win that phase.
Brands cited most across the category
Aggregated across every (brand × prompt × engine) combination tested. The most-cited brands here are the names AI consistently surfaces when buyers ask about health insurers.
Emerging brands AI is citing in health insurers
Brand names AI engines surfaced for health insurers prompts that are not currently on the mapou tracked panel. Ranked by mention count and engine breadth. These are panel candidates, brands AI considers part of the category even though we are not yet measuring them.
Method. Aggregated across the canonical run for health insurers. For every (panel brand × prompt × engine) we record the brand names the analyzer extracted (capped at 6 per response), then drop names that match the tracked panel or its aliases, plus a denylist of generic category terms. Threshold to qualify: at least 3 mentions across at least 2 of 5 engines. Click any row to see the AI quote that surfaced the brand. Some entries may be tracked elsewhere on mapou but not in this segment, in which case AI considers them cross-category competitors. Reviewed monthly to inform panel additions.
How health insurers rankings shift by buyer persona
The MVI score is calibrated to a generic shopping-assistant prompt. But buyers don't arrive generically. We re-ran the same 20 canonical prompts five more times, each with a different buyer-persona signal in the system prompt: budget-conscious, premium, working professional, first-time, values-driven. Top-3 overlap with baseline: 80%. Leader holds across all personas: no. Blue Cross Blue Shield loses the #1 spot to a different brand under at least one persona.
| Brand | Baseline | Budget | Premium | Pro | First-time | Values |
|---|---|---|---|---|---|---|
| Blue Cross Blue Shield | 95% | 55% | 100% | 80% | 95% | 55% |
| Cigna | 75% | 55% | 100% | 70% | 65% | 40% |
| Kaiser Permanente | 75% | 65% | 90% | 75% | 90% | 60% |
| Aetna | 70% | 45% | 90% | 60% | 65% | 40% |
| UnitedHealthcare | 65% | 45% | 80% | 55% | 65% | 30% |
Each cell is the citation rate (out of 20 canonical prompts) for that brand under that persona, ChatGPT only. Cells are tinted green when a brand gains 5+ percentage points vs baseline, orange when it loses 5+. Strong tints flag a 20+ percentage-point swing. Top 8 baseline brands shown; full per-persona data is in data/research/persona-robustness/2026-05-07-1625/. The full methodology is on the State of AI Search page.
The 20-prompt taxonomy
Every brand in this report is tested against the same 20 canonical prompts, spanning the four MVI dimensions (Discovery, Filtered Discovery, Comparison, Evaluation). The prompt set is fixed at methodology v1.0 and reused every monthly run, so MVI deltas are paired comparisons not noise.
The exact prompt templates and phase-weighting formula are part of mapou's proprietary methodology, shared with paying clients alongside custom benchmarks for their specific brand.
See the framework →Methodology v1.0. MVI is mapou's proprietary 0-100 visibility score across 5 AI engines and 4 buyer-intent dimensions. 95% Wilson confidence intervals. Equal engine weighting. See the framework →
Run yours
Want to see your brand on this leaderboard? Run a free visibility check on your own brand. We'll show you exactly which prompts you're missing and which engines are losing you the most ground.