mapou Visibility Index · Financial services → Life insurance · May 2026
Which life insurers does AI cite most?
When buyers ask AI assistants questions like “What are the best life insurance options to consider for 2026?” or “Best affordable life insurance options under $50 a month?”, a small set of life insurersget cited every time. Most don't. This report measures which.
How we measured. MVI is a 0–100 score per brand: 0 means AI never cites you in life insurers, 100 means it cites you in every prompt. We tested 20 brands across 5 AI assistants (ChatGPT, Perplexity, Gemini, Claude, Grok) using 20 fixed prompts, reused every monthly run for replicability.
How we describe AI visibility
- Stage 1, First encounter. The brand is discovered and cited occasionally in AI answers for buyer-intent prompts.
- Stage 2, Repeat use. The brand is cited regularly enough that it feels familiar and reliably present across prompts and engines.
- Stage 3, Default choice. The brand is the go-to recommendation in AI answers within its segment, often appearing first or most consistently.
Bottom line
Northwestern Mutual leads life insurers on AI search visibility with MVI 56, sitting firmly in the repeat-use tier, in a tight race with MassMutual (MVI 54).
Who AI cites most
Northwestern Mutual is cited in 53 of 100 prompt-engine pairs (53%). 95% confidence interval 45-64.
Concentration
The top 3 brands (Northwestern Mutual, MassMutual, Haven Life) capture 35% of all citations in this segment. 6 of 20 tracked brands are cited in fewer than 1 in 10 prompt-engine pairs.
Where the field sits
Of 20 brands tested: 2 in repeat-use, 8 in first-encounter, 10 not yet cited. Overall, AI cites a brand from this segment in 21% of buyer-intent prompt-engine pairs.
Engine asymmetry
Prudential is cited in 65% of ChatGPT prompts but only 0% on Perplexity, visibility is engine-specific, not universal.
Phase flip
MassMutual actually outperforms Northwestern Mutual on Filtered Discovery prompts, overall MVI hides this category-specific strength.
Notable absence
Brighthouse Financial is not yet cited, 0 prompts across all 100 prompt-engine pairs. A recognizable brand AI is not yet surfacing.
Analyst note
Northwestern Mutual and MassMutual are the only foothold brands, with MVIs of 56 and 54, respectively, while 10 brands are invisible.
Northwestern Mutual leads with an MVI of 56, followed by MassMutual at 54. Both are in the foothold tier, showing repeat use. Northwestern Mutual performs best on Gemini with a 65 MVI, while MassMutual has a 78 MVI on the same engine. Haven Life, at 46, is the top toehold brand but lacks the repeat visibility of the leaders. Half of the brands are invisible, creating a competitive gap.
Risk:MassMutual is weak in the evaluation phase with a 33 MVI, while Northwestern Mutual has a 53, allowing competitors to gain ground.
Headline finding
Northwestern Mutual leads life insurers on AI search visibility with MVI 56, sitting firmly in the repeat-use tier.
Average MVI
23
Default choice
0
Of 20 brands
Repeat use
2
First encounter
8
Not yet cited
10
Citation rate per engine
How often each engine cites a brand from this category as a recommendation, averaged across all 20 brands tested.
ChatGPT
24%
16 / 20 brands cited at least once
Perplexity
21%
14 / 20 brands cited at least once
Gemini
22%
15 / 20 brands cited at least once
Claude
17%
14 / 20 brands cited at least once
Grok
20%
16 / 20 brands cited at least once
Phase strength across the category
Which buyer-intent phases are easiest vs hardest to win in life insurers. Citation rate averaged across all brands tested. Phase weights are part of the MVI formula.
Discovery · 30%
25%
Top: Northwestern Mutual
Filtered discovery · 25%
20%
Top: MassMutual
Comparison · 25%
23%
Top: MassMutual
Evaluation · 20%
14%
Top: Northwestern Mutual
Who wins which buyer phase
Top 12brands by MVI mapped against the four buyer-intent phases. Each cell shows the brand's citation rate for that phase, color-coded so the visual pattern tells the story: a brand strong across all four phases reads as a horizontal orange band; a brand strong only at Discovery but weak at Evaluation reads as a left-heavy gradient. This is the segment's findings against the panel.
| Brand | MVI | Discovery | Filtered | Comparison | Evaluation |
|---|---|---|---|---|---|
| Northwestern Mutual | 56 | 68% | 42% | 57% | 53% |
| MassMutual | 54 | 53% | 62% | 65% | 33% |
| Haven Life | 46 | 45% | 47% | 63% | 28% |
| New York Life | 38 | 48% | 40% | 45% | 10% |
| State Farm Life | 34 | 32% | 22% | 33% | 55% |
| Guardian Life | 33 | 28% | 45% | 30% | 28% |
| Ethos | 28 | 37% | 30% | 33% | 5% |
| Ladder | 27 | 43% | 25% | 23% | 13% |
| Prudential | 26 | 37% | 23% | 23% | 20% |
| Banner Life | 25 | 27% | 20% | 35% | 15% |
| Pacific Life | 18 | 18% | 13% | 23% | 20% |
| Transamerica | 16 | 20% | 12% | 28% | 0% |
How to read it. Strong horizontal band = durable brand, AI cites it across the entire funnel. Left-heavy gradient = brand with awareness (Discovery) but weak recommendation (Evaluation), the demand-leak pattern from Finding 03. Right-heavy gradient = brand AI considers in Comparison and Evaluation but does not surface in initial Discovery, the anti-leak pattern. Color steps: dark orange ≥75%, orange ≥50%, peach ≥30%, light ≥10%, beige >0%, neutral 0%.
For CMOs in life insurers
What this report means for your life insurers portfolio
Each bullet is a category-specific decision derived from this month's data, with the mapou service that operationalizes it.
Concentration risk
In life insurers, AI effectively recommends 12.9 of 20 tracked brands. The top brand captures 12% of citations. The next 2 capture another 12%. Visibility is concentrated but not winner-takes-all.
How mapou helps: GEO & Citation Architecture restructures your entity data so you can break into the top set.
Engine fragmentation
Engines disagree on the life insurers leaderboard. Mean cross-engine agreement is only 0.52 (1.0 = perfect agreement, 0 = independent). Optimizing for ChatGPT will not necessarily improve your Claude or Gemini visibility. You need engine-specific strategy.
How mapou helps: AI Visibility Audit maps your position separately on each of the 5 engines.
Persona-volatile category
Life insurers rankings shift meaningfully by buyer persona. Haven Life is the baseline #1 brand, but loses the top spot under at least one buyer signal (budget, premium, professional, first-time, values-driven). Top-3 overlap with baseline is only 53% across personas. Your baseline visibility number is incomplete.
How mapou helps: Persona-Tuned MVI computes the visibility number for your actual buyer mix.
Visibility tier landscape
In life insurers, 0 of 20 tracked brands clear MVI 75 (default-choice tier). 10 are below MVI 25 (not yet cited). The strategy differs at each tier. If you are below 25, you need foundational visibility infrastructure before tactical optimization.
How mapou helps: AI Visibility Audit identifies your tier; GEO & Citation Architecture moves you up.
The mapou Visibility Index
What is MVI?
The mapou Visibility Index (MVI) is a 0-100 proprietary score combining four weighted dimensions: Discovery (open recommendations, 30%), Filtered Discovery (budget, persona, use-case, 25%), Comparison (head-to-head authority, 25%), and Evaluation (decision-criteria authority, 20%).
Citations count fully; mentions count at half weight. Engines are equally weighted (no market-share gymnastics). Wilson 95% confidence intervals are shown alongside every score. The same 20 prompts run every month so MVI deltas are paired comparisons, not noise.
How to read this ranking
- Default choice (MVI 75+). AI's go-to recommendation in life insurers. The tier other brands are competing into.
- Repeat use (50–74). Cited often enough to feel reliably present across prompts and engines. One signal away from default.
- First encounter (25–49). Discovered and cited occasionally, but visibility is inconsistent. The brand is real to AI, not yet trusted.
- Not yet cited (0–24). AI does not surface this brand for buyer-intent prompts in life insurers. Effectively invisible in AI-driven discovery.
Ranked by MVI score (Wilson 95% CI shown). The Spread column shows the gap between each brand's best and worst engine, under 15pp is durable, 50pp+ is engine-dependent. Per-engine columns show the count of prompts where each engine cited the brand as a recommendation (out of 20). Read each column as a signal: when ChatGPT cites you but Gemini doesn't, your gap is engine-specific. When all five miss you, the gap is foundational.
| # | Brand | MVI | 95% CI | Spread | Per-engine | ChatGPT | Perplexity | Gemini | Claude | Grok | Tier |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | ▲ Discovery (20/30) · top-brands lists (5/5) ▼ Filtered Discovery (12/30) · emerging brands (0/5) | 56 | 45–64 | 15pp | 10 | 11 | 12 | 9 | 11 | Repeat use | |
| 2 | ▲ Comparison (12/20) · premium (5/5) ▼ Evaluation (6/20) · popular brands (0/5) | 54 | 44–63 | 40pp | 12 | 11 | 14 | 6 | 6 | Repeat use | |
| 3 | ▲ Comparison (12/20) · emerging brands (5/5) ▼ Evaluation (5/20) · top-brands lists (0/5) | 46 | 36–55 | 55pp | 14 | 3 | 7 | 10 | 9 | First encounter | |
| 4 | ▲ Discovery (14/30) · top-brands lists (4/5) ▼ Evaluation (2/20) · emerging brands (0/5) | 38 | 29–47 | 35pp | 6 | 11 | 4 | 8 | 6 | First encounter | |
| 5 | ▲ Evaluation (9/20) · filtered use case specific (4/5) ▼ Filtered Discovery (6/30) · emerging brands (0/5) | 34 | 25–43 | 60pp | 4 | 4 | 2 | 5 | 14 | First encounter | |
| 6 | ▲ Filtered Discovery (13/30) · filtered values driven (5/5) ▼ Evaluation (5/20) · discovery recommendation request (0/5) | 33 | 25–43 | 50pp | 4 | 13 | 7 | 5 | 3 | First encounter | |
| 7 | ▲ Discovery (11/30) · emerging brands (5/5) ▼ Evaluation (1/20) · top-brands lists (0/5) | 28 | 20–37 | 30pp | 3 | 7 | 8 | 7 | 2 | First encounter | |
| 8 | ▲ Discovery (13/30) · emerging brands (5/5) ▼ Evaluation (2/20) · popular brands (0/5) | 27 | 20–37 | 40pp | 2 | 5 | 10 | 7 | 2 | First encounter | |
| 9 | ▲ Discovery (11/30) · top-brands lists (4/5) ▼ Evaluation (4/20) · emerging brands (0/5) | 26 | 19–36 | 65pp | 13 | 0 | 2 | 1 | 10 | First encounter | |
| 10 | ▲ Comparison (7/20) · comparison alternative to leader (4/5) ▼ Evaluation (3/20) · premium (0/5) | 25 | 17–33 | 50pp | 11 | 8 | 3 | 1 | 1 | First encounter | |
| 11 | ▲ Comparison (4/20) · head-to-head comparison (3/5) | 18 | 12–27 | 50pp | 1 | 5 | 10 | 0 | 0 | Not yet cited | |
| 12 | ▲ Comparison (5/20) · head-to-head comparison (3/5) ▼ Evaluation (0/20) · open recommendation (0/5) | 16 | 9–23 | 15pp | 3 | 1 | 3 | 4 | 1 | Not yet cited | |
| 13 | ▲ Discovery (6/30) · emerging brands (3/5) ▼ Comparison (0/20) · top-brands lists (0/5) | 11 | 6–19 | 25pp | 4 | 0 | 0 | 1 | 5 | Not yet cited | |
| 14 | ▲ Comparison (2/20) · current-year picks (2/5) | 11 | 6–18 | 20pp | 2 | 4 | 3 | 1 | 0 | Not yet cited | |
| 15 | ▲ Filtered Discovery (3/30) · filtered persona pro (2/5) | 9 | 5–17 | 10pp | 0 | 0 | 2 | 2 | 2 | Not yet cited | |
| 16 | ▲ Discovery (4/30) · current-year picks (2/5) ▼ Comparison (0/20) · top-brands lists (0/5) | 5 | 2–12 | 15pp | 3 | 0 | 1 | 0 | 1 | Not yet cited | |
| 17 | ▲ Comparison (2/20) · head-to-head comparison (2/5) | 5 | 2–11 | 20pp | 0 | 1 | 0 | 0 | 4 | Not yet cited | |
| 18 | ▲ Discovery (3/30) · open recommendation (1/5) | 4 | 2–10 | 15pp | 3 | 1 | 0 | 0 | 0 | Not yet cited | |
| 19 | ▲ Discovery (2/30) · current-year picks (1/5) | 3 | 1–8 | 10pp | 0 | 0 | 0 | 0 | 2 | Not yet cited | |
| 20 | ▲ Discovery (0/30) | 1 | 0–5 | 0pp | 0 | 0 | 0 | 0 | 0 | Not yet cited |
Strategic insights for life insurers
Five derived metrics computed from the same data, surfacing how this segment behaves on AI search. See the State of AI Search for cross-segment comparison.
Engine agreement
0.52
Partial agreement
Effective brands
12.9 / 20
Wide open · top 2 take 24%
Top demand-leak brand
New York Life
+38pp Discovery vs Evaluation
Top mention-only brand
Lincoln Financial
54% of visibility is mention-only
Kingmaker engine by funnel phase
discovery
ChatGPT
83pp spread
filtered
ChatGPT
67pp spread
comparison
ChatGPT
75pp spread
evaluation
Grok
100pp spread
For each phase, the engine where the gap between most-cited and least-cited brand is widest, i.e. where positioning matters most. Win that engine, win that phase.
Brands cited most across the category
Aggregated across every (brand × prompt × engine) combination tested. The most-cited brands here are the names AI consistently surfaces when buyers ask about life insurers.
Emerging brands AI is citing in life insurers
Brand names AI engines surfaced for life insurers prompts that are not currently on the mapou tracked panel. Ranked by mention count and engine breadth. These are panel candidates, brands AI considers part of the category even though we are not yet measuring them.
Method. Aggregated across the canonical run for life insurers. For every (panel brand × prompt × engine) we record the brand names the analyzer extracted (capped at 6 per response), then drop names that match the tracked panel or its aliases, plus a denylist of generic category terms. Threshold to qualify: at least 3 mentions across at least 2 of 5 engines. Click any row to see the AI quote that surfaced the brand. Some entries may be tracked elsewhere on mapou but not in this segment, in which case AI considers them cross-category competitors. Reviewed monthly to inform panel additions.
How life insurers rankings shift by buyer persona
The MVI score is calibrated to a generic shopping-assistant prompt. But buyers don't arrive generically. We re-ran the same 20 canonical prompts five more times, each with a different buyer-persona signal in the system prompt: budget-conscious, premium, working professional, first-time, values-driven. Top-3 overlap with baseline: 53%. Leader holds across all personas: no. Haven Life loses the #1 spot to a different brand under at least one persona.
| Brand | Baseline | Budget | Premium | Pro | First-time | Values |
|---|---|---|---|---|---|---|
| Haven Life | 70% | 85% | 25% | 45% | 85% | 30% |
| MassMutual | 60% | 30% | 90% | 60% | 60% | 45% |
| Prudential | 60% | 20% | 60% | 45% | 40% | 20% |
| Northwestern Mutual | 55% | 20% | 95% | 55% | 35% | 35% |
| New York Life | 45% | 15% | 80% | 35% | 50% | 45% |
Each cell is the citation rate (out of 20 canonical prompts) for that brand under that persona, ChatGPT only. Cells are tinted green when a brand gains 5+ percentage points vs baseline, orange when it loses 5+. Strong tints flag a 20+ percentage-point swing. Top 8 baseline brands shown; full per-persona data is in data/research/persona-robustness/2026-05-07-1625/. The full methodology is on the State of AI Search page.
The 20-prompt taxonomy
Every brand in this report is tested against the same 20 canonical prompts, spanning the four MVI dimensions (Discovery, Filtered Discovery, Comparison, Evaluation). The prompt set is fixed at methodology v1.0 and reused every monthly run, so MVI deltas are paired comparisons not noise.
The exact prompt templates and phase-weighting formula are part of mapou's proprietary methodology, shared with paying clients alongside custom benchmarks for their specific brand.
See the framework →Methodology v1.0. MVI is mapou's proprietary 0-100 visibility score across 5 AI engines and 4 buyer-intent dimensions. 95% Wilson confidence intervals. Equal engine weighting. See the framework →
Run yours
Want to see your brand on this leaderboard? Run a free visibility check on your own brand. We'll show you exactly which prompts you're missing and which engines are losing you the most ground.