How accurate is the Risk Matching algorithm?

86% Top-3 accuracy (primary or secondary cause predicted within the top 3) and 72% Top-3 accuracy on the primary cause alone, validated on n=3,421 autopsies with a ±1.7pp confidence interval (V10, NB era + retrieval ensemble α=0.50).

What data goes into a Risk Match?

15 structural dimensions: sector, country, stage, total funding, moat type and strength, hype cycle, business model, B2B/B2C orientation, founding year, founder archetype, primary fatal mistake, collapse velocity, survival months, and peak valuation.

How does startup failure cause prediction work?

A Multinomial Naive Bayes classifier trained per era bucket predicts the primary and secondary failure cause. This is combined with a retrieval ensemble (α=0.50) and temporal decay reranking to achieve 86% Top-3 accuracy.
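As a rough illustration of the idea, the sketch below trains a small categorical Naive Bayes model with Laplace smoothing and blends its posterior with a retrieval signal at α=0.50. It is not the production model: the function names, the toy feature fields, and the way the retrieval signal is turned into a distribution (cause frequencies among the nearest matches) are all assumptions, and per-era training and temporal decay reranking are left out for brevity.

```python
import math
from collections import Counter, defaultdict

def train_nb(rows, smoothing=1.0):
    """rows: list of (features dict, cause). Returns a simple count-based model."""
    classes = Counter(cause for _, cause in rows)
    counts = defaultdict(lambda: defaultdict(Counter))  # cause -> field -> value -> count
    vocab = defaultdict(set)                            # field -> set of seen values
    for feats, cause in rows:
        for field, value in feats.items():
            counts[cause][field][value] += 1
            vocab[field].add(value)
    return {"classes": classes, "counts": counts, "vocab": vocab,
            "k": smoothing, "n": len(rows)}

def nb_posterior(model, feats):
    """P(cause | features) with Laplace (add-k) smoothing, computed in log space."""
    log_scores = {}
    for cause, c_count in model["classes"].items():
        lp = math.log(c_count / model["n"])
        for field, value in feats.items():
            num = model["counts"][cause][field][value] + model["k"]
            den = c_count + model["k"] * len(model["vocab"][field])
            lp += math.log(num / den)
        log_scores[cause] = lp
    peak = max(log_scores.values())
    exp = {c: math.exp(s - peak) for c, s in log_scores.items()}
    z = sum(exp.values())
    return {c: v / z for c, v in exp.items()}

def retrieval_distribution(neighbour_causes, classes):
    """Hypothetical retrieval signal: cause frequencies among the nearest matches."""
    counts = Counter(neighbour_causes)
    total = sum(counts.values())
    return {c: (counts[c] / total if total else 0.0) for c in classes}

def blend(nb, retrieval, alpha=0.5):
    """Ensemble: alpha * NB posterior + (1 - alpha) * retrieval signal."""
    return {c: alpha * nb[c] + (1 - alpha) * retrieval.get(c, 0.0) for c in nb}

def top_k(dist, k=3):
    """The k most probable causes, ties broken alphabetically."""
    return [c for c, _ in sorted(dist.items(), key=lambda kv: (-kv[1], kv[0]))[:k]]
```

A per-era variant would train one such model for each founding-era bucket and pick the model matching the startup's founding year before blending.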

What is the survival curve in the risk report?

A Kaplan-Meier survival curve computed from the matched corpus. It shows the probability that a structurally similar startup survives to 12, 24, and 36 months, based on historical data from the 4,998+ autopsy dataset.
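A minimal sketch of the Kaplan-Meier estimator behind such a curve is shown below. The function names and the sample durations are illustrative assumptions, not values from the dataset. Because every autopsy is a documented shutdown, all durations would normally be observed events; the `events` flag is kept only so the estimator stays general.

```python
def kaplan_meier(durations, events):
    """Return [(t, S(t))] at each distinct time where at least one shutdown occurred.

    durations: survival months per company.
    events: True if the shutdown was observed, False if the case is censored.
    """
    pairs = sorted(zip(durations, events))
    at_risk = len(pairs)
    survival = 1.0
    curve = []
    i = 0
    while i < len(pairs):
        t = pairs[i][0]
        deaths = 0
        leaving = 0
        # Group all companies sharing the same duration.
        while i < len(pairs) and pairs[i][0] == t:
            deaths += 1 if pairs[i][1] else 0
            leaving += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve

def survival_at(curve, month):
    """Step-function lookup: estimated probability of surviving beyond `month`."""
    value = 1.0
    for t, s in curve:
        if t > month:
            break
        value = s
    return value
```

Calling `survival_at(curve, m)` for m in 12, 24 and 36 gives the three probabilities the report displays, computed here over whatever matched cases are passed in.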

// failure intelligence · how it works

The only system that predicts collapse cause with verified precision.

A Naive Bayes + retrieval ensemble trained on 4,998 verified autopsies — drawn from a research dataset of 17,044. 86% Top-3 accuracy (72% primary cause only) · largest curated startup failure dataset. CB Insights and PitchBook do not publish predictive precision metrics because they do not have this model.

4,998

VERIFIED AUTOPSIES

from 17,044 total research

86%

TOP-3 ACCURACY

primary or secondary cause · n=3,421

72%

TOP-3 PRIMARY CAUSE

exact primary cause · ±1.7pp CI

17,044

RESEARCH DATASET

verified + web-enriched · staging

// data quality

What "editorially reviewed" means.

Every case in the database passes a structured review before it is published. The criteria are the same regardless of sector, geography or funding size.

Documented shutdown

The company must have demonstrably ceased operations, been liquidated, declared bankruptcy, or been acquired under distress. Pivots and rebrands are excluded.

Attributable cause

The primary cause of failure must be identifiable from public documentation — founder post-mortems, regulatory filings, press coverage, court records, or investor statements. If the cause cannot be substantiated, it is left unassigned.

Reconstructible timeline

Key milestones — founding, peak, first warning signals, shutdown — must be reconstructible from at least two independent sources. Survival months are computed from these dates, not estimated.

Human field assignment

Analytical fields — collapse style, hype cycle, moat type, fatal mistake, archetype — are assigned by a human reviewer, not generated automatically. Each field is a considered judgment, not a classification label from a model.

Cases that do not meet all four criteria are either excluded entirely or published with an "unverified" flag. The verified / unverified split is reflected in the precision metrics above: backtesting is always run on verified cases only.

// precision metrics

Verified backtesting results.

Validated on a stratified 80/20 split of 17,044 autopsies in the research pool. n=3,421 test entries, ±1.7pp CI.
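The page does not say how the ±1.7pp interval was derived. One reading that is consistent with n=3,421 is a 95% normal-approximation margin evaluated at the worst case p=0.5; the sketch below checks that arithmetic. Treat the method (and the z=1.96 choice) as an assumption rather than the documented procedure.

```python
import math

def margin_of_error(p, n, z=1.96):
    """Half-width of a normal-approximation confidence interval for a proportion."""
    return z * math.sqrt(p * (1 - p) / n)

n = 3421
worst_case = margin_of_error(0.50, n)  # conservative bound, independent of the observed rate
at_72 = margin_of_error(0.72, n)       # margin at the primary-cause accuracy
at_86 = margin_of_error(0.86, n)       # margin at the primary-or-secondary accuracy

for label, value in [("p=0.50", worst_case), ("p=0.72", at_72), ("p=0.86", at_86)]:
    print(f"{label}: ±{value * 100:.1f}pp")
```

Under this assumption the worst-case margin rounds to ±1.7pp, while the margins at the observed accuracies are somewhat narrower, so a single ±1.7pp figure would be a conservative bound for both metrics.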

86%

TOP-3 · PRIMARY OR SECONDARY CAUSE

"72 out of 100: exact primary cause in Top-3. 86 out of 100: real cause (primary or secondary) in Top-3."

Top-3 (primary or secondary): 86%

Top-3 (primary cause only): 72%

Version	Top-3	MRR	Dataset
V2	60.0%	—	~1,500
V3	61.3%	—	2,366
V4	73.0%	—	5,318
V5	80.4%	—	5,471
V6	61.9%	—	16,112
V8	68.1%	—	8,324
V9	86.1%†	—	17,044
V10 (current)	72% / 86%*	—	17,044

* V10: 72% primary cause only / 86% primary or secondary cause. NB era + retrieval ensemble α=0.50, n=3,421 ±1.7pp CI.
† V9: multi-label metric only (primary or secondary cause).
V6: dataset tripled (5k→16k) with external data; retraining from broader corpus reduced precision temporarily.
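The two headline metrics differ only in what counts as a hit. The sketch below shows one way to compute both from a list of backtest cases; the record layout (`ranked`, `primary`, `secondary`) is a hypothetical structure chosen for illustration.

```python
def top3_accuracy(cases):
    """Return (primary_only, primary_or_secondary) Top-3 accuracy.

    Each case is a dict with:
      ranked    - predicted causes, most likely first
      primary   - the documented primary cause
      secondary - the documented secondary cause, or None
    """
    primary_hits = 0
    either_hits = 0
    for case in cases:
        top3 = case["ranked"][:3]
        hit_primary = case["primary"] in top3
        secondary = case.get("secondary")
        hit_secondary = secondary is not None and secondary in top3
        primary_hits += int(hit_primary)
        either_hits += int(hit_primary or hit_secondary)
    n = len(cases)
    return primary_hits / n, either_hits / n
```

By construction the second number can never be lower than the first, which is why the primary-or-secondary figure (86%) sits above the primary-only figure (72%).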

// model pipeline

Five steps in the risk analysis.

Each analysis runs the same deterministic pipeline. Every score is traceable to concrete signals.

Sector filter across 4,998 autopsies. If fewer than 15 candidates, expands to adjacent sectors with a penalty.

Deterministic

12 weighted signals: sector, moat, country, archetype, hype cycle, business model and more. 5 bonus functions.

12 signals

Adjusted by cause frequency in the real dataset. Prevents over-weighting common causes like 'competition'.

Calibrated on 4,998 cases

Multinomial Naive Bayes + era bucket + retrieval ensemble (α=0.50). Estimates P(cause | sector, region, business model, funding, archetype, founding era) with Laplace smoothing. Ensemble blends 50% NB with 50% retrieval signal from top-50 structural matches. 86% Top-3 accuracy (72% primary cause only), validated on n=3,421 ±1.7pp CI.

4,998 verified · 24h cache

Window of candidates with close scores. Final boost by founder archetype and matching secondary cause.

Top 10 · score ≥ 30

Exact parameters, bonus functions and Bayesian adjustment values are proprietary and not published. The pipeline is auditable: every score is traceable to concrete signals.
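The first and last steps above (sector filter with adjacent-sector fallback, then a Top 10 cut at score ≥ 30) can be sketched as follows. The published thresholds 15, 30 and 10 come from the text; the penalty factor of 0.9, the adjacency map, and the function names are hypothetical, since the real values are proprietary.

```python
def build_candidate_pool(autopsies, sector, adjacent, min_pool=15, penalty=0.9):
    """Return [(autopsy, factor)]; adjacent-sector cases carry a penalty factor.

    adjacent: dict mapping a sector to the sectors considered close to it (assumed).
    penalty:  illustrative multiplier for adjacent-sector cases (assumed value).
    """
    pool = [(a, 1.0) for a in autopsies if a["sector"] == sector]
    if len(pool) < min_pool:
        neighbours = adjacent.get(sector, ())
        pool += [(a, penalty) for a in autopsies if a["sector"] in neighbours]
    return pool

def rank_matches(pool, score_fn, min_score=30, limit=10):
    """Score each candidate, apply its factor, drop weak matches, keep the best `limit`."""
    scored = [(score_fn(a) * factor, a) for a, factor in pool]
    kept = [item for item in scored if item[0] >= min_score]
    kept.sort(key=lambda item: item[0], reverse=True)
    return kept[:limit]
```

Because both functions are pure and take the scoring function as an argument, the same inputs always produce the same ranked list, which matches the deterministic behaviour the pipeline describes.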

// model signals

The six most predictive signals.

Approximate weights based on backtesting. The model uses 12 signals in total.

Sector · 20%

Primary pool filter. Highest discriminative power. Model anchor.

Country / Region · 14%

Exact or regional match. Captures regulatory and macroeconomic context.

Moat strength · 12%

Most predictive field in backtesting: +14.8% accuracy when present.

Founder archetype · 10%

+9.3% confirmed predictive power. Founder profile correlates with collapse type.

Hype cycle · 9%

Market cycle at founding. Neutralises temporal penalties.

Fatal mistake + semantics · 9%

New in V5. Exact and text-similarity matching of the primary fatal mistake. Includes unit-economics sub-classification (burn rate, CAC/LTV, margin compression).

Exact weights and internal model parameters are proprietary.
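To make the weighting concrete, here is a simplified sketch that scores a startup against one autopsy using only the six approximate weights listed above. The six weights sum to 74%, so the sketch renormalises over them; the remaining six signals, the bonus functions, the regional and text-similarity matching, and the field names are all omitted or assumed.

```python
# Approximate weights from the list above; the other six signals are not published.
WEIGHTS = {
    "sector": 0.20,
    "country": 0.14,
    "moat_strength": 0.12,
    "founder_archetype": 0.10,
    "hype_cycle": 0.09,
    "fatal_mistake": 0.09,
}

def similarity_score(startup, autopsy, weights=WEIGHTS):
    """Weighted exact-match score on a 0-100 scale, renormalised over the known weights."""
    total = sum(weights.values())
    matched = sum(
        weight
        for field, weight in weights.items()
        if startup.get(field) is not None and startup.get(field) == autopsy.get(field)
    )
    return round(100 * matched / total, 1)
```

For example, a startup that matches an autopsy on sector and country only would score about 46 in this simplified scheme, because those two signals carry 34 of the 74 weight points considered.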

// reading the results

What the numbers actually mean.

The output is not a grade. Each number has a precise technical meaning that determines how to use it.

The score is not a failure probability

A score of 72/100 means the startup shares 72% of structural patterns with companies that failed in this pool, not that it has a 72% chance of failing. The model identifies resemblance, not destiny.

Structural similarity

Top-3 is more reliable than Top-1

The model identifies the primary failure cause in the Top-3 in 72% of analyses (86% when secondary cause also counts). Collapse causes overlap — if the cause you suspect appears in the Top-3, the signal is strong.

n=3,421 · ±1.7pp CI

How to read the matches

Score ≥ 65: strong match, review the autopsy in detail. Score 40–64: moderate, relevant patterns but structural differences exist. Score 30–39: weak, pool is small, treat with caution. Closer score = more weight on that collapse timeline.

Min threshold: 30
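The score bands above translate directly into a small lookup. The function name and the returned labels are illustrative; the cut-offs (65, 40, 30) are the ones stated in the text.

```python
def interpret_score(score):
    """Map a match score (0-100) to the reading guidance given above."""
    if score >= 65:
        return "strong"           # review the autopsy in detail
    if score >= 40:
        return "moderate"         # relevant patterns, structural differences exist
    if score >= 30:
        return "weak"             # small pool, treat with caution
    return "below threshold"      # not shown: minimum score is 30
```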

When to trust the model more — and less

More reliable: SaaS, Fintech, Marketplace (>100 autopsies), well-defined hype cycle, funding <$200M. Less reliable: Crypto/Web3 (excluded by default), regulatory cause outside US/Europe, funding >$500M, pure timing collapse.

Context-dependent precision

// known limitations

Where the model is less precise.

Transparency about areas where precision falls below the model average.

Regulation

Below-average precision. Few entries with well-documented regulatory cause outside the US and Europe.

Expanded with 10,500+ LatAm/India/Africa/SEA entries

Market timing

Hard to distinguish from 'market fit' without external signals. Pure timing signals are the least discriminative.

V7: isotonic regression with real feedback

Crypto / Web3

Low internal diversity. Crypto collapse patterns differ from any other sector.

Sector excluded from matching by default

// responsible use

A signal, not a verdict.

Risk scores and pattern matches are structured intelligence signals derived from real historical cases. They are designed to surface patterns worth investigating — not to replace primary research, financial modelling, or professional judgement.

Use UnicornBurn the way you would use any professional intelligence platform: as a focused starting point for deeper analysis. A high score means the structural profile closely resembles companies that failed — it does not mean the company will fail. A low score does not mean the company is safe. Always conduct independent due diligence. For the full scope of how outputs should and should not be used, see our Terms of Service 8.