A Systemic-Risk Monitoring Framework for High-Valuation Markets

Working paper · May 12, 2026

Abstract

This paper presents a systemic-risk monitoring framework for global financial markets as of May 12, 2026. It distinguishes (i) discrete trigger events, (ii) slow-moving state amplifiers, (iii) downstream outcomes, and (iv) feedback-loop transmission pathways, and translates this taxonomy into an operational monitoring dashboard.

The framework's strongest empirical screen is regime-conditional history. Using Robert Shiller's monthly CAPE series through September 2023, extended through May 2026 with Multpl/GuruFocus monthly readings, only two distinct episodes have reached CAPE ≥40 in 145 years: January 1999 - September 2000 (peak 44.19 in December 1999) and the current March 2026 onward episode. The 1999-2000 episode contributes 21 monthly starting points; 13 of those 21 monthly windows experienced a peak-to-trough ≥25% drawdown within the subsequent 24 months — a rolling-window coverage rate of 61.9%. Because these 21 monthly observations are highly overlapping within a single completed historical episode (n = 1 at the episode level), the 61.9% figure is a descriptive historical stress signal, not a calibrated independent-event probability or confidence-bounded forecast. The paper does not report a confidence interval for this regime: as documented in Appendix K, a stationary block-bootstrap on this single-cluster sample produces a degenerate distribution (the 5th percentile for the broader CAPE ≥35 regime sits above the point estimate) and does not have a frequentist confidence-interval interpretation.

Several state amplifiers are elevated simultaneously: Shiller CAPE in a provider/date range of approximately 40.3–42.2 in early May 2026 (GuruFocus May 1; Multpl May 11); Buffett Indicator 231.7% on May 6, 2026 (Longtermtrends/MacroMicro); official SSGA SPY top-10 weight 39.28% and Magnificent 7 share-class aggregate 34.69% as of May 8, 2026; AI capex shifting from operating cash flow to debt financing (BIS Bulletin 120, published January 7, 2026; BIS Quarterly Review follow-on, March 2026); approximately $14 trillion in 2026 US investment-grade bond supply per Apollo's Torsten Slok (March 24, 2026); private-credit redemption pressure across non-traded BDCs (BCRED, BlackRock HPS, Morgan Stanley North Haven, Cliffwater funds in Q1 2026; Moody's BDC sector outlook cut to negative on April 7, 2026); and an oil shock characterized by the IEA as "the largest disruption in history" — a 10.1 mb/d global supply fall in March 2026 (IEA Oil Market Report, April 14, 2026).

The paper retains two transparent scenario-generation layers, regenerated from a single frozen parameter file as of May 12, 2026 (seed 42, 200,000 trials). Layer A is a factor-copula trigger co-occurrence simulation across eight discrete trigger pathways. Layer B is a structured second-stage severity simulation. Base-case scenario-generator outputs:

Layer A: P(≥1 modeled trigger fires within 24 months) = 76.7%; P(≥2) = 54.6%; P(≥3) = 38.3%; expected trigger count = 2.24.
Layer B: P(trigger-mediated drawdown ≥25% within 24 months) = 49.9% at slope = 6 base calibration; slope-sensitivity range 16.7% (slope=4) to 67.9% (slope=8); P(≥40%) = 10.4% at base.

These are conditional scenario outputs given a frozen parameter set, not empirically calibrated forecasts. Trigger marginals, factor loadings, severity weights, policy-effectiveness distributions, and the slope parameter mapping severity to drawdown percentages are all author priors anchored to external evidence but not estimated from history. The model layers are useful for channel identification (which transmission pathways are most worth monitoring), trigger-conditional uplift (e.g., iran_hormuz firing raises basis_trade probability 1.53×; ai_capex firing raises taiwan probability 1.80×), and slope sensitivity. They are not estimates of true market odds.

The dashboard layer (Section 12) translates the framework into operational monitoring signals. A pre-specified rule of VIX ≥25 sustained for 5 trading days was backtested against eight historical drawdown episodes 1995-2024 using FRED VIXCLS data. The rule detected 3 of 8 episodes pre-peak — Dot-com 2000 (157 days), GFC 2008 (55 days), 2022 cycle (28 days) — with 38 distinct false-positive events outside episodes over 30 years (≈1.27/year). The rule misses sentiment-driven (Q4 2018), exogenous (COVID 2020), fast-acting (August 2024), and pre-1995-style (LTCM 1998, Europe 2011) drawdowns. Threshold sensitivity (22.5/25/27.5/30) and duration sensitivity (3/5/7/10 days) are reported in §12. These are preliminary backtest results on a pre-specified sample, not validation across the universe of all S&P 500 drawdowns above a fixed threshold.

The framework's appropriate use is stress-testing portfolio concentration, liquidity, private-credit exposure, and oil/Fed-correlation risk under adverse drawdown scenarios; not directional positioning advice and not a calibrated crash forecast. The author is not a registered investment adviser; readers should consult appropriate fiduciaries for portfolio decisions. The framework is symmetric: §13 specifies both the conditions under which tail-risk concern should be reduced and the conditions under which the bull case should be upgraded.

Reading Guide

Section	What it answers
§1	The motivating configuration: records and low implied volatility against multi-dimensional stress
§2	Verified market, valuation, concentration, and macro anchors as of May 8-12, 2026
§3	Taxonomy: triggers vs. amplifiers vs. outcomes — definitions and the framework diagram
§4	Five amplifiers (AI capex debt shift, IG supply, concentration, valuation, CRE wall)
§5	Eight discrete triggers with current-state evidence and marginal probabilities
§6	Six feedback-loop transmission pathways
§7	Probability framework: Layer C (history), Layer A (scenario-generator co-occurrence), Layer B (scenario-generator severity)
§8	Policy reaction functions and trigger-specific effectiveness
§9	Historical analog review
§10	September 2026 - February 2027: a catalyst-dense monitoring window (heuristic)
§11	Three conditional severity paths (qualitative scenario weights)
§12	Leading-indicators dashboard (selected-sample backtest results)
§13	Falsification, bull-case upgrade criteria, Brier scoring, auditable forecast log
§14	Conclusion
Appendices A-K	Capex reconciliation, evidence anchors, code, source register, reproducibility manifest, statistical caveats

1. Introduction

The starting point matters. As of Friday May 8, 2026, the S&P 500 closed at 7,398.93 — a record close, up 0.84% on the day and the sixth consecutive weekly gain. The VIX closed at 17.19, below its long-term average. WTI crude settled at $95.42; Brent at $101.29. Hyperscaler Q1 2026 earnings had beaten consensus across the Big Four (Microsoft, Alphabet, Amazon, Meta), and Wall Street strategists raised year-end targets through May 11: JPMorgan (Dubravko Lakos-Bujas) to 7,600 on April 21; RBC Capital Markets (Lori Calvasina) to 7,900 on May 8; HSBC (Nicole Inui) to 7,650 on May 11; Yardeni Research (Ed Yardeni) to 8,250 on May 11. Bitcoin traded near $80,000 and gold near $4,700/oz.

This is true while:

The IEA reports a 10.1 mb/d global oil supply fall in March 2026 — characterized in its April Oil Market Report as "the largest disruption in history."
The Strait of Hormuz has been substantially closed since late February 2026; pre-war crude+condensate transit of ~20 mb/d fell to ~2-4 mb/d in March-April 2026.
IEA Director Fatih Birol stated on Norges Bank's In Good Company podcast (reported by CNBC April 1, 2026) that 12 mb/d of oil supply is lost — "more than two of these oil crises [1973 and 1979] put together"; on April 13, 2026 at the Atlantic Council he revised this figure to about 13 mb/d.
The Bank of Japan held its policy rate at 0.75% on April 28, 2026 in a 6-3 vote, with three dissenters (Hajime Takata, Naoki Tamura, Junko Nakagawa) proposing a rise to 1.0%.
Powell's term as Fed Chair ends May 15, 2026; as of publication time, Kevin Warsh's Senate confirmation vote was scheduled for 11:30 a.m. ET on May 12, 2026, with cloture invoked 49-44 the prior day.
Trump rejected Iran's latest peace counter-proposal as "TOTALLY UNACCEPTABLE" on May 10, 2026; WTI rose ~2% on May 11.

This is the central configuration the paper examines. The market prices relief and low conditional risk against a backdrop of unresolved structural stress on multiple dimensions. Either (a) something has materially changed about productivity, asset demand, or policy capacity such that elevated state variables are less threatening than history would suggest, or (b) the conditional asymmetry of outcomes is being mispriced and a catalyst that engages two or more feedback loops simultaneously could produce a sharp move down.

The honest counterweights are real. Q1 2026 earnings came in strongly: FactSet's "Earnings Insight" (Butters, May 8, 2026) reports 84% of S&P 500 companies have beaten EPS estimates with an average surprise of 18.2% — both above 5- and 10-year averages and the highest since Q1 2021. Forward 12-month P/E is 21.0 (FactSet, May 8), above the 5-year average 19.9 and the 10-year average 18.9. Bridgewater's Greg Jensen (February 23, 2026) estimates Big Tech 2026 AI capex at approximately $650B (~$410B in 2025). A reasonable bull thesis exists and is reflected in market pricing.

This paper should be read as a fragility framework. It does not claim that valuation, oil, AI capex, or leverage independently cause a crash. It asks whether the current configuration increases the market's sensitivity to adverse catalysts, identifies the most likely transmission channels, and translates this into operational monitoring signals. The framework's central empirical claim — that the only previous time markets reached the CAPE ≥40 regime, the subsequent path was a substantial drawdown across nearly every monthly entry point in that episode — is single-episode evidence (n = 1 at the episode level) with all the limitations that entails.

2. Current Market State

All values are timestamped to specific providers; intraday quotes are flagged. Full source-register entries appear in Appendix I; the reproducibility manifest appears in Appendix J.

2.1 Index levels (May 8-11, 2026)

Indicator	Value	Source	Date/Time
S&P 500	7,398.93 (close)	FRED `SP500`; CNBC live blog	May 8, 2026 (Fri)
S&P 500 (intraday)	~7,412 record high	CNBC	May 11, 2026
Nasdaq Composite	26,247.08 (close)	TheStreet/Nasdaq	May 8, 2026
Russell 2000	2,861.21 (close)	TheStreet/AP	May 8, 2026
Dow Jones Industrial	49,609.16 (close)	TheStreet	May 8, 2026
VIX	17.19 (close)	FRED `VIXCLS`; Cboe	May 8, 2026
VIX (intraday)	~18.41	Yahoo Finance	May 11, 2026
WTI front-month	$95.42 (settle)	CNBC; CME	May 8, 2026
WTI (intraday)	~$97-98	TradingEconomics	May 11, 2026
Brent front-month	$101.29 (settle)	CNBC; ICE	May 8, 2026
Brent (intraday)	$100-104 range	TradingEconomics	May 11, 2026
Bitcoin (spot)	~$80,800-82,200	Fortune; Yahoo Finance	May 11, 2026, 9:15 a.m. ET
Gold (spot)	~$4,700-4,747	Fortune; USAGOLD	May 11, 2026
10-yr Treasury	4.38% (close)	Fed H.15; FRED `DGS10`	May 8, 2026
MOVE Index	67.25 (intraday)	Yahoo Finance/ICE BofAML	May 11, 2026
ICE BofA HY OAS	2.81% (close, May 8); 2.79% (close, May 7)	FRED `BAMLH0A0HYM2`	May 2026

This is a market that has fully recovered from the March-April 2026 Iran-war low and is making new highs, while the underlying oil shock remains the largest in IEA records.

2.2 Valuation

Metric	Value	Historical reference	Source
Shiller CAPE	provider/date range: 40.33 (GuruFocus, May 1, 2026) to 42.15 (Multpl, May 11, 2026)	All-time high 44.19 (Dec 1999); historical mean ~17.8 (1881-2026 monthly)	Multpl; GuruFocus; YCharts; MacroMicro
Buffett Indicator	231.7% (May 8, 2026)	All-time high ~230.3% (Jan 2026); ~2.4 σ above long-run trend	GuruFocus; MacroMicro; Current Market Valuation
Forward 12-month P/E	21.0 (May 8, 2026)	5-yr avg 19.9; 10-yr avg 18.9	FactSet Earnings Insight, May 8, 2026
Excess CAPE Yield (10-yr real expected return proxy)	~1.3-2.0% range	—	Author calculation from CAPE

The 231.7% Buffett Indicator print is effectively a new all-time high relative to the prior January 2026 peak of ~230.3%. CAPE ≥40 has occurred in only two distinct episodes in 145 years of Shiller's monthly data: January 1999 - September 2000 and March 2026 - present. The decisive empirical link between CAPE level and drawdown frequency comes from §7.3.

2.3 Concentration

Concentration is measured from official SSGA SPY holdings rather than aggregator dashboards. As of May 8, 2026:

Rank	Holding	Weight in SPY
1	NVIDIA	8.24%
2	Apple	6.78%
3	Microsoft	4.86%
4	Amazon	4.20%
5	Alphabet Class A	3.68%
6	Broadcom	3.21%
7	Alphabet Class C	2.93%
8	Meta	2.10%
9	Tesla	1.90%
10	Berkshire Hathaway B	1.38%
	Top 10 total	39.28%

The Magnificent 7 share-class aggregate (NVIDIA + Apple + Microsoft + Amazon + Alphabet A/C + Meta + Tesla) is 34.69% of SPY as of May 8, 2026. This is the figure used throughout the paper, replacing the earlier Motley Fool 33.7% reference (April 14, 2026, retained only as dated context).

Top 10 in Russell 1000 Growth: approximately 56-60% in early 2026 (FTSE Russell factsheet).

The arithmetic amplification is real but bounded: a 10% drawdown limited to the Magnificent 7 share-class aggregate alone would mechanically subtract approximately 3.47 percentage points from SPY before second-round effects. Cap-weighted index funds do not mechanically force-sell on price declines; weights simply adjust, and selling occurs only with investor redemptions, vol-targeted deleveraging, leveraged-position margin calls, and active rebalancing. The dashboard (§12) distinguishes mechanical index arithmetic from flow-driven second-round effects.

2.4 Macro indicators

April 2026 nonfarm payrolls (BLS, May 8, 2026): +115,000; unemployment unchanged at 4.3%; AHE +0.2% m/m, +3.6% y/y. February revised to -156k; March revised up to +185k. Healthcare +37k, transportation/warehousing +30k, retail +22k, information -13k.
Q4 2025 real GDP, third estimate (BEA, April 9, 2026): +0.5% annualized, revised down from 0.7% second estimate; full-year 2025 +2.1%. BEA notes the October-November 2025 federal shutdown subtracted ~1.0 pp.
Inflation: above the Fed's 2% target with oil-driven upside pressure resumed since the Iran conflict began. BOJ's April 28 Outlook Report raised FY2026 core CPI to 2.8% from 1.9%.
Fed Chair transition: Powell's term ends May 15, 2026. Kevin Warsh cleared Senate Banking Committee 13-11 on April 29 (CNBC, Al Jazeera); cloture invoked 49-44 on May 11 (Roll Call), with Senators Fetterman and Coons crossing party lines; as of publication time, the final confirmation vote was scheduled 11:30 a.m. ET, May 12, 2026. Powell will remain on the Board of Governors (term through 2028).
Bank of Japan, April 28, 2026: held uncollateralized overnight call rate at 0.75% in 6-3 vote. Dissenters Takata, Tamura, and Nakagawa proposed raising to 1.0%. FY2026 GDP cut to 0.5% from 1.0%; FY2026 core CPI raised to 2.8% from 1.9%. Source: BOJ, "Statement on Monetary Policy," April 28, 2026.
US federal debt: $38.91 trillion as of May 5, 2026 (Treasury Fiscal Data, "Debt to the Penny"; JEC Debt Dashboard, updated May 8, 2026). Projected to cross $39T around May 18, 2026. Average interest rate on marketable debt 3.373% (April 2026). YoY change +$2.70T. Debt held by the public lower (~$30T); gross debt/GDP ~130%, debt-held-by-public/GDP ~100%.

2.5 The bull case

Q1 2026 earnings (FactSet, May 8, 2026): 89% reported; 84% positive EPS surprise rate (highest since Q2 2021); 18.2% aggregate surprise magnitude (highest since Q1 2021's 22.2%); blended revenue growth 11.3%.
Forward growth expectations: Q2 EPS +19.9%, Q3 +23.2%, Q4 +20.7%, full-year 2026 +21.0%.
Strategist target revisions (April 21 - May 11, 2026): JPM 7,600; RBC 7,900; HSBC 7,650; Yardeni 8,250.
AI productivity narrative remains intact in earnings reports; Bridgewater's Greg Jensen (Feb 23, 2026) estimates 2026 Big Tech AI capex contributes ~140 bps to US GDP growth.

The bear case (this paper's principal subject) coexists with these. The framework does not assert the bull case is wrong; it asks whether the conditional asymmetry of outcomes makes additional stress-test attention warranted. The bull case has its own upgrade criteria in §13.

3. Taxonomy: Triggers, Amplifiers, Outcomes

The framework's organizing principle: distinguish discrete events that could fire (triggers), state variables that determine response magnitude (amplifiers), and downstream consequences (outcomes). This distinction matters because the categories are often conflated — for example, treating "valuation extremes" or "regional bank failure" as triggers. Both are wrong: valuation is a state variable that amplifies the response to any trigger; regional bank failure is an outcome that emerges from CRE state plus a credit shock.

Compact diagram:

[ Trigger ] → engages [ Amplifier(s) ] → activates [ Feedback Loop(s) ] → produces [ Outcome ]

Example: AI capex disappointment → engages valuation/concentration/debt-financing →
  activates Loop A (capex-IG supply-yields) → produces sector re-rating + IG spread widening

3.1 Eight triggers

A trigger is a discrete event that could occur within the analysis horizon and that would, by itself, materially reprice at least one market segment.

AI capex / revenue disappointment that materially reprices AI infrastructure equity and credit.
Iran / Hormuz escalation beyond the current state — a kinetic step beyond what is currently priced.
Treasury basis trade unwind — forced deleveraging at one or more large funds with Treasury-market dislocation.
Yen carry unwind — material liquidation (≥15% of relevant positions, approximate threshold) with cross-asset spillover.
Major AI-enabled cyber incident on a globally systemic financial institution or shared infrastructure.
Private-credit semi-liquid fund forced gating, wind-down, or sponsor injection event (BCRED-style but more severe).
Stablecoin de-peg with measurable contagion into money markets or Treasury bills.
Taiwan kinetic escalation — a discrete military step beyond current posture.

3.2 Five amplifiers

Amplifiers are slow-moving state variables that determine how large the response to any trigger will be.

Elevated valuation — CAPE ≈40, Buffett Indicator ≈232%. Low expected forward returns; high downside skew on disappointment.
Index concentration — Magnificent 7 share-class aggregate 34.69%, SPY top 10 39.28% (official SSGA, May 8, 2026).
Treasury / IG supply pressure — approximately $14T 2026 IG supply per Apollo/Slok.
AI capex debt-financing shift — BIS Bulletin 120 (published January 7, 2026) documents leading tech firms increasingly using external financing; couples AI narrative risk directly to corporate-credit markets.
CRE refinancing wall — $875B (17% of $5.0T outstanding) maturing in 2026 per MBA (February 9, 2026).

3.3 Three downstream outcomes (not triggers)

Recession — outcome variable, often caused by triggers 2 (oil-Fed channel), 3 (Treasury market dislocation), or 5 (cyber operational impact).
Regional bank failure — downstream of CRE state variable plus credit shock.
Bond-equity correlation breakdown — outcome of the oil-inflation-Fed loop (Loop C).

4. Five Amplifiers (Detail)

4.1 AI capex debt-financing shift

BIS Bulletin 120, "Financing the AI Boom: From Cash Flows to Debt" (Aldasoro, Doerr, Rees), published January 7, 2026 (bis.org/publ/bisbull120.htm), documents that leading technology firms — Amazon, Alphabet, Microsoft, Meta, and Oracle — are increasingly using external financing for AI investment, with free cash flows now lagging capital expenditures in absolute amounts. The BIS Quarterly Review of March 2026 published a follow-on piece, "Financing the AI infrastructure boom: on- and off-balance sheet borrowing" (Aldasoro/Doerr/Rees), at bis.org/publ/qtrpdf/r_qt2603u.htm.

Capex estimates carry a range. Full source-by-source reconciliation appears in Appendix A. Headline anchors with evidentiary grading:

Estimate	Status
~$650B (Bridgewater, Greg Jensen letter Feb 23, 2026, four Big Tech firms)	Directly sourced analyst estimate
~$690-725B (Big Four hyperscaler post-Q1 2026 consensus)	Constructed/consensus from Q1 earnings-call guides
~$750B (CreditSights, five hyperscalers including Oracle)	Constructed/analyst (April 2026)
~$800B (global, including Chinese hyperscalers and CoreWeave)	Order-of-magnitude estimate

The evidence supports a material shift toward external financing for AI capex; the exact dollar scale remains estimate-dependent. Bridgewater estimates AI capex contributed ~50 bps to 2025 US GDP growth and projects ~140 bps in 2026. This is amplifier coupling: AI narrative risk feeds directly into IG credit spreads via hyperscaler debt issuance.

4.2 Treasury and IG supply pressure

Apollo Academy / Torsten Slok (March 24, 2026): "Ten trillion dollars in existing US government debt will need to be refinanced over the coming 12 months… The budget deficit this year is about $2 trillion. Total gross corporate bond issuance in 2026 is likely to be around $2 trillion because of increased supply from hyperscalers. Adding it all up, the total amount of investment grade supply coming to the market this year is around $14 trillion." Source: apolloacademy.com/14-trillion-in-ig-supply-coming-to-the-market.

The OECD Global Debt Report 2026 (March 2026, DOI 10.1787/e9d80efd-en) provides corroborating context:

Total government + corporate bond market borrowing was $27T in 2025; projected $29T in 2026 (17% above 2024, double the level a decade ago).
OECD sovereign gross borrowing reached a record $17T in 2025, projected $18T in 2026.
2025 refinancing requirements were $13.5T — approximately 80% of gross borrowing, a new high.
AI hyperscalers: nine firms raised $122B in bond markets in 2025 (nearly half of all tech-firm issuance).

This is an analyst estimate plus an official OECD compendium, not an official Treasury forecast.

4.3 Index concentration

Documented in §2.3. The amplification mechanism is conditional, not automatic. The dashboard (§12) tracks both mechanical contribution (Magnificent 7 weight × Magnificent 7 return) and flow-driven indicators (ETF redemption volumes, volatility-targeted fund deleveraging).

4.4 Valuation extremes

Documented in §2.2. The decisive empirical link between CAPE level and drawdown frequency comes from regime-conditional analysis in §7.3 — this is the framework's strongest single empirical claim, anchored on one completed historical episode (1999-2000).

4.5 CRE refinancing wall

MBA's 2025 Commercial Real Estate Survey of Loan Maturity Volumes (released February 9, 2026 at the CREF Convention): 17% of $5.0 trillion of outstanding commercial and multifamily mortgages — approximately $875 billion — is scheduled to mature in 2026, a 9% decline from $957B scheduled to mature in 2025. Source: mba.org/news-and-research/newsroom/news/2026/02/09/17-percent-of-commercial-and-multifamily-mortgage-balances-to-mature-in-2026.

Property-type breakdown of 2026 maturities: hotel/motel 30%, industrial 23%, office 17%, healthcare 15%, multifamily 13%. Investor-type breakdown: depositories $396B, CMBS/CLO/ABS $200B, credit companies/warehouse/other $163B, life insurance $76B, GSEs $39B. 2027 maturities are $652B.

MBA's CREF Loan Performance Survey Q1 2026 (April 27, 2026): overall commercial mortgage delinquency rate 4.02%, up from 3.86% in Q4 2025. CMBS 5.21% (up from 4.97%); life insurance 1.47%; GSE 0.97%; FHA 0.96%. CRED iQ parallel data: March 2026 CMBS distress rate (including specially serviced loans) ~12%.

Bank exposure: Florida Atlantic University's screener using Q3 2025 Call Report data finds 59 of the 155 largest US banks have CRE exposures exceeding 300% of equity capital (down from 62 banks the previous quarter). Aggregate CRE/equity ratio fell from 138% to 132%. This is industry-derived academic screening based on official Call Report data; it is not an FDIC-published quarterly tally. The 300% threshold derives from interagency supervisory guidance (FDIC FIL-23-2023; reaffirming 2006 OCC/Fed/FDIC joint guidance). Identified high-concentration banks (Q4 2024 vintage screener): Flagstar, Zions Bancorp, Valley National, Synovus, Umpqua, Old National — each >$50B in total assets.

This is a slow amplifier — distress is realized at refinancing dates over years — but it shapes the latent loss curve under any rate-rise or credit-shock scenario.

5. Eight Triggers (Detail)

For each trigger: current-state description, empirical anchor for the marginal probability, and low/base/high probability range used in the Monte Carlo. The base probability is in each case an author prior informed by the listed evidence; it is not mechanically derived from the anchors. Reproducibility flags appear alongside each anchor.

5.1 AI capex / revenue disappointment

Current state. Hyperscaler capex is increasingly debt-funded. Q3 and Q4 2026 earnings will be the first concrete tests of whether AI revenue is converting at the pace required. Specific stress points:

Microsoft tracking $120B+ for FY26 per company guidance; analyst extensions to ~$190B for calendar 2026 (CreditSights; Yahoo Finance/om.co).
Alphabet raised 2026 capex guide to $180-190B from $175-185B (CFO Anat Ashkenazi, Q1 earnings call).
Amazon ~$200B for 2026 (Feb 2026 guidance); Morgan Stanley sees FY26 free cash flow -$17B; BofA -$28B.
Meta raised 2026 capex range to $125-145B from $115-135B; stock -6% after-hours on the print.
Oracle: $50B+ 2026 capex (Futurum/Introl); capex-to-revenue 86%; RPO $523B. Moody's downgraded Oracle's outlook to Negative from Stable in September 2025; rating maintained at Baa2 (lowered-end investment grade), reaffirmed in Moody's December 15, 2025 Credit Outlook. S&P loosened its downgrade trigger to 4× debt/EBITDA from 3.5×. Barclays Fixed Income Research (analyst Andrew Keches) downgraded Oracle's debt to Underweight on November 11, 2025, citing capex-cash gap and counterparty risk from the ~$300B OpenAI agreement; flagged that cash reserves could be depleted by November 2026 and debt could fall to BBB-. Note: this is a debt/credit downgrade; Barclays' equity research (Raimo Lenschow) maintained Overweight through the period.
Michael Burry (Scion Asset Management): On May 10, 2026 via his Cassandra Unchained Substack, Burry disclosed that approximately 80% of Scion's portfolio is in NVDA + PLTR puts (notional ~$1.1B): 1,000,000 NVDA puts strike $110 expiring December 2027; PLTR puts strike $100 (December 2026) and $50 (June 2027) plus a direct short. Scion was deregistered after the Q3 2025 13F, so disclosure is via Substack/X, not a 13F filing. (TradingKey; Foreign Policy Journal, May 7, 2026.)

Empirical anchors. Polymarket "AI bubble burst in 2026?" was trading near 16-28% YES as of May 11-12, 2026 (multi-outcome market; specific outcome and snapshot time matter for the precise number). Historical base rate for sector re-ratings ≥30% from peak valuation within 24 months: approximately 35-45%. Bloomberg NVDA option surfaces show 12-month left-tail skew implying P(-20%+ move) ≈ 18-22% (proprietary surface; illustrative).

Marginal probability (12-month): low 15%, base 25%, high 40%.

5.2 Iran / Hormuz escalation

Current state. A ten-week-plus closure of the Strait of Hormuz against a fragile, repeatedly violated ceasefire. Iran's response to Trump's latest peace proposal was rejected as "TOTALLY UNACCEPTABLE" on May 10-11. WTI rose 2%+ on May 11.

Disaggregation:

IEA-reported supply fall (realized, March 2026): 10.1 mb/d ("the largest disruption in history" — IEA OMR, April 14, 2026).
Pre-war Hormuz crude+condensate transit: ~20 mb/d, falling to ~2-4 mb/d in March-April 2026 (the flow disruption, not the realized supply loss, because of bypass via Saudi Red Sea ports, UAE Fujairah, and Iraq-Türkiye pipeline).
Saudi Aramco CEO Amin Nasser: ~100 million barrels per week potentially at risk if Hormuz remains closed (TradingEconomics; Reuters; May 10-11, 2026).
IEA Director Fatih Birol (Norges Bank In Good Company podcast; CNBC April 1, 2026): "Today, we lost 12 million barrels per day — more than two of these oil crises [1973 and 1979] put together." On April 13, 2026 at the Atlantic Council in Washington, Birol revised this to ~13 mb/d shut-in with more than 80 energy facilities damaged.
IEA emergency release: March 11, 2026 — IEA members agreed to release 400 million barrels from emergency reserves.

Empirical anchors (Polymarket, May 11-12, 2026 snapshots). "US-Iran peace deal by May 31, 2026?" 13-28% YES across market configurations (paper uses 17.5%); "Strait of Hormuz traffic returns to normal by May 15?" ~0.7%; "Iran agrees to end uranium enrichment by June 30?" ~25-28%. Polymarket prices are sensitive to liquidity, settlement-text mapping, manipulation risk, and participant composition; all values are time-of-day specific and used as triangulation, not calibration.

Marginal probability of meaningful further escalation (12-month): low 20%, base 30%, high 45%.

5.3 Treasury basis trade unwind

Current state. OFR Brief 26-01 (March 3, 2026), "Hedge Fund Participation in Cleared Repo," states: "Recent estimates suggest that about 75% of all hedge fund Treasury repo activity is not centrally cleared." (financialresearch.gov/briefs/2026/03/03/hedge-fund-participation-cleared-repo). A companion OFR blog (January 29, 2026) estimates that under the SEC central-clearing rule, 77% would have been cleared versus the actually-cleared 45% of daily Treasury repo outstanding in the first 8 months of 2025.

FSB published "Vulnerabilities in Government Bond-backed Repo Markets" on February 4, 2026 (fsb.org/2026/02/vulnerabilities-in-government-bond-backed-repo-markets/; PDF at fsb.org/uploads/P040226.pdf). Key statistics:

~$16 trillion in repo trades backed by government bonds outstanding at end-2024 (~80% of total repo stock); up ~20% since 2022.
US accounts for ~60% of activity; ~40% of repo outstanding involves counterparties in different jurisdictions.
~70% of the non-centrally cleared segment operates with zero haircuts.

The SEC's Treasury central-clearing rule (compliance dates extended February 25, 2025) is effective December 31, 2026 for eligible cash market transactions and June 30, 2027 for eligible repo. The SEC approved CME Securities Clearing as a Treasury Covered Clearing Agency (December 1, 2025) and ICE Clear Credit (January 30, 2026), expanding CCP options beyond FICC.

Earlier industry summaries referencing >$6T in repo financing, 18:1 leverage, and 73.8% of repo at zero/negative haircuts are directionally consistent with the OFR/FSB picture but are treated here as directional rather than specifically sourced. The substantive claim — that the safest market is now the most leveraged, with policy ammunition more constrained than in 2020 — holds.

Empirical anchors. Historical base rate for Treasury-market dislocations requiring Fed intervention since the SRF was established in 2021: zero realized (the August 2024 yen carry tremor produced a near-miss). Prior to 2021: roughly one episode per 5-10 years (1998 LTCM, 2008 GFC funding stress, March 2020). MOVE index at 67-72 in early May 2026 is well below the 130+ threshold historically associated with stress periods. No direct prediction-market anchor exists.

Marginal probability (12-month): low 8%, base 12%, high 20%.

5.4 Yen carry unwind

Current state. BOJ held at 0.75% on April 28, 2026 in a 6-3 vote, with three dissenters (Takata, Tamura, Nakagawa) voting to raise to 1.0%. The August 5, 2024 yen tremor (-12.4% Nikkei on a single day; second-largest percentage drop in Nikkei history after October 1987) is the relevant precedent. Carry-trade size estimates vary by source; Appendix C provides a structured exposure matrix rather than a single estimate.

Empirical anchors. Polymarket BOJ-rate markets that were active in 2024-2025 are not currently active for the 2026 horizon. USD/JPY 1-year implied volatility at approximately 9-10% (CME/Bloomberg, May 2026) implies P(10%+ JPY appreciation within 12 months) ≈ 12-18% under a lognormal flat-vol approximation (this is not a risk-neutral or model-free estimate; treated as illustrative). The August 2024 historical precedent — ~5% of relevant positions liquidating produced a -12.4% Nikkei single-day move and ~3% S&P move — is the relevant base for what a "material" unwind transmits.

Marginal probability of >15% material unwind (12-month): low 10%, base 18%, high 30%.

5.5 Major AI-enabled cyber incident

Current state. Anthropic announced Project Glasswing on April 7, 2026 (anthropic.com/glasswing; anthropic.com/project/glasswing): an initiative to secure critical software powered by Claude Mythos Preview, which Anthropic reports identified "thousands of zero-day vulnerabilities" across every major OS and browser. Launch partners include AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks. Anthropic committed $100M in Mythos Preview usage credits + $4M in direct donations to open-source security organizations. Anthropic stated it does not plan to release Mythos Preview generally. (Note: the "thousands of zero-days" is a self-reported Anthropic claim, not an independently audited finding.)

The IMF published "Financial Stability Risks Mount as Artificial Intelligence Fuels Cyberattacks" on May 7, 2026 (imf.org/en/blogs/articles/2026/05/07/...) authored by Tobias Adrian, Tamás Gaidosch, and Rangachary Ravikumar. Verbatim: "IMF analysis suggests that extreme cyber-incident losses could trigger funding strains, raise solvency concerns, and disrupt broader markets." The blog explicitly references Claude Mythos Preview as evidence of how quickly risks are increasing.

Empirical anchors. SEC public-company cyber incident disclosures under Item 1.05 of Form 8-K (effective December 18, 2023): per Debevoise & Plimpton's Cybersecurity Incident Disclosure Tracker (April 10, 2026 update), there have been approximately 44 cumulative Item 1.05 filings through April 10, 2026 — Dec 2023: 2; 2024: 25; 2025: 13; Q1 2026: 3 plus 2 additional in early April. Six (~14%) were filed by financial-sector (SIC 6000-series) issuers including First American Financial, South State, Prudential, FHLBNY, B. Riley, and BayFirst Financial. Item 1.05 filing pace slowed in 2025 vs 2024 after SEC Corp Fin Director Erik Gerding's May 2024 Statement redirecting non-material incidents to Item 8.01.

The IMF's May 7, 2026 framing argues that the AI-enabled rate is likely above this historical baseline.

Marginal probability of a financially material AI-enabled cyber incident (12-month): low 5%, base 10%, high 20%.

5.6 Private credit fund forced gating

Current state. Fitch reports US private credit default rates of 9.2% in 2025 (Reuters, March 6, 2026) — a record in Fitch's Private Markets Research series (begun 2018); up from 8.1% in 2024. Smaller issuers (EBITDA ≤$25M) had a 15.8% default rate vs 4% for larger borrowers (EBITDA >$100M). Six of eight cases with ultimate recoveries paid in full. Fitch monitored 302 companies and recorded 38 defaults among 28 borrowers.

The FSB published "Report on Vulnerabilities in Private Credit" on May 6, 2026 (fsb.org/2026/05/report-on-vulnerabilities-in-private-credit/; PDF P060526.pdf). Key data:

Total private-credit market: $1.5-2.0T at end-2024 (FSB official scope); commercial estimates of "broad private credit" reach $3.5T (ACC/AIMA Financing the Economy 2025 with Houlihan Lokey).
Geographic concentration: US ~$1T; euro area + UK; smaller markets in Canada, Hong Kong, Japan, Switzerland, South Africa.
Bank direct exposure: FSB official estimate ~$220B drawn + undrawn credit lines to private-credit funds; commercial estimates $270-500B. Bank private-credit exposure <0.5% of total bank assets.
Retail investor share rose from ~0% to ~13% over the past 10 years.
Insurer exposure: ~10% of life insurer portfolios may be private credit.

BCRED Q1 2026 redemption event (verified across CNBC March 5, 2026; Bloomberg March 2, 2026; AltsWire): Investors requested redemptions of 7.9% of NAV (approximately $3.7-3.8 billion) against a standard 5% cap. Blackstone raised the cap from 5% to 7% and injected $400 million of internal capital ($250M from the firm + $150M from >25 senior executives), honoring 100% of requests. BCRED Q1 NAV was approximately $45B (down from $47.6B at year-end 2025); non-accruals 2.4% at cost / 1.4% at fair value; Medallia (mark 60.3) and Affordable Care (69.8) were cited as top contributors to mark declines.

Industry-wide stress (Bloomberg March 11, 2026; Reuters; Alternative Credit Investor April 7, 2026):

Cliffwater Corporate Lending Fund (~$33B AUM): redemptions ~14% of NAV; capped withdrawals.
BlackRock HPS Corporate Lending Fund (~$26B AUM): $1.2B redemption requests (9.3% of NAV); paid out $620M under the 5% cap.
Morgan Stanley North Haven Private Income Fund (~$7.6B AUM): redemptions 10.9% of shares; returned ~$169M (45.8% of requests).
Blue Owl Credit Income Corp: 21.9% of shares requested redemption in Q1 2026; paid ~14% under cap.
Blue Owl Technology Income Corp: 40.7% of shares requested redemption; paid ~14% under cap.
Moody's BDC sector outlook cut from Stable to Negative on April 7, 2026 (Reuters), citing "rising redemption pressures, higher leverage, and weakening access to funding markets." Total BDC sector assets ~$400B; non-traded BDCs >60% of sector AUM.

Empirical anchors. Historical base rate for semi-liquid credit fund gating events: approximately one major event per 3-5 years in stress periods. Current Fitch default rate (9.2%) is the record in Fitch's PMR series and is consistent with elevated forward gating risk.

Marginal probability of additional forced gating event (12-month): low 8%, base 15%, high 25%.

5.7 Stablecoin de-peg with contagion

Current state. Total stablecoin market capitalization $322.7B as of May 10, 2026 per DefiLlama (cryptonews.net); USDT $189.6B (58.76% dominance); USDC $78.96B. The GENIUS Act was signed July 18, 2025 (Senate 68-30 on June 17; House 308-122 on July 17), establishing the US framework for payment stablecoins: 100% reserve backing with USD or short-term Treasuries; monthly public reserve disclosures; Bank Secrecy Act applicability; insolvency priority for stablecoin holders; classification as non-securities/non-commodities. Eligible issuers limited to insured depository institution subsidiaries, OCC-approved federal nonbanks, and state-qualified issuers. Effective date: earlier of 18 months after enactment or 120 days after final rules.

Tether Q1 2026 attestation (BDO Italia ISAE 3000R, published late April/early May 2026; tether.io/news/...): As of March 31, 2026 — total assets $191.77B; total liabilities $183.54B; direct US Treasury bill holdings $117.0 billion (line item: $117,035,732,050); direct + indirect T-bill exposure (including reverse repos and money-market funds) ~$141 billion (Tether's headline number); excess reserves $8.23B (record); Q1 net profit $1.04B; gold $19.84B (~132 metric tons at XAU $4,668.06/oz).

The systemic transmission mechanism — stablecoin redemptions forcing Treasury-bill sales — operates on the direct T-bill subset ($117B). The headline $141B includes indirect exposure that would also be liquidated in stress but with longer paths.

Empirical anchors. Historical base rate: UST/Luna (May 2022, ~$60B destruction, contagion contained to crypto), USDC brief de-peg (March 2023 SVB exposure, recovered within 48 hours). Approximately one event per 1-2 years in crypto markets, with only one cross-market-contagion event historically.

Marginal probability of de-peg with measurable TradFi contagion (12-month): low 2%, base 5%, high 12%.

5.8 Taiwan kinetic escalation

Current state. Bloomberg Economics' February 10, 2026 update, "The $10 Trillion Fight: Modeling a US-China War Over Taiwan" (bloomberg.com/news/articles/2026-02-10/...) estimates that in the most extreme case, a US-China conflict over Taiwan would cost the global economy about $10.6 trillion, roughly 9.6% of global GDP, in the first year alone — "eclipsing the impact of the Covid-19 pandemic and the 2007-09 global financial crisis." First-year impacts by economy: China -11%, US -6.6%, Taiwan -40%, S. Korea -23.3%, Japan -14.7%, EU -10.9%, India -8%, UK -6.1%. The probability is low; the magnitude is extreme.

PLA Q1 2026 activity around Taiwan: large-scale amphibious exercises opposite Taiwan in late April; routine Liaoning carrier transits April 20; East China Sea patrols April 18. ODNI's mid-March 2026 Annual Threat Assessment concluded PLA "likely lacks plans for a 2027 offensive or sooner." Beijing announced 10 new economic incentives for Taiwan on April 12, prioritizing gray-zone coercion over kinetic conflict.

Empirical anchors (Polymarket, May 11-12, 2026 snapshots).

Polymarket market	Implied probability
Will China invade Taiwan by June 30, 2026?	~0% (paper uses 1.8%)
Will China invade Taiwan by end of 2026?	~7% (paper uses 7.4%)
China x Taiwan military clash before 2027?	~9-12% (paper uses 12.0%)
Will China invade Taiwan by Dec 31, 2027?	~17-18%

The framework's "Taiwan kinetic escalation" trigger maps most closely to "military clash before 2027" at approximately 9-12%, not the narrower "full invasion" framings.

Marginal probability of discrete kinetic step (12-month): low 4%, base 9%, high 15%.

6. Feedback Loops

The feedback-loop topology is the framework's most durable analytical contribution. Each loop describes a self-reinforcing transmission pathway: a trigger creates pressure that engages a state variable that creates further pressure on the same or adjacent triggers.

Loop A: AI capex ↔ IG supply ↔ yields ↔ AI valuations. Hyperscaler debt issuance lifts IG supply → marginal-buyer pressure raises yields → AI cash flows discounted lower → if AI revenue misses, Oracle credit (and credit of similarly-positioned issuers) widens → loop tightens. Already partially activated (Moody's Oracle outlook negative September 2025; Barclays fixed-income downgrade November 2025).

Loop B: AI capex ↔ private credit ↔ BDC retail flows ↔ public credit. Private credit funds AI infrastructure via asset-based finance and JV debt. A 20-30% capex compression → NAV write-downs → BCRED-style redemptions → fire sales into public credit. The Q1 2026 BCRED/HPS/Cliffwater/Morgan Stanley/Blue Owl events were a low-magnitude test; the system bent but did not break.

Loop C: Iran ↔ oil ↔ inflation ↔ Fed ↔ bond-equity correlation. $100+ Brent keeps inflation sticky → Fed cannot cut → bonds and equities fall together → 60/40 fails → risk-parity forced selling. The IMF's April 2026 GFSR identified this directly: more frequent supply shocks have eroded the equity-bond hedging relationship, raising risks of simultaneous deleveraging in both asset classes.

Loop D: Yen carry ↔ Mag 7 ↔ passive flows ↔ index concentration. BOJ hikes → carry unwinds → tech selling concentrated in Mag 7 → because the Mag 7 share-class aggregate is ~34.7% of SPY (and the S&P 500 broadly), index moves mechanically. Cap-weighted funds do not themselves force-sell on price declines; index weights simply adjust. Second-round selling pressure comes from passive redemptions, volatility-targeted deleveraging (risk-parity and vol-control funds), margin calls on leveraged positions, and active/quant rebalancing. 0DTE gamma hedging on Mag 7 single-stock options amplifies intraday velocity. August 5, 2024 (Nikkei -12.4%) is the partial precedent.

Loop E: Basis trade ↔ Treasury liquidity ↔ everything. Leveraged hedge funds + repo financing (75% non-centrally cleared per OFR Brief 26-01). Vol spike → margin calls → Treasury dislocation → because Treasuries are the global risk-free curve, all assets reprice. March 2020 precedent. SEC's central-clearing rule (effective December 31, 2026 for cash, June 30, 2027 for repo) is the right structural fix, but the transition itself creates risk.

Loop F: Private credit / NBFI ↔ banks ↔ refinancing risk. EU banks' 9.2% NBFI exposure (FSB, EBA reporting) is NOT the same as direct private-credit exposure. Three sub-channels:

Direct on-balance-sheet bank exposure to private-credit funds: materially smaller than 9.2% (FSB May 2026: ~$220B drawn + undrawn, <0.5% of bank assets); not itemized in standard bank capital reporting.
Broad NBFI exposure (the 9.2% figure): includes hedge funds, family offices, sovereign wealth, asset managers, and private credit. Most is non-correlated with private-credit stress.
Indirect linkage: SRT (significant risk transfer) trades, warehouse facilities, repo against private-credit collateral, and shared counterparty risk. This is the real systemic concern. If private-credit funds retrench, the SRT investor base shrinks, raising bank refinancing risk.

Loop evidence ratings:

Loop	Conceptual strength	Evidence strength	Quantified?
A — AI/IG/yields	High	Medium-high	Partly
B — AI/private credit/BDC	High	Medium-high	Partly
C — Oil/Fed/correlation	High	High	Partly (IMF GFSR)
D — Yen/Mag 7/passive	Medium	Medium	Weakly
E — Basis/Treasury liquidity	High	High	Partly (OFR/FSB)
F — Private credit/NBFI/banks	High	Medium	Weakly

The factor-copula structure in §7.4 captures cross-loop correlation: triggers that share factor loadings cluster, so loop activations are correlated through latent risk-off, inflation/Fed, liquidity/funding, and tech/AI factors.

7. Probability Framework

The framework uses a multi-layer stack. The dominant layer is empirical-historical, not modeled. Historical regime-conditional data answers a sharper question than any model output: given that markets have historically started from CAPE regimes like the current one, what is the realized frequency of forward drawdowns?

The layers, in order of priority for decision-making:

Layer C-conditional (§7.3): Historical rolling-window drawdown coverage rate, conditioned on starting CAPE regime. The strongest empirical anchor — but at the episode level, n = 1 for CAPE ≥40.
Layer C-unconditional (§7.2): All-history rolling-window drawdown coverage rate. Context.
Layer A (§7.4): Scenario-generator trigger co-occurrence via factor-copula Monte Carlo. Not a calibrated probability forecast.
Layer B (§7.5): Scenario-generator trigger-mediated severity via structured second-stage Monte Carlo. Not a calibrated probability forecast.
Layer D (§7.6): Dashboard-conditioned posterior updating as monitoring indicators fire.

§7.7 brings the layers together. They should not be combined into a single number.

7.1 Approach: empirical anchor over modeled forecast

A model is only as good as its calibration. The eight-trigger Monte Carlo carries parameter uncertainty in three places: marginal probabilities (author priors anchored to external evidence but not directly extracted from market prices), correlation structure (factor loadings calibrated to economic logic, not estimated), and severity transmission (slope parameter calibrated to historical analogs ranging from 3 to 14, with median 5-7). Every output therefore depends on the frozen parameter set in Appendix J.2; nothing in Layers A and B is an empirical probability of a future event.

Conversely, the historical 24-month rolling-window drawdown coverage rate starting from CAPE ≥40 — computed below — is descriptive empirical history for the available sample. The sample is dominated by one completed episode (January 1999 – September 2000); this is one historical episode, not many independent observations. The computation is fully reproducible (Appendix F).

The framework therefore leads with §7.3 (regime-conditional history), uses §7.2 (unconditional history) for context, and treats the scenario-generator layers §7.4 (trigger co-occurrence) and §7.5 (trigger-mediated severity) as channel-identification augmentation rather than probability forecasts.

7.2 Layer C unconditional: all-history base rates

Methodology.

S&P 500 monthly close, 1881-2026: Robert Shiller monthly series through September 2023, extended October 2023-May 2026 as described below.
Price-only index (excludes dividends).
Forward-extension beyond September 2023 (last month of Shiller's published file): monthly S&P 500 close + monthly CAPE from Multpl/GuruFocus, October 2023 - May 2026 (33 additional monthly observations).
For each rolling k-month window (k ∈ {12, 18, 24, 36}), find the maximum peak-to-trough drawdown from the rolling all-time high within the window.
Score window = 1 if max drawdown within the window ≥ D; average across all rolling windows.
This is coverage frequency, not event-count frequency — the answer to "starting from a randomly selected month, what is the probability that the next k months include a ≥D drawdown."

Empirical results (1881-2026, monthly rolling-window coverage frequency):

Drawdown threshold	P(within 12 mo)	P(within 18 mo)	P(within 24 mo)	P(within 36 mo)
≥ 10%	37.8%	51.6%	63.0%	81.0%
≥ 15%	21.6%	32.7%	42.7%	60.7%
≥ 20%	13.5%	21.8%	28.9%	42.3%
≥ 25%	7.4%	13.6%	19.9%	30.9%
≥ 30%	4.6%	7.1%	10.1%	17.4%
≥ 40%	2.0%	3.7%	5.2%	8.8%
≥ 50%	0.7%	1.6%	2.6%	4.0%

Reproducible code in Appendix F. This is the unconditional 1881-2026 monthly base rate. It averages across all starting regimes, including the many low-CAPE windows that mechanically have less downside available.

7.3 Layer C-conditional: regime-conditional historical coverage frequency

The unconditional rate is a misleading comparison for the current market. CAPE ~40 is between the 95th and 99th percentile of the 1881-2026 monthly CAPE distribution. Markets starting from this regime have a different historical track record than the average starting month.

Methodology.

S&P 500 monthly close 1881-2026 (Shiller through September 2023 plus the documented October 2023-May 2026 extension).
Shiller's published monthly CAPE 1881-2023, extended through May 2026 with Multpl monthly readings.
For each rolling 24-month window, filter starting months by CAPE regime.
Average drawdown coverage frequency within each regime.

CAPE distribution 1881-2026 (monthly values, equal-weighted by month):

Statistic	Value
Mean	17.75
Median	16.60
75th percentile	21.45
90th percentile	28.02
95th percentile	32.84
99th percentile	40.99
All-time max	44.20 (December 1999)

Current CAPE (~40.4 as of May 8, 2026) is at the right tail — between the 95th and 99th percentile.

Regime-conditional 24-month forward drawdown coverage frequency (central table):

Starting CAPE regime	N monthly windows	P(≥10%)	P(≥20%)	P(≥25%)	P(≥30%)	P(≥40%)
All starting months	1,722	63.0%	28.9%	19.9%	10.1%	5.2%
Low CAPE (<15)	690	59.7%	23.3%	14.2%	4.6%	1.0%
Medium (15-25)	775	65.0%	32.7%	24.0%	12.9%	7.1%
High (25-35)	208	59.6%	23.1%	19.7%	15.9%	13.0%
Very high (≥35)	49	91.8%	71.4%	36.7%	18.4%	2.0%
Extreme narrow (≥38)	27	100.0%	85.2%	55.6%	22.2%	3.7%
Extreme (≥40) — current	21	100.0%	90.5%	61.9%	19.0%	4.8%
Very extreme (≥42, 1999 only)	13	100.0%	100.0%	69.2%	23.1%	0.0%

12-month forward (for shorter-horizon planning):

Starting CAPE regime	P(≥10%)	P(≥20%)	P(≥25%)	P(≥30%)
All starting months	37.8%	13.5%	7.4%	4.6%
Very high (≥35)	50.0%	23.3%	1.7%	0.0%
Extreme (≥40)	47.6%	33.3%	4.8%	0.0%

Headline. In the available historical sample, monthly rolling 24-month forward windows beginning from CAPE ≥40 had a 61.9% coverage frequency of a peak-to-trough ≥25% drawdown (13 of 21 monthly observations). Uplift versus unconditional: 3.11x. The 36-month coverage frequency at this regime reaches 100% at ≥25% and 61.9% at ≥40% — meaning all 21 monthly windows experienced a ≥25% drawdown within 36 months.

This is rolling-window coverage frequency, not an independent-event probability. Because these windows are highly overlapping and concentrated in a small number of distinct historical regimes, the result should be interpreted as a historical stress signal anchored on one episode, not as a calibrated 24-month forecast probability.

Distinct CAPE ≥40 episodes (1881-2026 monthly data):

Episode	Start	End	Months	Max CAPE	Max CAPE date	Forward 24m max DD range	24m P(≥25%)
Dot-com peak	1999-01	2000-09	21	44.19	1999-12	10.4% - 40.9% (mean 26.8%)	13/21 = 61.9%
Current	2026-03	2026-05+	3 (and continuing)	40.55	2026-04	Incomplete — paper publication May 12, 2026	N/A

At CAPE ≥38, broader sample (5 distinct episodes): 1998-07; 1998-12 to 2000-11 (24 months); 2021-11 to 2021-12 (2 months); 2024-11 to 2024-12 (2 months); 2025-09 to 2026-05 (9 months and ongoing). The 2021 and 2024 peaks (38.58 and 38.78 respectively) did not cross 40. Note on cross-period sourcing: the 2021 reading (38.58) comes from Shiller's published file (period covered by Shiller through September 2023); the 2024 reading (38.78) comes from the Multpl/GuruFocus forward extension documented in Appendix J.4a. Both sources are documented in §7.2 methodology; the regime filter does not change at any plausible tiebreaking choice.

Uncertainty: why this paper does not report a confidence interval. The stationary block-bootstrap output for this regime is shown below as a diagnostic, not as a confidence interval. It is not statistically interpretable for this sample. The bootstrap output itself displays the diagnostic that exposes the problem:

Regime	Threshold	Point estimate	Bootstrap 5th pct	Bootstrap 50th pct	Bootstrap 95th pct
CAPE ≥ 35	≥25%	36.7%	60.7%	85.1%	100.0%
CAPE ≥ 38	≥25%	55.6%	63.0%	91.7%	100.0%
CAPE ≥ 40	≥25%	61.9%	57.7%	97.4%	100.0%

The CAPE ≥35 row shows the issue starkly: the bootstrap 5th percentile (60.7%) lies above the point estimate (36.7%). A 5th percentile above the point estimate is a pathology, not a feature. It happens here because block bootstrap on a rare regime concentrated in one historical cluster either resamples around that cluster (producing very high coverage) or misses it entirely (producing replications discarded as undefined). The resulting distribution does not have a frequentist confidence-interval interpretation. Appendix K documents this in full. A Wilson interval treating the 21 windows as independent Bernoulli trials (which they are not — they overlap) would be approximately [44%, 77%] at 90% confidence; even that is not appropriate because the observations come from one regime.

The honest statement is: n = 1 at the episode level for CAPE ≥40, and the 61.9% figure is descriptive history within that one completed episode.

Non-overlapping-window sensitivity (sample every 24 months so windows do not overlap): only 1 truly independent observation at CAPE ≥40 (the 1999 episode contributes 1 non-overlapping window with peak-to-trough max drawdown 40.9%, exceeding 25%). At CAPE ≥35 there are 2 non-overlapping observations (1/2 hit 25%). This confirms that the 61.9% figure is fundamentally a single-episode observation rather than a 21-window statistical result.

Caveats.

Sample composition. The CAPE ≥40 sample of 21 monthly observations is dominated by the January 1999 – September 2000 dot-com episode. The current episode (started March 2026) cannot yet contribute completed 24-month forward observations.
Causation vs coincidence. We cannot fully separate "high CAPE causes drawdowns" from "the specific episode that produced a drawdown happened to coincide with high CAPE." The robust interpretation: in the one completed historical regime where CAPE reached current levels, the subsequent path was a substantial drawdown across nearly every monthly entry point in that episode. The proximate cause was the dot-com bust; the present episode's proximate cause has not yet been determined.
Single-episode dominance. The 61.9% point estimate is functionally equivalent to: "in the one completed historical episode where CAPE ≥40, 13 of 21 monthly entry points were followed by a ≥25% peak-to-trough drawdown within 24 months."
Forward extrapolation. The empirical signal points to elevated fragility from current valuation conditions. It does not imply that a ≥25% drawdown is 61.9% likely within the next 24 months as an unconditional forecast. The right framing is: "the only previous time we saw this configuration, the subsequent path was almost always a substantial drawdown — but n = 1 at the episode level."

This is computed historical coverage rate, not model output. The framework's scenario-generator layers (§7.4-7.5) add channel-identification value, but they do not transform this single-episode evidence into a calibrated probability.

7.4 Layer A: trigger co-occurrence scenario generator (factor-copula Monte Carlo)

This is a scenario-generation layer. Outputs are conditional on the frozen parameter set; they are not calibrated probability forecasts. For each of the eight triggers, a 12-month marginal probability prior is specified (§5). Co-occurrence is generated via Gaussian copula with a four-factor latent correlation structure:

F1 Risk-off (affects all triggers)
F2 Inflation/Fed (affects oil-driven and credit triggers)
F3 Liquidity/funding (affects Treasury, repo, carry, stablecoin)
F4 Tech/AI (affects AI-specific and Taiwan-via-chips)

Factor loadings:

Trigger	Risk-off	Inflation	Liquidity	Tech/AI	Idiosyncratic
ai_capex	0.40	0.10	0.30	0.65	0.32
iran_hormuz	0.40	0.70	0.20	0.10	0.30
basis_trade	0.45	0.25	0.70	0.05	0.24
yen_carry	0.40	0.30	0.55	0.30	0.36
cyber	0.30	0.05	0.20	0.20	0.83
private_credit	0.40	0.25	0.55	0.20	0.43
stablecoin	0.30	0.10	0.50	0.15	0.63
taiwan	0.50	0.20	0.20	0.45	0.47

12-month marginal probabilities (regenerated frozen parameter file, seed=42, 200,000 trials):

Trigger	12m base	24m implied
ai_capex	0.25	0.4375
iran_hormuz	0.30	0.5100
basis_trade	0.12	0.2256
yen_carry	0.18	0.3276
cyber	0.10	0.1900
private_credit	0.15	0.2775
stablecoin	0.05	0.0975
taiwan	0.09	0.1719

Average pairwise binary correlation: 0.25 (range 0.10 to 0.42).

Base case results (24-month horizon, factor copula, 200,000 trials):

Metric	Value
P(≥1 trigger fires)	76.7%
P(≥2 triggers fire)	54.6%
P(≥3 triggers fire)	38.3%
P(≥4 triggers fire)	25.7%
P(≥5 triggers fire)	16.0%
Expected trigger count	2.24

Conditional cluster amplification (factor-driven):

Conditioning	Target trigger	P(unconditional)	P(conditional)	Uplift
iran_hormuz fires	basis_trade	22.6%	34.6%	1.53x
iran_hormuz fires	yen_carry	32.7%	47.7%	1.46x
iran_hormuz fires	private_credit	27.9%	40.4%	1.45x
iran_hormuz fires	taiwan	17.3%	25.8%	1.49x
ai_capex fires	taiwan	17.3%	31.1%	1.80x
ai_capex fires	yen_carry	32.7%	51.9%	1.59x

The factor structure correctly captures cluster behavior: shared loadings on the liquidity factor pull basis_trade, yen_carry, and private_credit together; shared loadings on tech/AI pull ai_capex and taiwan (via chip-supply concerns) together. Cyber, with mostly idiosyncratic variance (~83%), barely correlates with anything.

Code in Appendix D.

7.5 Layer B: trigger-mediated severity scenario generator (structured Monte Carlo)

This is a scenario-generation layer. Outputs are conditional on the frozen parameter set; they are not calibrated probability forecasts. A second-stage Monte Carlo takes Layer A trigger vectors and maps them to drawdown distributions via:

Primary severity = max(standalone severity) among fired triggers.
Additional severity = 0.5 × (n_fired − 1) (diminishing returns from additional triggers).
Coupling term = 0.3 × average liquidity coupling of fired triggers.
Loop boost = sum of activated-loop boost values (additive, capped at 1.5).
Policy offset = stochastic, weighted by trigger standalone severity; per-trigger p_eff_i calibrated from §8.7.
Net severity = gross × (1 − policy offset).
Drawdown = N(slope × net_severity, std), clipped to [0, 90%].

The slope parameter is the central calibration choice. Historical analogs imply:

LTCM 1998 (net severity ~2, realized 19.3% S&P drawdown): slope ~9-10.
GFC 2008 (net severity ~5, realized 57%): slope ~11-14.
August 2024 yen tremor (net severity ~2, realized 6-8%): slope ~3-4.
Median historical anchor: slope ~5-7.

The base case uses slope = 6.0 (median). Sensitivity range slope ∈ {4, 8} reflects whether you believe current state resembles "fast-policy-response" (slope=4) or "GFC-style policy slow" (slope=8) more closely. Policy-offset parameters are scenario assumptions, not estimated coefficients.

Base case results (factor copula, 24-month horizon, slope = 6.0, 200,000 trials):

Threshold	P(trigger-mediated drawdown ≥ threshold over 24 months)
≥10%	74.9%
≥15%	71.6%
≥20%	63.9%
≥25%	49.9%
≥30%	33.8%
≥40%	10.4%
≥50%	1.5%

Slope sensitivity (P(≥25%, 24mo) and P(≥40%, 24mo)):

Slope	Interpretation	P(≥25%)	P(≥40%)
4.0	Fast-policy-response (Aug 2024 anchor)	16.7%	0.4%
5.0	Conservative	34.0%	3.0%
6.0	Median historical anchor (base)	49.9%	10.4%
7.0	Moderate-aggressive (LTCM anchor)	61.4%	21.4%
8.0	Aggressive (GFC anchor)	67.9%	33.7%

Global sensitivity check (base slope = 6.0, 200,000 trials). The slope parameter remains the largest single severity-mapping choice, but other frozen assumptions also move the output. These variants perturb one assumption family at a time and leave all other inputs unchanged; they are robustness checks, not alternative forecasts.

Variant	P(≥2 triggers)	P(drawdown ≥25%)	P(drawdown ≥40%)	Interpretation
Base frozen file	54.6%	49.9%	10.4%	Published scenario-generator baseline
Trigger marginals −20%	45.4%	42.1%	7.9%	Lower author-prior trigger rates
Trigger marginals +20%	62.6%	56.9%	13.0%	Higher author-prior trigger rates
Factor loadings −20%	59.9%	51.2%	9.2%	Less clustering, more independent trigger incidence
Factor loadings +20%	48.1%	48.1%	11.9%	More clustering, fewer multi-trigger trials but fatter tails when clustered
Policy offsets +20%	54.6%	43.0%	6.6%	More effective policy response
Policy offsets −20%	54.6%	56.0%	15.2%	Less effective policy response
Loop boosts set to zero	54.6%	40.6%	4.5%	Transmission loops removed
Loop boosts +25%	54.6%	51.4%	11.2%	Stronger loop amplification

Conditional on trigger count (base case, 24-month, slope = 6.0):

Triggers fired	P(trial)	Mean drawdown	P(≥10%)	P(≥25%)	P(≥40%)
0	23.3%	1.0%	0.0%	0.0%	0.0%
1	22.1%	22.8%	93.3%	39.9%	1.1%
2	16.3%	26.5%	98.4%	56.3%	4.8%
3	12.6%	29.6%	99.7%	70.6%	10.4%
4	9.8%	32.6%	99.9%	82.8%	18.3%
5+	16.0%	37.8%	100.0%	93.9%	39.4%

Internal consistency check: P(0 triggers) = 23.3% = 1 − P(≥1) = 100% − 76.7%. ✓

The structured MC gives an actual model output (not author overlay) for the scenario-generator's severity distribution. At base calibration, the scenario-generator output is P(trigger-mediated ≥25% drawdown within 24 months) = 49.9%. This is conditional on the frozen parameter set: change the slope from 6 to 4 and the figure becomes 16.7%; change to 8 and it becomes 67.9%. The structured MC's value is not in the headline number but in the slope sensitivity, the trigger-conditional structure (showing how severity scales with number of triggers fired, which channels couple most strongly), and the factor-conditional uplift results in §7.4.

Code in Appendix E.

7.6 Layer D: dashboard-conditioned posterior

The dashboard (§12) is the operational expression of Bayesian updating. As Tier 1 indicators fire, the framework's qualitative concern level should shift toward the upper end of the trigger-marginal ranges (§5) and the upper end of the slope sensitivity (§7.5). Empirical lead time of VIX ≥25 sustained 5 days is 28-157 days in 3 of 8 historical drawdown episodes (§12.1). Dashboard-conditioned updating is operationalized via Brier Skill Score against trigger-specific climatology (§13.3).

7.7 How the layers fit together

The layers answer different questions and should not be combined into a single number. In particular, the Layer C historical anchor (descriptive history) and the Layer B scenario-generator output (modeled conditional on frozen parameters) are not the same kind of object and should not be averaged or aggregated.

Layer	Question answered	Headline output
Layer C-conditional (CAPE ≥40)	Historical rolling-window coverage rate from current regime	61.9%; n = 1 at episode level; no statistically meaningful CI
Layer C-conditional (CAPE ≥35)	Broader current-regime coverage	36.7%
Layer C unconditional	All-history monthly average	19.9%
Layer A (scenario generator)	Trigger co-occurrence given frozen priors	P(≥2 in 24mo) = 54.6%
Layer B (slope = 6, scenario generator)	Trigger-mediated severity given frozen priors	49.9% P(≥25%, 24mo)
Layer B (slope = 4, scenario generator)	Fast-policy-response assumption	16.7%
Layer B (slope = 8, scenario generator)	GFC-style policy-slow assumption	67.9%

Interpretation. The framework's empirical-historical anchor (Layer C) and its scenario-generator output (Layer B) are independent constructs. Both point to elevated fragility under the current configuration, but neither produces a calibrated probability forecast. The framework's substantive contribution is channel identification — telling the user which pathways are most worth monitoring — and operational early-warning signals via the dashboard.

Layers should not be added; they overlap and they answer different questions. Specifically:

Layer C-unconditional history includes both trigger-driven and non-trigger drawdowns (sentiment, exogenous shocks).
Layer C-conditional is the same universe filtered by starting regime.
Layer B is trigger-mediated only; it misses sentiment-driven corrections (Q4 2018 analog) and exogenous shocks (March 2020 analog).
Layer A is co-occurrence, not severity.

Defensible composite framing:

In the available historical sample, the CAPE ≥40 regime has occurred in one completed episode (January 1999 – September 2000). Across that episode's 21 overlapping monthly starting points, 13 were followed by a peak-to-trough ≥25% drawdown within 24 months — a rolling-window coverage rate of 61.9%. This is descriptive single-episode evidence (n = 1 at the episode level), not a calibrated probability. The scenario-generator layers identify AI capex / oil-Fed / Treasury basis / yen carry as the channels most worth monitoring; Layer B at base slope generates 49.9% P(≥25%, 24mo) conditional on its frozen parameters, with slope-sensitivity range 16.7% to 67.9%. The dashboard (§12) provides selected-sample early-warning signals.

The framework's position is "elevated fragility, channels identified, monitoring discipline warranted" — not "calibrated crash probability."

8. Policy Reaction Functions

Drawdown severity depends materially on policy response. The framework treats policy effectiveness as trigger-specific — Fed liquidity facilities address basis-trade unwinds well but cannot reopen Hormuz, restore a stablecoin issuer's reserves, or repair systemic cyber damage.

8.1 Federal Reserve

Standing Repo Facility (SRF): Established July 28, 2021 by FOMC. Permanent overnight repo facility against Treasury, agency MBS, and agency debt collateral. Counterparties initially primary dealers, later expanded to depository institutions. Minimum bid rate set at or above administered rates. Reduces basis-trade unwind severity if conditions stay funding-driven rather than solvency-driven.
Discount Window and FIMA repo facility: Available to foreign central banks.
Emergency liquidity facilities (Section 13(3)): Available with Treasury approval. Historical precedents: 2008 (Bear Stearns / Maiden Lane I-III; AIG; PDCF; TALF). 2020 (PMCCF, SMCCF, MMLF, TALF 2.0, MLF, Main Street Lending Program). 2010 Dodd-Frank requires "broad-based eligibility" and Treasury Secretary approval.
Constraint: If Warsh is confirmed, the Powell-to-Warsh transition may reduce implicit "Fed put" credibility short-term. Warsh is historically viewed as more hawkish; deviation from precedent in a crisis would itself be destabilizing.

8.2 Treasury

Buyback program: Active since 2024, designed to support off-the-run Treasury liquidity.
Issuance composition management: Can shift duration mix toward T-bills to reduce price-sensitive long-end supply. Constrained by ~$10T 2026 refinancing — limited room for further T-bill shift without distorting front-end markets.

8.3 Bank of Japan

Communication channel: Can signal pause or slowdown in normalization without changing rates. Used effectively after August 2024.
JGB yield-curve operations: Available if 10-year yields rise too rapidly.
Constraint: Three dissenters (Takata, Tamura, Nakagawa) favor faster hikes; hawkish drift constrains BOJ's use of pause-as-signal.

8.4 Strategic petroleum and energy

SPR drawdowns can absorb 1-2 mb/d of supply shortfall short-term. With IEA citing 10.1 mb/d global supply fall, SPR is palliative not solution.
IEA member coordination already partially executed: March 11, 2026 IEA agreed to release 400 million barrels from emergency reserves; Birol stated IEA is "ready to dig deeper."

8.5 Regulatory forbearance and market structure

Capital rule relief for banks under stress (precedent: 2020 SLR adjustment).
Money market fund liquidity rule modifications.
LULD circuit breakers; market-wide circuit breakers at 7%, 13%, 20% S&P declines.
0DTE option position limits under SEC review.

8.6 Central bank coordination

Fed-ECB-BOE-BOJ-SNB swap lines: standing arrangements activatable on hours-not-days timing.
G7/G20 statements: signaling channel.

8.7 Trigger-specific policy effectiveness

The single biggest determinant of severity: which trigger fires governs how effective standard policy tools are. Coarse calibration of per-trigger effectiveness:

Trigger	Primary policy actor	Effectiveness	Constraint
Treasury basis unwind	Fed (SRF + 13(3)) + Treasury	High (if liquidity-only)	Inflation/political constraints on Fed expansion
Iran/Hormuz escalation	IEA / SPR / G7 / military / diplomacy	Low-medium	Physical supply constraints; SPR limits
AI capex disappointment	Fed / Treasury / regulators	Low	Earnings/valuation issue, not liquidity
Yen carry unwind	BOJ / Fed / MOF	Medium	BOJ inflation credibility constraint
AI-enabled cyber	Fed / Treasury / CISA / private sector	Low-medium	Operational recovery, not just liquidity
Private credit gating	Fed / Treasury / SEC / sponsors	Medium	Solvency vs liquidity; sponsor variation
Stablecoin de-peg	Treasury / Fed / issuers	Medium	Reserve quality and redemption capacity
Taiwan kinetic escalation	Governments / central banks	Low	Real-economy supply shock

Numerical p_eff values per trigger (base case, used in §7.5 structured MC):

Trigger	p_eff (low/base/high)
Treasury basis	0.55 / 0.70 / 0.85
Iran/Hormuz	0.10 / 0.20 / 0.35
AI capex	0.05 / 0.10 / 0.20
Yen carry	0.35 / 0.50 / 0.70
AI cyber	0.15 / 0.25 / 0.40
Private credit	0.35 / 0.50 / 0.65
Stablecoin	0.30 / 0.50 / 0.65
Taiwan	0.05 / 0.10 / 0.20

On the gap between Layer A and Layer B. P(≥2 triggers fire) = 54.6% in Layer A is higher than P(severe drawdown) = 49.9% in Layer B at base slope. Policy offsets are one contributor to that difference. The difference should NOT be interpreted as a direct estimate of "institutional response value," because it also reflects trigger identity (some firings, like minor stablecoin events, transmit little); severity weights (low-severity triggers contribute little to drawdowns); loop activation (only specific combinations engage loops); threshold choice (the ≥25% cutoff is itself a parameter); and stochastic drawdown mapping. Multiple things move at once; the gap is not a clean decomposition.

Policy effectiveness is itself a state variable that can deteriorate. Fed-Treasury coordination strain, BOJ political pressure, fiscal capacity constraints, and transition periods such as Powell-to-Warsh, if confirmed, can all reduce realized policy offset relative to base-case values.

9. Historical Analog Review

This is a qualitative review of how the framework's vocabulary maps to past episodes — not a formal backtest of signal rules. For formal dashboard threshold backtesting, see §12.

Flagging criterion: "Flagged" means that at least one of the framework's named loops or triggers would have been identifiable before the major drawdown using information available at the time. "Flagged: Yes" requires explicit loop activation pre-peak; "Flagged: Partial" requires either loop or trigger identification but with major caveats; "Miss" requires no available framework category.

Episode	Framework loop or trigger	Flagging
1998 LTCM (S&P -19.3%)	Loop E (basis trade)	Partial — loop available, but pre-OFR/FSB documentation thin
2000-02 Dot-com (-49%)	Valuation amplifier + Loop A analog	Partial — valuation extreme identified; specific loop weaker
2008 GFC (-57%)	Loop F + CRE amplifier	Yes — both pre-identified by 2007
Q4 2018 (-19.8%)	No clean trigger; valuation only	Miss — framework's main weakness for sentiment-driven
March 2020 COVID (-34%)	Exogenous, outside framework scope	Miss by design
August 2024 yen carry (-8.5%)	Trigger 4 + Loop D	Yes — trigger and loop both identifiable
2022 inflation/rate cycle (-25.4%)	Loop C analog	Partial — oil-inflation channel, but Fed reaction was the proximate cause

Framework strength: structural-vulnerability identification. Framework weakness: sentiment-driven corrections without clean triggers. Out of scope: exogenous shocks.

This is a stress-testing framework. It will produce more risk signals than realized crashes. The honest framing: if a crash occurs, it will most likely involve one or more of the documented loops; the framework does not assert a crash will occur.

10. Monitoring Window: September 2026 – February 2027

September 2026 – February 2027 is a catalyst-dense monitoring window — more high-information events cluster in this period than in adjacent periods:

Hyperscaler earnings cycles: Q3 2026 (October-November) and Q4 2026 (January-February). Both critical for AI capex justification.
2027 capex guidance: typically released at Q3 or Q4 earnings calls.
BOJ meetings: September 18, October 30, December 18, 2026; further meetings January-March 2027.
US midterm elections: November 3, 2026.
Fed leadership: assuming confirmation, Warsh would be fully bedded in by Q3 2026.
Treasury year-end refunding: completes by December 2026.
Iran war anniversaries: 6-month mark August-September 2026; 1-year mark February 2027.

Heuristic catalyst-density comparison — Sep 2026 – Feb 2027 vs. Mar – Aug 2027:

Catalyst type	Sep 2026 – Feb 2027	Mar 2027 – Aug 2027
Hyperscaler earnings (Big-Four quarters)	8	4
BOJ meetings	4	3
Fed FOMC	4	4
Major Treasury refunding/auctions	6	4
US elections / fiscal milestones	2	0
Iran war milestones	2	1
Significant 2027 capex disclosures	yes	typically not
Approximate total	~25-27	~15-17

This is a heuristic count, not a statistical ranking. Sep-Feb has roughly 1.6x the event density of the adjacent six-month window.

This is not a statistically derived peak in crash probability. The Monte Carlo operates at 6-, 12-, and 24-month horizons without temporal granularity beyond that — the simulation cannot statistically rank the Sep-Feb window against, say, Mar-Aug 2027. The framework asserts no such ranking. The window is valuable because information arrival rate is highest then — conditional probabilities update most through windows like this. The discipline is Bayesian updating of the trigger-probability vector as evidence arrives.

11. Three Conditional Severity Paths (Qualitative Scenario Weights)

Important framing. The scenario weights below are qualitative author-calibrated conditional severity paths given that a material stress episode begins. They are not unconditional 24-month probabilities, are not formally model-derived, and should not be visually mixed with the modeled probabilities in §7. They should not be summed to infer market drawdown odds and are not suitable for portfolio optimization or VaR modeling without external validation.

If a stress episode begins within the 24-month window, the conditional probability that it resolves as a rolling correction vs. a recession-bear vs. a multi-loop systemic event vs. a structural crisis is what §11 describes.

Base case path — Rolling correction (~45-55% conditional path weight, author-calibrated)

S&P 500 retraces 10-20% from current 7,400. Magnificent 7 underperforms broader market by 5-10%. Single-name AI infrastructure casualties (Oracle, possibly a second tier of AI compute companies). Private credit gating spreads to a second tier of BDCs. Volatility elevated to 25-30 range. No multi-loop engagement. Recovery 6-12 months.

Adverse case path — Recession-driven bear (~25-35% conditional path weight, author-calibrated)

Peak-to-trough decline of 25-35%. One major hyperscaler cuts capex by 20%+. Oracle credit downgraded to junk by a major agency. Multiple regional banks need intervention. Industry-wide private credit gating. Magnificent 7 underperforms by 15-20%. Recovery 18-24 months.

Severe case path — Multi-loop systemic event (~10-20% conditional path weight, author-calibrated, conditional also on policy reaction)

Peak-to-trough decline of 40-55%. Triggered by simultaneous activation of 2+ feedback loops — most plausibly Loop C (Iran/oil/Fed) plus Loop A (AI capex/IG supply) or Loop D (yen/Mag 7). Treasury market dislocation requires emergency Fed intervention. Several private credit funds wound down. Recovery 3-5 years.

Tail case path — Structural crisis (~3-7% conditional path weight, author-calibrated)

Decline of 50%+. Loss of Treasury safe-asset status, dollar reserve role, or critical infrastructure. Triggers: Taiwan conflict, Treasury market crisis with fiscal trajectory loss of confidence, systemic AI cyber event. Recovery 5-10 years.

These conditional path weights are author calibration based on historical analog episodes (§9). They are not formally model-derived. They should be read as: given that the framework's identified channels do transmit, this is the rough distribution of severity outcomes.

12. Leading Indicators Dashboard

This is the operational expression of Layer D. The dashboard has been preliminarily backtested against eight selected historical drawdown episodes 1995-2024 using FRED data; it has not been validated across the universe of all S&P 500 drawdowns above pre-specified thresholds. Results below should be read with that caveat.

12.1 Empirical backtest (preliminary)

Tested against: LTCM 1998, Dot-com 2000-02, GFC 2008, Europe 2011, Q4 2018, COVID 2020, 2022 cycle, August 2024.

VIX warning (≥25 sustained 5 trading days) — base case rule:

Episode	S&P drawdown	Pre-peak sustained crossing?	Lead time
LTCM 1998 (S&P -19.3%)	-19.3%	No (single-day touch April 27, 1998 at 26.09 but not 5d sustained)	—
Dot-com 2000-02 (-49%)	-49%	Yes (1999-10-19)	157 days
GFC 2008 (-57%)	-57%	Yes (2007-08-15)	55 days
Europe 2011 (-19%)	-19.4%	No	—
Q4 2018 (-19.8%)	-19.8%	No (sentiment-driven, fast)	—
COVID 2020 (-34%)	-33.9%	No (exogenous)	—
2022 cycle (-25.4%)	-25.4%	Yes (2021-12-06)	28 days
August 2024 (-8.5%)	-8.5%	No (fast-acting)	—

3 of 8 episodes detected pre-peak with 28-157 day lead time. Missed: LTCM 1998 (VIX touched 26 but did not sustain), Europe 2011 (sovereign-debt stress played out below 25-VIX threshold), Q4 2018 (sentiment), COVID 2020 (exogenous), August 2024 (fast-acting).

False positive analysis 1995-2024: 80 sustained-crossing events 1995-2024 (separated by ≥30 days for distinct-event counting); 42 inside episode windows; 38 false positives. Over 30 calendar years, that is ≈1.27 false positives per year.

Threshold and duration sensitivity:

Threshold	Duration	Episodes detected pre-peak	Median lead (d)	FP/year
22.5	3	5/8	57	1.87
22.5	5	4/8	43	1.73
22.5	7	3/8	53	1.57
25.0	3	3/8	57	1.43
25.0	5	3/8	55	1.27
25.0	7	1/8	53	0.90
27.5	3	2/8	42	0.93
27.5	5	0/8	—	0.50
30.0	5	0/8	—	0.27

The sensitivity table makes clear that lowering the threshold (22.5 instead of 25) detects more episodes but at materially higher FP rate. Raising it (27.5 or 30) sacrifices nearly all pre-peak signal. 25.0 sustained 5 days is a defensible compromise but is not magic.

10y Treasury warning (level ≥ 4.75%): detected in 3 of 8 episodes (LTCM, dot-com, GFC) with 107-270 day lead time; 5 missed. The level threshold is too restrictive — it fires only when nominal rates are in a high regime. A 30-day rate-of-change variant would generalize better; flagged for future revision.

WTI ≥ $105 (Brent ~$110 proxy): detected in 1 of 8 episodes (Europe 2011 at 28 days lead); generally a structural threshold not a leading indicator.

HY OAS ≥ 4.5%: the ICE BofA US HY OAS series (FRED BAMLH0A0HYM2) extends back to December 31, 1996, so this can in principle be backtested. The threshold of 4.5% was crossed in 1998 (LTCM), 2000-02 (dot-com), 2008-09 (GFC), 2011 (Europe), 2015-16 (energy), 2020 (COVID), 2022 (cycle). Lead-time sustained-crossing tests are flagged for full validation; preliminary inspection suggests 5 of 8 episodes detected but with longer lead times (90-300 days) and higher FP rate than VIX.

Dashboard validation summary (preliminary):

Indicator	True positives	False positive rate	Recommendation
VIX ≥25 sustained 5d	3 of 8	~1.27/year	KEEP — best-performing selected-sample indicator
10y Treasury ≥4.75%	3 of 8	~0.3/year	REVISE — restrictive level; use 30d rate change
WTI ≥$105 (Brent proxy)	1 of 8	~0.1/year	KEEP as structural signature
HY OAS ≥4.5%	5 of 8 (preliminary)	TBD	INCLUDE; full validation pending

This is "preliminarily backtested on selected historical episodes," not "empirically validated." A full backtest across all S&P 500 drawdowns ≥10%, ≥15%, ≥20%, ≥25%, and ≥30% in the FRED VIX era (1990-2026) with precision/recall and full lead-time distribution is the next iteration.

12.2 Tier 1 — Daily / weekly indicators

Indicator	Source	Current (May 8-11, 2026)	Warning	Action	Cadence
MOVE Index	Cboe / ICE BofAML	~67	>130 sustained 5d	>150	Daily
VIX	Cboe / FRED	17.19 (May 8)	≥25 sustained 5d	≥35	Daily
VIX9D / VIX30	Cboe	normal	inverted >3d	inverted >7d	Daily
Brent crude	ICE	$101.29 (May 8)	>$110	>$125	Daily
HY OAS	FRED `BAMLH0A0HYM2`	2.81% (May 8, 2026)	≥4.5%	≥5.5%	Daily
JPY/USD 3m basis	Bloomberg / OFR	tight	widening >20bp	>40bp	Weekly
10y Treasury	FRED `DGS10`	4.38% (May 8, 2026)	>4.75% or +50bp/30d	>5.0%	Daily

12.3 Tier 2 — Weekly / monthly positioning and structural

Indicator	Source	Action threshold	Cadence
Hyperscaler FCF/capex	Company filings	2 quarters of widening gap	Quarterly
BDC NAV discount	NAV reports	>10% sustained	Weekly
JGB 10y	Bloomberg	>2.5%	Daily
Treasury auction tail	Treasury Direct	2 consecutive sub-2.4x bid-cover	Per auction
Oracle CDS	Markit	>150 bp	Weekly
Brent curve backwardation	ICE	Steepening trend	Weekly

12.4 Tier 3 — Quarterly structural

Indicator	Source	Action threshold
Hyperscaler revenue/capex	Company filings	Below 1.0 sustained
Bank stress test	Fed	Regional CRE failures
Moody's BDC outlook	Moody's	Continued Negative
Private credit defaults	Fitch / Moody's	>12%
Coordinated central bank pivot	Statements	Joint dovish action

12.5 Dashboard interpretation — threshold cluster to scenario mapping

Cluster pattern	Scenario interpretation	Indicated drawdown range
0 tier-1 warnings, 0 tier-2 warnings	Base case stable	≤10% rolling
1 tier-1 warning	Single-vector elevated risk	10-15% near-term plausible
2+ tier-1 warnings OR 1 tier-1 action	Multi-vector stress emerging	15-25%; tactical hedging
2+ tier-1 actions OR 2+ tier-2 actions	Active stress, multi-loop possible	25-35% Adverse range; defensive review
3+ tier-1 actions + tier-2 confirmations	Severe-case territory	35-55% Severe range; full defensive review
Multi-tier across all categories + CB intervention	Tail-case territory	50%+; capital preservation review

12.6 Update protocol

Daily review: Tier 1 indicators against thresholds.
Weekly review: All Tier 1 + Tier 2 plus narrative news.
Monthly review: Full dashboard + framework parameter reassessment.
Quarterly review: Backtest of dashboard signals against realized market behavior; threshold recalibration if signal-noise has shifted.

The dashboard is a live monitoring tool, not a one-time snapshot. Its utility comes from disciplined application over time.

13. Falsification and Bull-Case Upgrade Criteria

Reframed as Bayesian posterior updating: framework parameters revise as evidence accumulates, with explicit decision thresholds. Brier Skill Score against trigger-specific climatology serves as the calibration discipline.

13.1 Trigger marginal probability updating

Each trigger marginal probability is reviewed quarterly. If observed conditions move materially away from the anchor (§5), the marginal is revised.

AI capex risk downgraded if:

Q4 2026 hyperscaler 2027 capex guidance materially above $725B with documented revenue backing; AND
OpenAI revenue growth at or above 50% annualized through 2026 with verified enterprise conversion; AND
Oracle credit rating maintained at Baa2 or above through 2027; AND
Capex-to-FCF ratios stabilizing at Big-Four hyperscalers.

Treasury / repo basis trade risk downgraded if:

SEC central clearing transition (Dec 31, 2026 for cash; Jun 30, 2027 for repo) completes without disruption; AND
2-year period without Treasury auction tail or repo funding spike.

Yen carry risk downgraded if:

BOJ pause or reversal of normalization through 2026-2027; AND
USD/JPY stable in 145-160 range without 5%+ moves; AND
Japan-US rate differential stable.

Iran/oil amplifier downgraded if:

Hormuz reopening with traffic returning to >70% of pre-conflict levels; AND
Brent sustained below $75 for >3 months; AND
Stated Iran-US ceasefire holding for >6 months.

13.2 Framework-wide downgrade conditions

Observed (24-month rolling)	Framework adjustment
0 triggers fire, all amplifiers stable	Marginals reduced 20-30% across the board
0 triggers fire, amplifiers materially soften (CAPE <30, Buffett <180, VIX <14 sustained)	Marginals reduced 40-50%; correlation reduced
1-2 triggers fire but resolve without contagion	Modal case; no substantial framework change
2+ triggers fire with multi-loop engagement	Framework supported for stress-period channel identification; calibration sharpens

The Layer C regime-conditional finding (§7.3) is itself empirical and does not "falsify" in the model-output sense — but the realization in this cycle either confirms the regime base rate or supplies new evidence to revise it.

13.3 Calibration scoring: Brier Skill Score

Brier Skill Score against trigger-specific climatology:

BSS_trigger = 1 − BS_model / BS_climatology

BSS > 0: framework improves on the base-rate forecast for that trigger.
BSS = 0: framework is equivalent to always predicting the base rate.
BSS < 0: framework is worse than the base rate and should be recalibrated.

Computed per trigger, not as composite — trigger base rates differ by orders of magnitude.

Forecast-horizon scoring rule. The framework forecasts 12-month event probabilities. Quarterly score evaluations convert to quarterly hazards before scoring: p_quarter = 1 − (1 − p_12m)^(3/12).

Operational scoring table. Because each trigger has a different natural base rate and no common event universe, scoring is trigger-specific. The initial climatology proxy below is the frozen 12-month base marginal from §5; quarterly scoring converts it to a quarterly hazard using the formula above. Dashboard updates should move the live posterior only within each trigger's low/base/high range unless the forecast log records a structural recalibration.

Trigger	12m climatology proxy / initial prior	Low-base-high range	Dashboard update rule for live posterior	Scoring horizon
AI capex disappointment	25%	15 / 25 / 40%	Move toward high if hyperscaler FCF/capex gap widens for 2 quarters or Oracle CDS/rating stress worsens; move toward low if §13.1 downgrade criteria hold	Quarterly hazard from 12m prior
Iran/Hormuz escalation	30%	20 / 30 / 45%	Move toward high if Brent >$110 or further kinetic escalation occurs; move toward low if Hormuz reopening and Brent/ceasefire criteria hold	Quarterly hazard from 12m prior
Treasury basis unwind	12%	8 / 12 / 20%	Move toward high if MOVE >130 sustained, auction tails cluster, or repo funding spikes; move toward low after clearing transition proceeds without funding stress	Quarterly hazard from 12m prior
Yen carry unwind	18%	10 / 18 / 30%	Move toward high if JPY basis widens >20 bp, BOJ hawkishness rises, or USD/JPY moves abruptly; move toward low if BOJ pause/stability criteria hold	Quarterly hazard from 12m prior
AI-enabled cyber incident	10%	5 / 10 / 20%	Move toward high after material financial-sector cyber disclosure or shared-infrastructure incident; move toward low after four clean quarters without financial-sector severity	Quarterly hazard from 12m prior
Private-credit gating	15%	8 / 15 / 25%	Move toward high if BDC discounts widen, redemption queues rise, or defaults >12%; move toward low if redemptions normalize and defaults decline	Quarterly hazard from 12m prior
Stablecoin de-peg with contagion	5%	2 / 5 / 12%	Move toward high if reserve quality, redemption, or bill-market stress appears; move toward low after stable reserve disclosures and no de-peg stress	Quarterly hazard from 12m prior
Taiwan kinetic escalation	9%	4 / 9 / 15%	Move toward high after direct military incident or materially higher clash-market pricing; move toward low if gray-zone activity de-escalates and ODNI-style assessment remains benign	Quarterly hazard from 12m prior

Rare-event guardrails. For triggers with annual base rates below ~10% (Taiwan, stablecoin, AI cyber), four-quarter samples have very low statistical power. BSS is read as a diagnostic rather than an automatic recalibration trigger for these. Rolling 3-year and 5-year calibration windows are weighted more heavily.

Decision rules:

For common-base-rate triggers (annual base rate >15%): if BSS < 0 over 4 consecutive quarters, revise marginal toward base rate.
For rare-event triggers: BSS is diagnostic only; do not over-react to no-event quarters.
If BSS < 0 sustained over 4 quarters for ≥3 common-base-rate triggers, framework requires structural recalibration.

Log score as complementary diagnostic, computed as −Σ y_i log(p_i) − (1 − y_i) log(1 − p_i). Log scores penalize highly confident wrong predictions more than Brier scores; informative when the framework moves marginal probabilities far from base rate.

13.4 Auditable forecast log

The framework maintains a quarterly forecast log to make Bayesian updating auditable:

Date	Trigger	Prior probability	Evidence update	New probability	Reason
2026-05-12	(baseline as published)	—	—	see §5	Initial publication
2026-08 (Q3 review)	—	—	—	—	First scheduled review

This table is the framework's single source of truth for revisions and should be appended (not edited) at each review.

13.5 Out-of-scope events

The framework cannot anticipate exogenous shocks (March 2020 analogs) or sentiment-driven corrections without clean trigger mapping (Q4 2018). Realized outcomes in these categories should not be used to falsify or validate the framework; they are outside its scope. The dashboard backtest in §12.1 confirms this: VIX missed Q4 2018 (sentiment) and COVID (exogenous) by design.

13.6 Bull-case upgrade criteria

Symmetric with §13.1-13.2. The following observations should upgrade the bull case (subjective adjustment of approximately 20-30% reduction in tail-loss probability mass):

Strong bull case upgraded if:

Q2-Q4 2026 earnings beat rates remain ≥80% with positive guidance revisions; AND
AI capex / FCF ratios stabilize across Big-Four hyperscalers; AND
Iran-Hormuz partial reopening with sustained Brent < $85; AND
BOJ communicates pause / slowed normalization; AND
HY CDX, MOVE, VIX, and JPY basis swaps all return to/stay in normal ranges.

If 4 of 5 conditions hold, shift base-case scenario weight from ~50% to ~65-70%, reduce severe-case weight from ~15% to ~6-8%, and lengthen the monitoring cadence from daily-tier-1 to weekly-tier-1.

This makes the framework symmetric — capable of being shifted by positive evidence as well as negative — rather than only falsifiable.

14. Conclusion

The framework's central empirical claim, restated with appropriate uncertainty:

In the available monthly series (Robert Shiller through September 2023, extended through May 2026 with Multpl/GuruFocus readings), only two distinct episodes have reached CAPE ≥40: January 1999 – September 2000 (peak 44.19 in December 1999) and March 2026 – present. Across the 21 monthly observations of the 1999-2000 episode, 13 were followed by ≥25% peak-to-trough drawdowns within 24 months — a rolling-window coverage rate of 61.9%. Because these 21 monthly observations are highly overlapping within a single completed historical episode (n = 1 at the episode level), this is a descriptive historical stress signal, not a calibrated independent-event probability or confidence-bounded forecast. Current CAPE is in the 40.3-42.2 range (early May 2026, provider-dependent).

What the framework claims:

Layer C-conditional history (the strongest empirical anchor): In the available sample, the only completed CAPE ≥40 regime (January 1999 – September 2000) had a 61.9% rolling-window coverage rate of ≥25% drawdowns within 24 months. n = 1 at the episode level. No statistically meaningful confidence interval is reported, for the reasons documented in §7.3 and Appendix K.
Layer B scenario generator at base slope: Conditional on the frozen parameter set, the scenario-generator output is P(trigger-mediated ≥25% drawdown, 24 months) = 49.9% at slope = 6. This is a scenario output, not a forecast. Slope sensitivity 16.7% (slope=4) to 67.9% (slope=8). The framework's value here is channel identification — AI capex / oil-Fed / Treasury basis / yen carry are the channels most worth monitoring.
Layer A scenario generator: Conditional on the frozen parameter set, P(≥2 modeled triggers in 24 months) = 54.6%. The factor-conditional uplift captures cluster behavior: iran_hormuz firing raises basis_trade probability 1.53×; ai_capex firing raises taiwan probability 1.80×.
Dashboard (selected-sample backtest): VIX warning (≥25 sustained 5+ days) fires before peak in 3 of 8 selected historical drawdown episodes with 28-157 day lead time. False-positive rate ~1.27 per year under the paper's event-counting convention. The rule detects structural drawdowns; it misses sentiment-driven (Q4 2018), exogenous (COVID), and fast-acting (August 2024) drawdowns by design. This is selected-sample backtested, not universally validated.
Prediction-market evidence (Appendix H, May 11-12, 2026 snapshots): AI bubble burst 2026 at 16-28% across market configurations; China-Taiwan military clash before 2027 at 9-12%. Used as low-grade triangulation, not calibration.

What the framework does NOT claim:

The framework does NOT claim a calibrated crash probability. The Layer C 61.9% is descriptive history from one completed episode; the Layer B 49.9% is a scenario output conditional on author-chosen parameters. Neither is an empirically estimated probability of a future market event.

The framework does NOT claim the regime-conditional rate is causal. We cannot separate "high CAPE causes drawdowns" from "the specific episode coinciding with high CAPE." The robust statement is: the only previous time the valuation regime matched today's CAPE ≥40 regime, the subsequent path was a substantial drawdown across nearly every monthly entry point in that episode.

The framework does NOT assert the September 2026 – February 2027 window is the highest-probability crash period within the 24-month horizon. It is catalyst-dense in heuristic terms — earnings, BOJ meetings, midterms, refunding, war anniversaries — but the simulation has no sub-annual temporal resolution and the framework asserts no statistical ranking.

The framework does NOT provide a basis for directional positioning advice. Author priors on trigger marginals (§5) are anchored to evidence but not estimated from history; factor loadings, severity weights, and the slope parameter are scenario assumptions, not estimated coefficients.

What the framework supports as risk-management practice:

Stress-testing portfolios under adverse and severe drawdown scenarios — qualitatively, 25-35% (Adverse path) and 40-55% (Severe path), with conditional-path weights documented in §11. The Layer C historical anchor and the Layer B scenario-generator output both point to elevated fragility; both have material uncertainty; and the right use is stress-testing, not directional positioning.
A risk manager using this framework would test whether liquidity reserves remain adequate under 30-40% equity drawdown scenarios and under private-credit gating assumptions.
Concentration risk warrants explicit stress-testing. With official SSGA SPY top-10 weight at 39.28% and Mag 7 share-class aggregate at 34.69%, cap-weighted vs. equal-weighted broad exposure, jurisdictional diversification, and US-centric concentration are appropriate subjects of stress-test attention.
A live monitoring dashboard (§12, selected-sample backtested) is the operational expression. VIX, 10-year Treasury, HY OAS, and Brent are the FRED-accessible Tier 1 indicators with selected-sample backtest support. Augment with MOVE, JPY basis, Oracle CDS, and BDC NAV discounts where commercial data is available.
Trigger marginal priors are updated quarterly (§13). Real-time prediction-market evidence (Appendix H) is used as triangulation, not calibration. The forecast log (§13.4) makes the framework's revisions auditable.

The paper's contribution is a systemic-risk monitoring framework for high-valuation markets, anchored by descriptive regime-conditional history and operationalized via a selected-sample-backtested dashboard. The framework identifies elevated fragility under high-valuation conditions and maps plausible catalyst pathways. The CAPE result is a high-signal historical screen, not a calibrated independent-event forecast. The scenario-generator layers are useful for channel identification and stress-testing, but should not be read as quantitative probability estimates.

The honest summary: the only previous time markets started from CAPE ≥40, the subsequent path was a substantial drawdown across nearly every monthly entry point in that episode (n = 1 at the episode level); the eight-trigger framework identifies the most likely transmission pathways for today's configuration; the dashboard provides selected-sample-backtested early-warning signals. This is not investment advice. The author is not a registered investment adviser. Readers should consult appropriate fiduciaries for portfolio decisions.

Appendix A: AI Capex Reconciliation

The $650B / $725B / $800B AI capex range used in §4.1. Source-type column distinguishes filing-backed vs. management-guided vs. analyst-estimated vs. author-constructed numbers.

Company	2025 actual (AI/DC, $B)	2026 guidance/estimate (AI/DC, $B)	Total capex 2026 (all-in, $B)	Source type
Microsoft	~80	$120B+ FY26 (company guidance); ~$190B C2026 (analyst)	~$210-220	Company guidance + analyst extension
Alphabet	~75	$180-190B (raised from $175-185B, Q1 call)	~$200-210	Earnings-call guidance
Amazon	~83	~$200B (Feb 2026 guidance)	~$220-240	Earnings-call guidance
Meta	~72	$125-145B (raised from $115-135B)	~$135-150	Earnings-call guidance
Big-Four subtotal	~310	~$650-725B	~$765-820	Aggregated
Oracle	~25	~$50B+ (Futurum/Introl)	~$50-60	Company commentary + analyst
CoreWeave	~20	~$30B	~$30-35	IPO disclosures + analyst
Chinese hyperscalers (Alibaba, Tencent, ByteDance)	~40	~$60-80B	~$80-100	Analyst (Bloomberg / Reuters / 21st Century Business Herald); not filing-backed
Global subtotal	~395	~$820B	~$925-1015	Aggregated

Definitions:

AI/DC capex = portion of total capex attributable to AI infrastructure (servers, GPUs, AI-dedicated data centers, networking, power).
Total capex 2026 = full company capex including non-AI components.

Source-type categories:

Company filing: SEC filings (10-K, 10-Q, 8-K).
Earnings-call guidance: stated by management on quarterly calls; management-attributable but not formally binding.
Analyst estimate: published by sell-side or buy-side analysts (Bridgewater, CreditSights, Tom's Hardware, Yahoo).
Constructed estimate: built from cross-source triangulation; not from a single primary document. Treat as judgmental order-of-magnitude.

Reconciliation of headlines:

Headline	Status
$650B (Bridgewater, Feb 23, 2026)	Directly sourced analyst estimate — four Big Tech firms
$725B (Big-Four consensus, post-Q1 2026)	Constructed/consensus from post-earnings guides
$750B (CreditSights, April 2026)	Constructed/analyst — five hyperscalers including Oracle
$800B (global)	Order-of-magnitude estimate including Oracle, CoreWeave, Chinese hyperscalers

The single most directly sourced number is Bridgewater's $650B (Big-Four scope, February 2026); higher figures reflect post-Q1 2026 upward revisions and broader scope inclusion.

Appendix B: Trigger Marginal Probability Evidence

Each marginal probability in §5 with timestamp, source, conversion method, and reproducibility flag. The base probabilities listed are author priors informed by these anchors; they are not mechanically derived from the anchors.

B.1 AI capex disappointment (base = 25%)

Anchor	Source	Date	Implied probability	Reproducibility
NVDA option left-tail skew	CBOE NVDA puts 12mo ATM-20%	May 8, 2026	P(-20%+ move) ≈ 18-22%	Proprietary (Bloomberg surface)
AI bubble Polymarket	"AI bubble burst in 2026?"	May 11-12, 2026	16-28% across configurations	Public, direct; multi-outcome market
Sector re-rating historical	GS sector rotation, 1980-2020	n.a.	30%+ re-rating from peak in 24mo ≈ 35-45%	Indirect; paywalled GS
Burry Substack positions	Cassandra Unchained	May 10, 2026	Notional $1.1B short via NVDA + PLTR puts (qualitative signal)	Public, direct

B.2 Iran/Hormuz escalation (base = 30%)

Anchor	Source	Date	Implied probability	Reproducibility
Polymarket peace by May 31	"US-Iran peace deal by May 31, 2026?"	May 11-12, 2026	13-28% across configurations	Public, direct
Polymarket Hormuz reopen	"Strait of Hormuz traffic returns to normal by May 15?"	May 11-12, 2026	~0.7%	Public, direct
Polymarket Iran uranium	"Iran agrees to end uranium enrichment by June 30?"	May 11-12, 2026	25-28%	Public, direct
Brent $120 call skew	ICE Brent 6mo $120 strike	April 2026	P(≥$120 6mo) ≈ 18-28%	Proprietary (ICE)
IEA "largest disruption in history"	IEA OMR April 14, 2026	April 14, 2026	qualitative	Public, direct

B.3 Treasury basis trade unwind (base = 12%)

Anchor	Source	Date	Implied	Reproducibility
OFR Brief 26-01	OFR	March 3, 2026	75% non-cleared hedge-fund Treasury repo	Public, direct
FSB repo report	FSB	February 4, 2026	~$16T outstanding; ~70% zero-haircut non-cleared	Public, direct
SRF-era historical	Fed record	2021-2026	0 realized	Public, historical
Pre-SRF historical	LTCM, GFC, March 2020	1998-2020	~1 per 5-10 years	Public, historical
MOVE index	Cboe MOVE	May 8, 2026	67-72 (well below stress threshold)	Public, direct

B.4 Yen carry unwind (base = 18%)

Anchor	Source	Date	Implied	Reproducibility
USD/JPY 1y implied vol	CME / Bloomberg	May 8, 2026	P(10%+ JPY appreciation 12mo) ≈ 12-18% (lognormal flat-vol approximation, NOT risk-neutral)	Proprietary; illustrative
August 2024 precedent	Public market data	August 5, 2024	~5% position unwind → 12.4% Nikkei single-day drop	Public, historical
BOJ April 28 dissent	BOJ statement	April 28, 2026	3 of 9 board members vote for 1.0%	Public, direct

B.5 AI-enabled cyber incident (base = 10%)

Anchor	Source	Date	Implied	Reproducibility
SEC Item 1.05 cumulative count	Debevoise tracker	April 10, 2026	~44 filings since Dec 2023; ~6 financial-sector	Public, direct
IMF May 2026 blog	Adrian/Gaidosch/Ravikumar	May 7, 2026	Qualitative: above historical baseline	Public, direct
Anthropic Project Glasswing	anthropic.com	April 7, 2026	Capability claim (self-reported "thousands of zero-days"), not probability	Public, direct

B.6 Private credit fund forced gating (base = 15%)

Anchor	Source	Date	Implied	Reproducibility
Fitch default rate	Reuters / Fitch	March 6, 2026	9.2% (2025) — record in Fitch PMR series	Public, direct
FSB private credit report	FSB	May 6, 2026	$1.5-2T market; ~$220B bank exposure	Public, direct
Historical gating events	BREIT 2022-23, Bear/Lehman 2007-08	2007-2023	~1 per 3-5 years in stress	Public, historical
BCRED Q1 2026	BCRED filings + CNBC + Bloomberg + AltsWire	Q1 2026	$3.8B (7.9% NAV); honored with cap raise + $400M injection	Public, direct
BlackRock HPS Q1 2026	Bloomberg	March 11, 2026	$1.2B (9.3% NAV) requested; $620M paid	Public, direct
Cliffwater Q1 2026	Investment Executive	March 2026	14% of NAV; capped	Public, direct
Moody's BDC outlook	Moody's (Reuters)	April 7, 2026	Cut to Negative	Public, direct

B.7 Stablecoin de-peg with contagion (base = 5%)

Anchor	Source	Date	Implied	Reproducibility
UST/Luna 2022	Public on-chain + market	May 2022	Full de-peg; contagion contained to crypto	Public
USDC March 2023	Coinbase / Circle	March 2023	Brief de-peg, recovered within 48h	Public
Tether Q1 2026 attestation	BDO Italia	end-March 2026	$117B direct T-bills; $141B incl. indirect	Public, direct
Stablecoin market cap	DefiLlama	May 10, 2026	$322.7B total; $189.6B USDT	Public, direct

B.8 Taiwan kinetic escalation (base = 9%)

Anchor	Source	Date	Implied	Reproducibility
Polymarket China invade June 2026	Polymarket	May 11-12, 2026	~0-1.8%	Public, direct
Polymarket China invade end-2026	Polymarket	May 11-12, 2026	~7%	Public, direct
Polymarket China-Taiwan military clash before 2027	Polymarket	May 11-12, 2026	~9-12%	Public, direct
Polymarket China invade end-2027	Polymarket	May 11-12, 2026	~17-18%	Public, direct
Bloomberg Economics scenario	Bloomberg	February 10, 2026	$10.6T cost / 9.6% global GDP (magnitude, not probability)	Public, direct
ODNI Annual Threat Assessment	ODNI	mid-March 2026	"PLA likely lacks plans for 2027 offensive or sooner"	Public, direct

B.9 Reproducibility summary

Public, direct, reproducible: 13 anchors (Polymarket, FRED, SEC EDGAR/Debevoise, BIS, OFR, FSB, MBA, IEA, BEA, BLS, anthropic.com, tether.io, BOJ).
Public, historical/indirect: 9 anchors (historical events, MBA maturity wall, OFR/FSB reports, ODNI ATA).
Proprietary / illustrative: 5 anchors (Bloomberg/ICE/CME option surfaces).

The base-case probabilities sit inside the cluster of empirical anchors for every trigger. They are not formally derived from them — they remain author priors informed by external evidence.

Appendix C: Yen Carry Exposure Disaggregation

"Yen carry" is not a single trade. Approximate gross exposures (order-of-magnitude only — see note below):

Investor type	Estimated gross (USD)	Funding	Likely liquidation asset
Leveraged macro funds	$100-200B	JPY-denominated repo + JPY-funded swap	US Treasuries, Mag 7
Retail FX margin (Japan)	$80-150B	JPY-margin against USD/JPY long-USD	Direct USD/JPY closeout
FX-hedged Treasury portfolios (Japanese institutional)	$250-400B (gross hedged)	JPY-denominated hedging against USD bonds	Treasuries (forced sale if hedge breaks)
Offshore dollar-funding	$50-100B	Cross-currency basis swaps	USD funding stress, basis blowout
Equity-funded through yen	$40-80B	JPY-denominated equity finance	Asian equities, Mag 7
Total approximate gross	$520-930B	Mixed	Multi-asset

The August 2024 tremor primarily affected leveraged macro fund + retail FX margin (~$200-300B subtotal), liquidating an estimated 5% of these positions. A material (>15%) unwind across all segments would imply roughly $80-150B of forced liquidation, with heaviest pressure on Treasuries (from FX-hedged portfolios) and Mag 7 (from leveraged macro and equity-funded carry).

Important note: gross exposure is not equal to forced-sale amount. Many positions can be unwound gradually; only a fraction would be forced-sold in a stress event. The August 2024 precedent suggests ~5% of relevant positions liquidating produced the -12.4% Nikkei single-day move.

Sources (order of magnitude only): Morgan Stanley, Goldman Sachs FX, Bank of Japan FSR, BIS quarterly review.

Appendix D: Code — Layer A Factor-Copula Monte Carlo

"""
Layer A: Factor-copula Monte Carlo for trigger co-occurrence.
Frozen parameter file, May 12, 2026.

Four-factor latent correlation structure:
  F1: Risk-off / global risk aversion    (affects all triggers)
  F2: Inflation / Fed                     (affects oil-driven and credit triggers)
  F3: Liquidity / funding                 (affects Treasury, repo, carry, stablecoin)
  F4: Tech / AI                           (affects AI-specific and Taiwan-via-chips)

Reproducibility: seed = 42, 200,000 trials.
Package versions: numpy 1.26+, scipy 1.12+.
"""

import numpy as np
from scipy.stats import norm

TRIGGER_NAMES = [
    "ai_capex", "iran_hormuz", "basis_trade", "yen_carry",
    "cyber", "private_credit", "stablecoin", "taiwan",
]

# 12-month marginal probabilities (frozen May 12, 2026)
P_BASE = np.array([0.25, 0.30, 0.12, 0.18, 0.10, 0.15, 0.05, 0.09])

LOADINGS = np.array([
    #  RiskOff  Inflation  Liquidity  TechAI
    [   0.40,    0.10,      0.30,      0.65],   # 0 ai_capex
    [   0.40,    0.70,      0.20,      0.10],   # 1 iran_hormuz
    [   0.45,    0.25,      0.70,      0.05],   # 2 basis_trade
    [   0.40,    0.30,      0.55,      0.30],   # 3 yen_carry
    [   0.30,    0.05,      0.20,      0.20],   # 4 cyber
    [   0.40,    0.25,      0.55,      0.20],   # 5 private_credit
    [   0.30,    0.10,      0.50,      0.15],   # 6 stablecoin
    [   0.50,    0.20,      0.20,      0.45],   # 7 taiwan
])


def scale_prob(p_12, horizon_months):
    return 1.0 - (1.0 - p_12) ** (horizon_months / 12.0)


def simulate_factor_copula(probs_12m, loadings, horizon_months, n_sims, rng):
    n_triggers, n_factors = loadings.shape
    probs = np.array([scale_prob(p, horizon_months) for p in probs_12m])
    sumsq = (loadings ** 2).sum(axis=1)
    eps_std = np.sqrt(np.maximum(1 - sumsq, 0))
    F = rng.standard_normal((n_sims, n_factors))
    eps = rng.standard_normal((n_sims, n_triggers))
    X = F @ loadings.T + eps_std[None, :] * eps
    U = norm.cdf(X)
    return (U < probs).astype(int)


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    fired = simulate_factor_copula(P_BASE, LOADINGS, 24, 200_000, rng)
    n_fired = fired.sum(axis=1)
    print(f"P(>=1)={(n_fired>=1).mean():.4f}")  # 0.7665
    print(f"P(>=2)={(n_fired>=2).mean():.4f}")  # 0.5456
    print(f"P(>=3)={(n_fired>=3).mean():.4f}")  # 0.3831
    print(f"E[N]={n_fired.mean():.4f}")          # 2.2407

Verified output (frozen file May 12, 2026, seed=42, 200K trials):

Metric	Value
P(≥1)	0.7665
P(≥2)	0.5456
P(≥3)	0.3831
P(≥4)	0.2574
P(≥5)	0.1600
E[N]	2.2407

Average implied binary pairwise correlation: 0.25. Range 0.10 to 0.42.

Appendix E: Code — Layer B Second-Stage Severity Monte Carlo

"""
Layer B: Second-stage severity Monte Carlo for trigger-mediated drawdown.
Frozen parameter file, May 12, 2026.

Takes Layer A trigger vectors and maps through:
  1. Identity-weighted standalone severity
  2. Liquidity coupling
  3. Loop activation (additive boost, capped at 1.5)
  4. Trigger-specific stochastic policy effectiveness
  5. Severity-to-drawdown mapping (slope is calibration parameter)

Reproducibility: seed = 42, 200,000 trials.
"""

import numpy as np

# Identity weights (1-5 scale)
STANDALONE_SEVERITY = np.array([3, 4, 4, 3, 4, 3, 2, 5])
LIQUIDITY_COUPLING  = np.array([3, 4, 5, 4, 4, 3, 3, 5])

# Trigger-specific policy effectiveness (base case)
POLICY_EFF_BASE  = np.array([0.10, 0.20, 0.70, 0.50, 0.25, 0.50, 0.50, 0.10])
POLICY_EFF_STD   = np.array([0.07, 0.12, 0.15, 0.18, 0.12, 0.15, 0.18, 0.07])

# Loop activation rules
LOOPS = {
    "A": {"triggers": [0],    "boost": 0.30},  # AI capex
    "B": {"triggers": [0, 5], "boost": 0.40},  # AI capex + private credit
    "C": {"triggers": [1],    "boost": 0.30},  # Iran/oil/Fed
    "D": {"triggers": [3],    "boost": 0.30},  # Yen carry
    "E": {"triggers": [2],    "boost": 0.50},  # Basis trade
    "F": {"triggers": [5],    "boost": 0.20},  # Private credit
}


def compute_severity_score(fired):
    n_sims = fired.shape[0]
    severity_matrix = fired * STANDALONE_SEVERITY
    primary = severity_matrix.max(axis=1).astype(float)
    n_fired = fired.sum(axis=1)
    additional = 0.5 * np.maximum(n_fired - 1, 0)
    coupling_sum = (fired * LIQUIDITY_COUPLING).sum(axis=1).astype(float)
    n_fired_safe = np.where(n_fired == 0, 1, n_fired)
    avg_coupling = coupling_sum / n_fired_safe
    coupling_term = 0.3 * avg_coupling * (n_fired > 0).astype(float)
    loop_boost = np.zeros(n_sims)
    for ld in LOOPS.values():
        active = np.all(fired[:, ld["triggers"]] == 1, axis=1)
        loop_boost += np.where(active, ld["boost"], 0)
    loop_boost = np.minimum(loop_boost, 1.5)
    return primary + additional + coupling_term + loop_boost


def sample_policy_offset(fired, rng):
    n_sims = fired.shape[0]
    eff = rng.normal(loc=POLICY_EFF_BASE, scale=POLICY_EFF_STD, size=(n_sims, 8))
    eff = np.clip(eff, 0.0, 0.95)
    w = fired * STANDALONE_SEVERITY
    wsum = w.sum(axis=1)
    wsum_safe = np.where(wsum == 0, 1, wsum)
    return np.where(wsum == 0, 0.0, (eff * w).sum(axis=1) / wsum_safe)


def severity_to_drawdown(net_severity, rng, slope=6.0):
    expected = np.minimum(slope * net_severity, 80.0)
    std = np.minimum(2.5 + 0.5 * net_severity, 9.0)
    realized = rng.normal(expected, std)
    return np.clip(realized, 0.0, 90.0)


def run(fired, slope=6.0, seed=42):
    rng = np.random.default_rng(seed)
    gross = compute_severity_score(fired)
    offset = sample_policy_offset(fired, rng)
    net = gross * (1.0 - offset)
    return severity_to_drawdown(net, rng, slope=slope)

Verified output (frozen file May 12, 2026, slope=6.0, seed=42):

Threshold	P(drawdown ≥ threshold over 24mo)
≥10%	0.7486
≥15%	0.7163
≥20%	0.6393
≥25%	0.4992
≥30%	0.3380
≥40%	0.1040
≥50%	0.0147

Slope sensitivity: P(≥25%) is 0.1669 (slope=4), 0.3399 (slope=5), 0.4992 (slope=6), 0.6136 (slope=7), 0.6790 (slope=8).

Trigger-count distribution sanity check: P(0) = 0.2335 = 1 − P(≥1) = 1 − 0.7665 ✓

Appendix F: Code — Layer C Historical Coverage Frequency (Monthly Shiller Data)

"""
Layer C: Monthly rolling-window drawdown coverage frequency.
Uses monthly Shiller CAPE (not annual averages).

Data:
  - Shiller monthly file: http://www.econ.yale.edu/~shiller/data/ie_data.xls
    Columns: Date (YYYY.MM), P (S&P composite monthly price), CAPE (column 12)
    Coverage: 1871-01 through 2023-09 in published Yale file.
  - Forward extension Oct 2023 - May 2026 from Multpl/GuruFocus monthly readings.

Reproducibility: any reader can re-run with file downloaded from Shiller's page.
"""

import pandas as pd
import numpy as np

# Parse Shiller's monthly file (Yale download)
raw = pd.read_excel("ie_data.xls", sheet_name="Data", header=None)
data = raw.iloc[8:].copy()
data = data[data[0].notna()].reset_index(drop=True)

def parse_date(d):
    d = float(d)
    year = int(d)
    month = int(round((d - year) * 100))
    if month == 0:
        month = 10  # 1871.10 may parse as 1871.1
    return pd.Timestamp(year=year, month=month, day=1)

shiller = pd.DataFrame({
    "date":  data[0].apply(parse_date),
    "price": pd.to_numeric(data[1], errors="coerce"),
    "cape":  pd.to_numeric(data[12], errors="coerce"),
}).dropna(subset=["price"])
shiller = shiller[shiller["date"] >= "1881-01-01"].reset_index(drop=True)

# Forward-extend Oct 2023 - May 2026 (monthly S&P close + CAPE from Multpl/GuruFocus)
post_2023 = [
    ("2023-10-01", 4275.20, 29.19), ("2023-11-01", 4467.71, 30.50),
    ("2023-12-01", 4682.85, 31.66), ("2024-01-01", 4769.83, 32.32),
    ("2024-02-01", 4953.86, 33.85), ("2024-03-01", 5170.93, 35.32),
    ("2024-04-01", 5099.20, 34.34), ("2024-05-01", 5187.70, 34.55),
    ("2024-06-01", 5360.79, 35.66), ("2024-07-01", 5538.01, 36.85),
    ("2024-08-01", 5523.18, 36.30), ("2024-09-01", 5648.40, 37.07),
    ("2024-10-01", 5762.48, 37.74), ("2024-11-01", 5917.11, 38.78),
    ("2024-12-01", 6040.53, 38.40), ("2025-01-01", 5994.57, 37.93),
    ("2025-02-01", 6037.42, 37.94), ("2025-03-01", 5705.45, 35.34),
    ("2025-04-01", 5440.42, 33.40), ("2025-05-01", 5811.07, 35.13),
    ("2025-06-01", 6022.05, 36.03), ("2025-07-01", 6244.28, 37.06),
    ("2025-08-01", 6395.78, 37.74), ("2025-09-01", 6519.45, 38.10),
    ("2025-10-01", 6720.31, 38.93), ("2025-11-01", 6850.92, 39.28),
    ("2025-12-01", 6980.43, 39.45), ("2026-01-01", 7050.81, 39.65),
    ("2026-02-01", 7140.21, 39.87), ("2026-03-01", 7195.55, 40.11),
    ("2026-04-01", 7325.92, 40.55), ("2026-05-01", 7398.93, 40.42),
]
post = pd.DataFrame(post_2023, columns=["date", "price", "cape"])
post["date"] = pd.to_datetime(post["date"])
shiller = pd.concat([shiller, post], ignore_index=True).sort_values("date").reset_index(drop=True)


def max_dd(p):
    cm = np.maximum.accumulate(p)
    return ((cm - p) / cm).max()


def regime_coverage(prices, capes, window, thresholds, cape_min=None, cape_max=None):
    """Rolling-window coverage frequency, optionally filtered by CAPE regime."""
    n = len(prices)
    dds, n_w = [], 0
    for i in range(n - window):
        c = capes[i]
        if np.isnan(c):
            continue
        if cape_min is not None and c < cape_min:
            continue
        if cape_max is not None and c >= cape_max:
            continue
        dds.append(max_dd(prices[i:i+window+1]))
        n_w += 1
    dds = np.array(dds)
    return {t: (dds >= t).mean() for t in thresholds}, n_w

Verified output (monthly 1881-2026, 1,745 observations):

Regime	N	P(≥10%)	P(≥25%)	P(≥40%)
All	1,722	63.0%	19.9%	5.2%
Low CAPE (<15)	690	59.7%	14.2%	1.0%
Medium (15-25)	775	65.0%	24.0%	7.1%
High (25-35)	208	59.6%	19.7%	13.0%
Very high (≥35)	49	91.8%	36.7%	2.0%
Extreme (≥40)	21	100.0%	61.9%	4.8%
Very extreme (≥42)	13	100.0%	69.2%	0.0%

Distinct CAPE ≥40 episodes:

Episode	Start	End	Months	Max CAPE	Max CAPE date
Dot-com	1999-01	2000-09	21	44.20	1999-12
Current	2026-03	2026-05+	3 (continuing)	40.55	2026-04

Episode 1 forward 24-month max drawdowns (all 21 monthly observations): range 10.4%-40.9%, mean 26.8%, median 29.7%, 13 of 21 ≥25%, 1 of 21 ≥40%.

Episode 2: May 12, 2026 is the publication date; all 3 starting observations have incomplete 24-month forward windows.

Stationary block bootstrap (mean block = 24 months, B = 2000):

Regime / threshold	Point	5th pct	50th pct	95th pct
CAPE ≥40, P(≥25%)	61.9%	57.7%	97.4%	100.0%
CAPE ≥40, P(≥40%)	4.8%	13.4%	74.4%	100.0%
CAPE ≥38, P(≥25%)	55.6%	63.0%	91.7%	100.0%

Non-overlapping windows (sample every 24 months): CAPE ≥40 has 1 truly independent observation (1/1 hit ≥25%, 1/1 hit ≥40%). CAPE ≥35 has 2 independent observations (1/2 hit ≥25%).

Appendix G: Code — Dashboard Threshold Backtest

"""
Dashboard backtest using FRED VIX data.

Tests VIX>=25 sustained 5 days against pre-specified historical drawdowns.
Threshold sensitivity (22.5, 25, 27.5, 30) and duration sensitivity (3, 5, 7, 10).
"""

import urllib.request, io
import pandas as pd
import numpy as np

def fetch_fred(series_id, start="1990-01-01"):
    url = f"https://fred.stlouisfed.org/graph/fredgraph.csv?id={series_id}"
    with urllib.request.urlopen(url, timeout=30) as r:
        df = pd.read_csv(io.StringIO(r.read().decode()))
    df.columns = ["date", series_id]
    df["date"] = pd.to_datetime(df["date"])
    df[series_id] = pd.to_numeric(df[series_id], errors="coerce")
    df = df.dropna()
    return df[df["date"] >= pd.to_datetime(start)].set_index("date")[series_id]

EPISODES = [
    ("LTCM 1998",            "1998-07-17", "1998-08-31", "1998-04-01"),
    ("Dot-com 2000-2002",    "2000-03-24", "2002-10-09", "1999-10-01"),
    ("GFC 2008",             "2007-10-09", "2009-03-09", "2007-01-01"),
    ("Europe 2011",          "2011-04-29", "2011-10-03", "2011-01-01"),
    ("Q4 2018",              "2018-09-20", "2018-12-24", "2018-07-01"),
    ("COVID 2020",           "2020-02-19", "2020-03-23", "2019-12-01"),
    ("2022 cycle",           "2022-01-03", "2022-10-12", "2021-10-01"),
    ("August 2024",          "2024-07-16", "2024-08-05", "2024-05-01"),
]

def detect_pre_peak(series, threshold, duration, episodes):
    for name, peak_s, _, ctx_s in episodes:
        peak = pd.to_datetime(peak_s)
        ctx = pd.to_datetime(ctx_s)
        window = series[(series.index >= ctx) & (series.index <= peak)]
        if len(window) < duration:
            yield (name, None, None); continue
        above = (window >= threshold).astype(int)
        sus = above.rolling(duration, min_periods=duration).sum() == duration
        if sus.any():
            first = sus[sus].index[0]
            yield (name, first, (peak - first).days)
        else:
            yield (name, None, None)

if __name__ == "__main__":
    vix = fetch_fred("VIXCLS", start="1990-01-01")
    print("VIX>=25 sustained 5 days:")
    for name, first, lead in detect_pre_peak(vix, 25.0, 5, EPISODES):
        if first is None:
            print(f"  {name}: not detected")
        else:
            print(f"  {name}: {first.date()}, {lead}d before peak")

Verified output: Dot-com 2000 (-49% drawdown): sustained ≥25 from 1999-10-19, 157 days before peak. GFC 2008 (-57%): sustained ≥25 from 2007-08-15, 55 days before peak. 2022 cycle (-25.4%): sustained ≥25 from 2021-12-06, 28 days before peak. Other 5 episodes not detected pre-peak.

Full-period false positive count 1995-2024: 80 sustained-crossing events (separated by ≥30 days for distinct counting); 42 inside episode windows; 38 false positives; FP rate ≈1.27/year over 30 years.

Sensitivity table: see §12.1 main body.

Appendix H: Code — Real-Time Polymarket Anchor Fetch

"""
Fetch live Polymarket prices for §5 trigger anchors.
Treats prediction-market prices as triangulation, not calibration.
"""

import urllib.request, json

POLYMARKET_API = "https://gamma-api.polymarket.com/markets"
UA = "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36"
MIN_24H_VOLUME = 1000  # liquidity filter

def fetch_universe(limit=1000):
    universe = []
    for offset in range(0, limit, 100):
        params = (f"closed=false&limit=100&offset={offset}"
                  "&order=volume24hr&ascending=false")
        req = urllib.request.Request(f"{POLYMARKET_API}?{params}",
                                     headers={"User-Agent": UA})
        with urllib.request.urlopen(req, timeout=15) as r:
            batch = json.loads(r.read().decode())
        if not batch: break
        universe.extend(batch)
    return universe

def extract_yes_price(market):
    outcomes = market.get("outcomes"); prices = market.get("outcomePrices")
    if isinstance(outcomes, str):
        outcomes = json.loads(outcomes); prices = json.loads(prices)
    if not outcomes: return None
    for o, p in zip(outcomes, prices):
        if o and o.lower() == "yes":
            return float(p)
    return None

# Usage filter example:
# universe = fetch_universe()
# liquid_ai = [m for m in universe
#              if any(kw in m.get("question", "").lower()
#                     for kw in ["ai bubble", "openai", "nvidia"])
#              and float(m.get("volume24hr") or 0) >= MIN_24H_VOLUME]

Snapshot (May 11-12, 2026):

Trigger	Polymarket question	Implied YES (range across market configs)
ai_capex	"AI bubble burst in 2026?"	16-28%
iran_hormuz	"US-Iran peace deal by May 31"	13-28%
iran_hormuz	"Iran agrees to end uranium enrichment by June 30"	25-28%
iran_hormuz	"Strait of Hormuz traffic returns to normal by May 15"	~0.7%
taiwan	"China-Taiwan military clash before 2027"	9-12%
taiwan	"Will China invade Taiwan by 2026"	7%
taiwan	"Will China invade Taiwan by end of 2027"	17-18%

Settlement-text mapping: Each Polymarket market's resolution criteria are documented on the market page. The "military clash before 2027" market resolves YES if there is any direct military exchange between PRC and ROC forces (not gray-zone activity). The "AI bubble burst" market settlement criterion depends on the specific outcome chosen (December 31, 2026 outcome is the leading one with $3M total volume).

Triggers without liquid prediction markets: basis_trade, yen_carry, cyber, private_credit, stablecoin (anchors fall back to historical base rates).

Caveat: Prediction-market prices are sensitive to liquidity, settlement wording, manipulation risk, and participant composition. Different market URLs with similar names produced materially different probabilities for the US-Iran peace and AI-bubble questions. Treat as triangulation, not calibration.

Appendix I: Source Register and Quality Grading

I.1 Source quality grades

Grade	Source type	Examples
A	Primary official	FRED, SEC, FSB, BIS, IMF, IEA, OFR, OECD, MBA, BEA, BLS, BOJ, Treasury Fiscal Data, company SEC filings
A-/B+	Market data provider	Cboe (VIX), FactSet, ICE (Brent), CME, FRED
B+	Major press with named primary source	Reuters, CNBC, Bloomberg, AP citing primary entities
B	Analyst research	Apollo Academy (Slok), Bridgewater, GS, MS, Bloomberg Economics
C+/B-	Prediction markets	Polymarket (public settlement criteria)
C	Aggregators / interpretive sites	Motley Fool, AltsWire, AhaSignals, industry summaries
(D for reproducibility)	Proprietary surfaces	Bloomberg option surfaces (acknowledged as illustrative)

I.2 Source register

ID	Source	Supports	Grade	Type	Date accessed
S1	FRED (`SP500`, `VIXCLS`, `BAMLH0A0HYM2`)	S&P 500 7,398.93 May 8; VIX 17.19; HY OAS 2.81% May 8 / 2.79% May 7	A	Primary	May 11, 2026
S2	CNBC live blogs	Intraday May 8/10/11; oil prints; cloture vote	B+	Major press / primary	May 11, 2026
S3	TheStreet / Trading Economics	Nasdaq/Russell/DJIA May 8 closes; commodity intraday	B+	Major press	May 11, 2026
S4	IEA Oil Market Report April 2026	10.1 mb/d supply fall; "largest disruption in history"	A	Primary	May 11, 2026
S5	CNBC; Atlantic Council	Birol 12 mb/d (Apr 1) → 13 mb/d (Apr 13) statements	B+	Major press / primary	May 11, 2026
S6	Reuters / CNBC / Roll Call	Powell term ends May 15; Warsh cloture May 11; vote May 12	B+	Major press / primary	May 11, 2026
S7	Bank of Japan, Statement on Monetary Policy April 28, 2026	BOJ held 0.75% in 6-3 vote with three dissenters for 1.0%	A	Primary	May 11, 2026
S8	Reuters / MSN	HSBC raised S&P target to 7,650 on May 11	B+	Major press	May 11, 2026
S8b	Benzinga / CNBC	Yardeni raised target to 8,250 on May 11; Calvasina (RBC) to 7,900 May 8; JPM 7,600 April 21	B+	Major press	May 11, 2026
S9	FactSet Earnings Insight (Butters), May 8, 2026	Q1 2026 84% beat / 18.2% surprise; forward P/E 21.0	A-	Market data	May 11, 2026
S10	Bridgewater (Jensen letter Feb 23, 2026); CreditSights April 2026	$650B four-firm 2026 AI capex; $750B five-firm	B	Analyst	May 11, 2026
S11	SSGA SPY Holdings (official)	Top 10 = 39.28%; Mag 7 share-class aggregate = 34.69% from the rounded holding table (May 8, 2026)	A-	Issuer official	May 11, 2026
S11b	Motley Fool Research (Lyle Daly)	Mag 7 = 33.7% S&P (April 14, 2026 — historical context only)	C	Aggregator	May 11, 2026
S12	BIS Bulletin 120 (Aldasoro/Doerr/Rees), bis.org/publ/bisbull120.htm, published January 7, 2026	AI financing shift to debt	A	Primary	May 11, 2026
S12b	BIS Quarterly Review March 2026 (Aldasoro/Doerr/Rees), bis.org/publ/qtrpdf/r_qt2603u.htm	AI on- and off-balance-sheet borrowing follow-on	A	Primary	May 11, 2026
S13	Reuters	Alphabet euro/yen bond market activity	B+	Major press	May 2026
S14	Apollo Academy / Torsten Slok, March 24, 2026	$14T 2026 IG supply estimate	B	Analyst	May 11, 2026
S15	OECD Global Debt Report 2026 (DOI 10.1787/e9d80efd-en)	Sovereign / corporate borrowing context	A	Primary	May 11, 2026
S16	MBA, February 9, 2026 / April 27, 2026	$875B 2026 maturities; 4.02% Q1 2026 delinquency	A	Primary	May 11, 2026
S17	OFR Brief 26-01 (Hempel/Kahn/Mann/Paddrik, March 3, 2026)	75% non-cleared hedge-fund Treasury repo	A	Primary	May 11, 2026
S18	FSB Vulnerabilities in Government Bond-backed Repo Markets, February 4, 2026	$16T repo; concentration risks	A	Primary	May 11, 2026
S19	SEC Press Release 2025-43 (Feb 25, 2025)	Treasury clearing dates Dec 31, 2026 / Jun 30, 2027	A	Primary	May 11, 2026
S20	Anthropic, April 7, 2026 (anthropic.com/glasswing)	Project Glasswing / Claude Mythos Preview	A	Primary	May 11, 2026
S21	IMF blog Adrian/Gaidosch/Ravikumar, May 7, 2026	AI-enabled cyber financial-stability framing	A	Primary	May 11, 2026
S21b	Debevoise & Plimpton tracker, April 10, 2026	~44 cumulative Item 1.05 8-Ks since Dec 2023; 6 financial-sector	B+	Law firm tracker	May 11, 2026
S22	Reuters / Fitch, March 6, 2026	US private-credit defaults 9.2% in 2025 (record in PMR series)	B+	Major press	May 11, 2026
S23	FSB Report on Vulnerabilities in Private Credit, May 6, 2026	$1.5-2T market; ~$220B bank exposure	A	Primary	May 11, 2026
S24	DefiLlama / Cryptonews May 10, 2026	Stablecoin total $322.7B; USDT $189.6B	C+	Aggregator	May 11, 2026
S25	Tether Q1 2026 attestation (BDO Italia), tether.io	$117B direct T-bills; $141B incl. indirect	B+	Issuer attestation	May 11, 2026
S26	White House / Federal Register, July 18, 2025	GENIUS Act signed	A	Primary	May 11, 2026
S27	Polymarket	Real-time prediction-market anchors	C+	Prediction market	May 11-12, 2026
S28	BEA, April 9, 2026 (third estimate)	Q4 2025 GDP +0.5% annualized	A	Primary	May 11, 2026
S29	BLS, May 8, 2026	April 2026 nonfarm payrolls +115k; unemployment 4.3%	A	Primary	May 11, 2026
S30	Shiller monthly data (Yale; multpl.com; GuruFocus)	Shiller monthly CAPE through September 2023, extended through May 2026	A	Primary plus extension	May 12, 2026
S31	Treasury Fiscal Data; JEC Debt Dashboard	$38.91T federal debt May 5, 2026	A	Primary	May 11, 2026
S32	CNBC / Bloomberg / AltsWire	BCRED Q1 2026: 7.9% NAV ($3.8B); cap 5%→7%; $400M injection	B+	Major press	May 11, 2026
S33	Bloomberg / Reuters / Alternative Credit Investor	BlackRock HPS, Morgan Stanley North Haven, Cliffwater, Blue Owl Q1 2026 stress; Moody's BDC outlook to Negative April 7, 2026	B+	Major press	May 11, 2026
S34	FAU CRE screener (Rebel Cole, Q3 2025 vintage)	59 of 155 largest banks CRE >300% equity	C	Academic screener	May 11, 2026
S35	FDIC FIL-23-2023	300% CRE/capital threshold from 2006 joint guidance	A	Primary	May 11, 2026
S36	Bloomberg Economics February 10, 2026	Taiwan war scenario $10.6T / 9.6% global GDP	B	Major press / analyst	May 11, 2026
S37	Barclays Fixed Income Research (Andrew Keches), November 11, 2025	Oracle debt to Underweight	C	Analyst (secondary press)	May 11, 2026
S38	Moody's Credit Outlook, December 15, 2025 / September 2025	Oracle Baa2, outlook Negative	A	Primary	May 11, 2026
S39	TradingKey / Foreign Policy Journal	Burry Substack May 10, 2026 disclosure ~80% Scion in NVDA + PLTR puts	B+	Major press	May 11, 2026
S40	AP via US News, August 5, 2024	Nikkei -12.40% single-day	B+	Major press	May 11, 2026

Claims that depend on lower-grade sources are flagged in-text where used. Source-type column distinguishes whether the source supports a fact (e.g., FRED for VIX), an estimate (e.g., Bridgewater for capex), or a probability (e.g., Polymarket for prediction-market priors).

Appendix J: Reproducibility Manifest

This manifest documents the data, code, and parameter file used to generate the framework's numerical outputs. Any reader can reproduce the published tables from these inputs.

J.1 Data sources and download dates

Dataset	Source URL	Download date	File / endpoint
Shiller monthly CAPE 1881-2023	http://www.econ.yale.edu/~shiller/data/ie_data.xls	May 12, 2026	`ie_data.xls` sheet "Data"
Shiller monthly CAPE Oct 2023 - May 2026 (extension)	https://www.multpl.com/shiller-pe; https://www.gurufocus.com/economic_indicators/56/sp-500-shiller-cape-ratio	May 12, 2026	Tabular series, monthly closes
S&P 500 monthly close 1881-2023	Same as Shiller `ie_data.xls` column 1	May 12, 2026	—
FRED VIX (`VIXCLS`)	https://fred.stlouisfed.org/series/VIXCLS	May 12, 2026	CSV daily 1990-01-02 to 2026-05-08
FRED HY OAS (`BAMLH0A0HYM2`)	https://fred.stlouisfed.org/series/BAMLH0A0HYM2	May 12, 2026	CSV daily 1996-12-31 to 2026-05

J.2 Frozen parameter file (May 12, 2026)

Parameter	Value
Random seed (numpy default_rng)	42
Monte Carlo trials per run	200,000
Layer A 12-month marginals (P_BASE)	[0.25, 0.30, 0.12, 0.18, 0.10, 0.15, 0.05, 0.09]
Layer A factor loadings (LOADINGS)	8×4 matrix, see Appendix D
Layer B severity weights (STANDALONE_SEVERITY)	[3, 4, 4, 3, 4, 3, 2, 5]
Layer B liquidity coupling	[3, 4, 5, 4, 4, 3, 3, 5]
Layer B policy effectiveness mean (POLICY_EFF_BASE)	[0.10, 0.20, 0.70, 0.50, 0.25, 0.50, 0.50, 0.10]
Layer B policy effectiveness std (POLICY_EFF_STD)	[0.07, 0.12, 0.15, 0.18, 0.12, 0.15, 0.18, 0.07]
Layer B base slope	6.0
Layer B slope sensitivity range	{4.0, 5.0, 6.0, 7.0, 8.0}
Layer C window sizes	{12, 18, 24, 36} months
Layer C drawdown thresholds	{10%, 15%, 20%, 25%, 30%, 40%, 50%}
Layer C CAPE regime cuts	<15, [15,25), [25,35), ≥35, ≥38, ≥40, ≥42
Bootstrap method	Stationary block bootstrap (Politis-Romano)
Bootstrap block length (mean)	24 months
Bootstrap replications	2,000

J.3 Package versions

Package	Version
Python	3.11+
numpy	1.26+
scipy	1.12+
pandas	2.0+
xlrd	2.0+ (Shiller `.xls` parser)

J.4 Script execution order

To regenerate all published tables:

Download ie_data.xls from Shiller's Yale page → place in working directory.
Run cape_step1_load.py (Appendix F base) → produces shiller_monthly.pkl and CAPE episode table.
Run cape_step2_rolling.py → produces unconditional and regime-conditional tables (§7.2, §7.3).
Run cape_step3_bootstrap.py → produces block-bootstrap CIs.
Run layer_a_mc.py (Appendix D) → produces fired_layer_a.npy and Layer A tables.
Run layer_b_mc.py (Appendix E) → consumes fired_layer_a.npy, produces Layer B tables.
Run dashboard_backtest.py (Appendix G) → produces VIX backtest table.

All scripts are deterministic given the seed and inputs.

Reproducing the §7.5 global sensitivity table. Each variant in the §7.5 table is generated by editing one parameter family in the frozen parameter file and rerunning Layers A and B with seed 42 and 200,000 trials. Specifically:

Trigger marginals ±20%: scale the P_BASE array in layer_a_mc.py by 0.8 or 1.2 (clipped to [0, 1]), rerun both Layer A and Layer B.
Factor loadings ±20%: scale the LOADINGS matrix in layer_a_mc.py by 0.8 or 1.2 (re-clipping idiosyncratic variance to [0, 1]), rerun both layers.
Policy offsets ±20%: scale POLICY_EFF_BASE in layer_b_mc.py by 0.8 or 1.2 (clipped to [0, 0.95]), rerun Layer B only (Layer A unchanged — this is why the table's P(≥2 triggers) column equals 54.6% across policy-offset rows).
Loop boosts set to zero / +25%: edit the LOOPS dictionary in layer_b_mc.py to multiply each boost value by 0.0 or 1.25, rerun Layer B only (same Layer A invariance).

Each sensitivity run takes approximately the same wall-clock time as the base run. The published table is the deterministic output of these edits; no additional code changes are required.

J.4a Forward-extension table (October 2023 – May 2026)

The post-Shiller extension is reproduced here so the CAPE-regime filter is auditable without opening the code block in Appendix F. Values are the monthly price/CAPE pairs used in the paper's rolling-window computation.

Tiebreaking rule between providers. Two providers were consulted for the post-Shiller extension: Multpl (multpl.com/shiller-pe) and GuruFocus (gurufocus.com/economic_indicators/56/sp-500-shiller-cape-ratio). Where the two providers agreed on a monthly reading (rounded to two decimals), that value was used. Where they differed by less than 0.10 CAPE points (the typical case), the Multpl reading was used as the primary because Multpl's series anchors to month-end closes consistent with Shiller's published methodology. Where they differed by 0.10 or more (rare — observed in only 2 of 33 monthly observations, both in highly volatile months), the lower of the two readings was used to avoid upward bias in the regime filter. The S&P 500 price input column uses the official month-end S&P 500 close from CBOE/S&P Dow Jones Indices and does not depend on provider choice. No tiebreaking decision changed which months crossed the CAPE ≥40 regime threshold.

Month	S&P price input	CAPE input
2023-10	4,275.20	29.19
2023-11	4,467.71	30.50
2023-12	4,682.85	31.66
2024-01	4,769.83	32.32
2024-02	4,953.86	33.85
2024-03	5,170.93	35.32
2024-04	5,099.20	34.34
2024-05	5,187.70	34.55
2024-06	5,360.79	35.66
2024-07	5,538.01	36.85
2024-08	5,523.18	36.30
2024-09	5,648.40	37.07
2024-10	5,762.48	37.74
2024-11	5,917.11	38.78
2024-12	6,040.53	38.40
2025-01	5,994.57	37.93
2025-02	6,037.42	37.94
2025-03	5,705.45	35.34
2025-04	5,440.42	33.40
2025-05	5,811.07	35.13
2025-06	6,022.05	36.03
2025-07	6,244.28	37.06
2025-08	6,395.78	37.74
2025-09	6,519.45	38.10
2025-10	6,720.31	38.93
2025-11	6,850.92	39.28
2025-12	6,980.43	39.45
2026-01	7,050.81	39.65
2026-02	7,140.21	39.87
2026-03	7,195.55	40.11
2026-04	7,325.92	40.55
2026-05	7,398.93	40.42

J.5 Exact list of CAPE ≥40 monthly starting dates

1999-01-01  1999-02-01  1999-03-01  1999-04-01  1999-05-01  1999-06-01
1999-07-01  1999-08-01  1999-09-01  1999-10-01  1999-11-01  1999-12-01
2000-01-01  2000-02-01  2000-03-01  2000-04-01  2000-05-01  2000-06-01
2000-07-01  2000-08-01  2000-09-01     # Episode 1 ends (21 months)
2026-03-01  2026-04-01  2026-05-01     # Episode 2 ongoing as of publication

J.6 Episode 1 (1999-2000) forward 24-month max drawdown outcomes

Full 21-row output from cape_step2_rolling.py, rounded to one decimal point:

Start month	Forward 24m max drawdown	≥25%?
1999-01-01	10.4%	No
1999-02-01	12.1%	No
1999-03-01	20.2%	No
1999-04-01	20.2%	No
1999-05-01	20.2%	No
1999-06-01	20.2%	No
1999-07-01	20.2%	No
1999-08-01	20.7%	No
1999-09-01	29.7%	Yes
1999-10-01	29.7%	Yes
1999-11-01	29.7%	Yes
1999-12-01	29.7%	Yes
2000-01-01	29.7%	Yes
2000-02-01	29.7%	Yes
2000-03-01	29.7%	Yes
2000-04-01	29.7%	Yes
2000-05-01	29.7%	Yes
2000-06-01	31.7%	Yes
2000-07-01	39.2%	Yes
2000-08-01	39.2%	Yes
2000-09-01	40.9%	Yes

Mean 26.8%, median 29.7%, range 10.4%-40.9%. 13 of 21 ≥25%; 1 of 21 ≥40%.

J.7 Publication record

Date	Status
May 12, 2026	Working paper, first public version.

Appendix K: Statistical Caveats — Why the Bootstrap CI Is Not Reported

This appendix documents in full why the paper does not report a confidence interval for the Layer C regime-conditional drawdown coverage rate at CAPE ≥40.

K.1 The bootstrap output

For completeness, the stationary block-bootstrap output (mean block length = 24 months, 2,000 replications, applied to the 1881-2026 monthly series, consisting of Shiller through September 2023 plus the documented extension) is:

Regime	Threshold	Point estimate	5th pct	50th pct	95th pct
CAPE ≥ 35	≥25%	36.7%	60.7%	85.1%	100.0%
CAPE ≥ 38	≥25%	55.6%	63.0%	91.7%	100.0%
CAPE ≥ 40	≥25%	61.9%	57.7%	97.4%	100.0%
CAPE ≥ 35	≥40%	2.0%	38.6%	77.0%	100.0%
CAPE ≥ 40	≥40%	4.8%	13.4%	74.4%	100.0%

The naive reading would be to present the [57.7%, 100%] interval (CAPE ≥40 / ≥25%) as a "90% confidence interval." That reading is wrong. The interval does not have a frequentist confidence-interval interpretation.

K.2 The diagnostic that exposes the problem

In a well-behaved bootstrap, the point estimate should fall inside the bootstrap distribution — typically near the median. Here, for the CAPE ≥35 regime, the bootstrap 5th percentile (60.7%) lies above the point estimate (36.7%). The 5th percentile being above the point estimate is not a quirk; it is a signature of bootstrap failure. The same pathology appears, less dramatically, in the CAPE ≥35 / ≥40% row, where the point estimate is 2.0% and the bootstrap 5th percentile is 38.6%.

K.3 Why this happens

Stationary block bootstrap on a rare regime concentrated in one historical cluster has a degenerate failure mode:

The regime "CAPE ≥40" exists, in the available history, only in the connected cluster January 1999 – September 2000 (21 consecutive months).
Each bootstrap replication resamples blocks from the full 1,745-month time series with replacement.
Many replications draw zero blocks that overlap the 1999-2000 cluster. In those replications, there are no CAPE ≥40 starting points and the coverage rate is undefined (0/0).
Replications with one or more overlapping blocks tend to include multiple overlapping copies of the same cluster (because resampling with replacement preserves and amplifies dense clusters).
When the cluster is present, the coverage rate within it is empirically very high; when present in multiple copies, the bootstrap statistic concentrates near the empirically observed within-cluster rate.

The resulting bootstrap distribution is bimodal-or-truncated: heavy mass near the within-cluster rate, with the lower tail effectively discarded as undefined. The reported "5th percentile" is therefore not the 5th percentile of a sampling distribution for the true population coverage rate — it is the 5th percentile of a degenerate distribution conditional on the cluster being resampled at all. This is not the kind of interval that supports the standard frequentist interpretation.

K.4 The Wilson interval as a sanity check

If the 21 monthly observations within the 1999-2000 episode were treated as independent Bernoulli trials (which they are not; they overlap heavily), the 90% Wilson interval for 13 successes in 21 trials would be approximately [44.1%, 77.0%]. This is a reasonable order-of-magnitude bound for what a "calibrated" interval might look like under an idealization. But even this is not appropriate, because:

The 21 windows are overlapping monthly entry points into the same 24-month-look-ahead horizon, not independent draws.
They all come from one historical regime (the dot-com peak), so any underlying generative-process variation is captured by n = 1 episode, not 21 trials.
The relationship between starting CAPE level and forward drawdown was almost certainly different in this episode than it would be in a different macro regime.

K.5 What the paper reports instead

The paper reports:

The point estimate 13/21 = 61.9% as descriptive within-episode coverage.
The non-overlapping-window count: 1 of 1 (the 1999 episode contributes one non-overlapping 24-month window with a 40.9% max drawdown).
The episode-level count: n = 1 (one completed episode of CAPE ≥40 in 145 years of monthly data).
The explicit caveat: this is descriptive history, not a calibrated probability.

The framework's conclusion does not rest on a statistical confidence interval for this figure. It rests on the substantive fact that the only previous time this configuration appeared, the subsequent path was a substantial drawdown across nearly every monthly entry point in that episode. That fact is informative even though it is unrepeatable history.

K.6 What would change this

A revised analysis could:

Extend the regime threshold downward to CAPE ≥35 or ≥30 to obtain more historical observations, at the cost of moving further from the current configuration.
Use a Bayesian framework with an explicit prior on the future coverage rate, treating the 1999-2000 episode as one observation updating a beta-distribution prior. This would produce a posterior with explicitly stated subjectivity.
Augment with cross-country evidence (Japan 1989, Asian valuation peaks) — at the cost of cross-country regime-comparability assumptions that themselves require defense.

Each of these is a different framework choice with different costs. The paper makes none of them; it reports the within-sample observation honestly and lets the reader draw the appropriate inferences.

End of working paper.