Data Report

Predicting Circuit Splits: A Machine-Learning Study

A model trained on four decades of appellate decisions forecasts where the next circuit split will open -- months before the conflict becomes visible.

Data Report · 9 min read · May 2026

predictive modeling
appellate law
circuit splits
machine learning
litigation strategy

Circuit splits reshape the landscape of federal law, yet they are almost always recognized only in hindsight, after conflicting rulings have already forced litigants and counsel to react. Bennet Legal Research Group set out to invert that timeline. Using SPLITCAST, a machine-learning system trained on 4.1 million federal appellate opinions spanning 1984 through 2025, we forecast the emergence of doctrinal conflict across circuits with a lead time of roughly nine months and an out-of-sample F1 of 0.91. This report describes the model, what it learned about how splits form, and how legal leaders can use anticipatory intelligence to position rather than react.

The big picture

A circuit split is not a random event. It is the visible surface of pressures that build for months or years beneath it: divergent panel compositions, textual ambiguities left unresolved by the Supreme Court, migrating fact patterns, and citation networks that quietly bifurcate long before any judge names the conflict. Our thesis was simple. If those pressures are legible in the historical record, a sufficiently expressive model should be able to detect them before the split crystallizes.

SPLITCAST confirms the thesis. Across a held-out test period covering 2019 through 2025, the model flagged 87 percent of the doctrinal conflicts that legal commentators later identified, with a median lead time of 8.9 months ahead of the first widely recognized split-acknowledging opinion. It did so while maintaining a precision of 0.90, meaning its warnings were rarely false alarms.

The implication for legal strategy is significant. An intelligence capability that anticipates where the law is about to fracture lets firms select forums, time filings, shape test cases, and brief clients on emerging risk while competitors are still reading last quarter's opinions.

What the data shows

SPLITCAST assigns each active doctrinal question a Split Pressure Score between 0 and 100. In backtesting, questions that eventually produced a recognized circuit split crossed a score of 70 an average of 8.9 months before the conflict became public. Questions that never split rarely exceeded 40. The separation between the two populations was clean enough to support a decision threshold with an area under the ROC curve of 0.94.

The model surfaced structure that human observers had missed. It identified 34 doctrinal questions in our monitoring set that carried elevated pressure scores as of May 2026, of which we assess 11 as high-confidence emerging splits. Two of the eleven -- in areas of administrative deference and digital-evidence authentication -- had not, at the time of writing, drawn meaningful commentary despite crossing the model's warning threshold four to six months earlier.

Feature-attribution analysis showed that the strongest predictors were not the ones intuition suggests. Raw disagreement in outcomes mattered less than divergence in the reasoning path -- specifically, the rate at which panels in different circuits began citing non-overlapping lines of authority to resolve nominally identical questions. Citation-graph fragmentation preceded outcome divergence by a median of five months.

Methodology

The training corpus comprised 4.1 million federal appellate opinions from 1984 through 2025, each parsed into a structured representation capturing holdings, citation edges, panel metadata, procedural posture, and a 1,024-dimension semantic embedding of the reasoning. We constructed a labeled set of 2,860 historical circuit splits, hand-verified by our Intelligence Desk, and an equal-sized control set of doctrinal questions that were litigated across multiple circuits but never split.

SPLITCAST is an ensemble: a temporal graph neural network over the evolving citation network, a gradient-boosted model over structured case features, and a transformer that reads reasoning text. Their outputs are combined by a calibrated meta-learner. We trained on 1984 through 2018 and evaluated strictly out of sample on 2019 through 2025, with no information from the test period leaking into training -- a walk-forward protocol that mirrors how the system would have been used in real time.

Reported metrics -- F1 of 0.91, precision 0.90, recall 0.87, ROC-AUC 0.94 -- are all from the held-out window. We stress that these are retrospective forecasts on historical data. Live forward performance is being tracked in a registered prospective study and will be reported in 2027.

Key findings

First, splits are forecastable. The dominant view that doctrinal conflict is essentially unpredictable does not survive contact with the data at scale. The signal is faint in any single opinion but robust across the citation network as a whole.

Second, reasoning diverges before outcomes do. The earliest and most reliable warning sign is not judges disagreeing on results but judges quietly reaching for different authorities to reach the same result. By the time outcomes visibly conflict, the split is often already months old in the model's view.

Third, the highest-pressure questions cluster in domains of rapid factual change -- technology, data, and novel financial instruments -- where existing precedent is stretched over facts it was never written to govern. Fourteen of the model's top twenty active pressure scores sit in these frontier areas.

Implications for leaders

Anticipatory intelligence changes the unit of legal strategy from reaction to positioning. A firm that knows a split is likely to open in a given doctrine can advise clients to structure transactions defensively, can identify favorable forums, and can prepare the arguments that will matter when the conflict surfaces -- all while the rest of the market is unaware.

For general counsel, split forecasting is a risk-management instrument. Knowing that a question governing your industry is under elevated doctrinal pressure lets you quantify and hedge exposure before a conflicting ruling forces a scramble. Several questions in our current monitoring set bear directly on compliance obligations that many companies assume are settled.

Leaders should treat these forecasts as probabilistic intelligence, not prophecy. The value is in shifting the odds and buying time, not in certainty. A nine-month head start, used well, is a decisive advantage even when any individual forecast carries irreducible uncertainty.

What comes next

Bennet Legal Research Group is expanding SPLITCAST to state courts of last resort, where inter-jurisdictional divergence is more frequent and less studied, and where our early experiments suggest even richer predictive signal. We are also building an alerting layer that notifies subscribers the moment a monitored question crosses its warning threshold.

The prospective validation study, registered in early 2026, will report live forecasting performance against splits that emerge through 2027 -- the true test of any predictive system. We will publish those results whether or not they flatter the model, because credibility in this domain is earned only by honest scoring against the future.

The larger lesson extends beyond splits. Much of what the legal world treats as unforeseeable is merely unmeasured. Applied at scale, machine intelligence turns the unforeseeable into the anticipated -- and anticipation, in law as in markets, is where advantage lives.