Methodology Changelog

Significant changes to the scoring methodology, data pipeline, and corpus coverage. Minor bug fixes and infrastructure updates are not listed. Earlier entries are approximate.

2026-05

NT filing detection

NT 10-K and NT 10-Q filings (late-filing notifications) are now detected and flagged separately from scored pairs. Companies with active NT flags display a warning badge on the company page and trigger a dedicated alert email. 1,444 companies in the current corpus carry at least one historical NT flag. NT signals are not included in the drift score — they are a separate binary indicator.

2026-05

10-Q quarterly scoring — year-over-year pairing

Quarterly 10-Q filings are now ingested and scored using year-over-year pairing (Q1 this year vs. Q1 last year, etc.) rather than sequential quarter-to-quarter comparison. Sequential pairing produced near-zero signal during sustained distress periods because adjacent quarters look similar; YoY pairing captures the cumulative drift. A separate 10-Q ceiling is computed from the quarterly corpus using the same 95th-percentile method as the 10-K ceiling. 10-Q scores are shown as orange triangle overlays on the company chart and are included in API responses and alert emails.

2026-05

Corpus expanded to 4,900+ companies

Coverage expanded from the original labeled corpus (~40 companies) through two phases of EDGAR backfill to the current 4,900+ company universe. The control ceiling — the 95th-percentile score across all non-crisis companies in the same year — is now computed from this larger corpus and is more stable than the early labeled-corpus ceiling. All reported statistics are computed on the full corpus.

2026-04

Semantic drift component (research prototype)

We built a second scoring component — semantic drift, the cosine distance between sentence embeddings of consecutive risk-factor sections — to catch meaning-level changes that phrase counting misses (paraphrasing, restructured disclosures, buried hedging). It is computed and stored alongside the primary score. To be precise about what is live: the score currently served across the site and API is the phrase-frequency escalation score. The semantic component is not yet blended into the published number; we'll update this entry when it is.

2026-03

Corpus-wide period normalization

A company's phrase escalation is weighted by how rare each phrase is across the whole corpus (IDF) and downweighted if the phrase escalated across the whole corpus in that filing year. This removes the confound where risk language rises corpus-wide during macro stress periods (COVID, 2008, regional banking crisis) and would otherwise inflate individual company scores. The normalization is corpus-wide, not by SIC sector (per-sector normalization was tested and performed worse). The control ceiling is the 95th percentile of control-company scores.

2025-11

Initial release — phrase-frequency scoring on 10-K filings

FilingDrift launched with a phrase-escalation model applied to annual 10-K filings from EDGAR. Risk factor sections are tokenized, and a curated vocabulary of distress-associated phrases (going concern, impairment, liquidity risk, etc.) is tracked across consecutive filing pairs. Scores are IDF-weighted to reduce the influence of phrases that appear across the entire corpus. Initial labeled corpus: 11 crisis companies, 29 controls.

For the full technical methodology, see the methodology page. For API documentation, see API docs.