Early warning system for cost spikes in Great Britain's electricity balancing mechanism. Half-hourly Elexon BMRS data, gradient boosting models, and SHAP explainability.
The GB electricity balancing mechanism (BM) is the final tool National Grid uses to balance supply and demand in real time. When it goes wrong — renewable intermittency, unexpected demand, plant trips — balancing costs spike sharply. These spikes are expensive and, in theory, partially predictable.
The dissertation question: can we build an early warning system that flags high-cost periods hours before they happen, using only publicly available half-hourly data from Elexon's BMRS API?
| Model | Approach | Key strength | Status |
|---|---|---|---|
| XGBoost | Gradient boosted trees, L1/L2 reg | Handles missing data, robust to outliers | Training |
| LightGBM | Leaf-wise tree growth, histogram binning | Faster on large feature sets, lower memory | Training |
| Baseline | Persistence model (lag-1 prediction) | Sets minimum bar to beat | Done |
Standard classification metrics (AUROC, F1) don't capture the most operationally useful property of an early warning system: how far in advance can it reliably flag an event?
The Early Warning Horizon (EWH) metric measures, for correctly-flagged spike events, the median number of half-hour periods before the event at which the model first exceeds a confidence threshold. A model with AUROC 0.85 but EWH of 1 period is far less useful than one with AUROC 0.82 and EWH of 6 periods (3 hours).
This metric is the primary contribution of the dissertation alongside the forecasting models themselves.
Data pipeline complete. Feature engineering in progress. Model training begins June 2026, with full results and the final dissertation due July 2026. This page will be updated with results, SHAP plots, and a link to the public GitHub repo once the work is submitted.