Part V · Chapter 16

Forecast Evaluation

Bias, RMSE, MAE, MAPE, and Theil's U₂ each emphasize different aspects of forecast error. Pick the metric that matches your loss function.

Learning objectives

Compute bias, MAE, MSE, RMSE, MAPE, Theil U₂.
Understand which metric penalizes which errors.
Recognize zero-bias but high-error situations.
Compare model vs naive benchmark with Theil U₂.

Forecast Error Dashboard

Seed:

Live metrics

Bias:
MAE:
RMSE:
MAPE: %
Theil U₂:

📐 Metric formulas

\[ \text{Bias}=\bar e,\; \text{MAE}=\overline{|e_t|},\; \text{MSE}=\overline{e_t^2},\; \text{RMSE}=\sqrt{\text{MSE}} \]
\[ \text{MAPE}=100\,\overline{|e_t/y_t|},\quad U_2=\sqrt{\frac{\sum(\hat y_t - y_t)^2}{\sum(y_{t-1}-y_t)^2}} \]
RMSE penalizes large errors quadratically
MAE linear penalty, robust
MAPE scale-free; unstable near zero
U₂ < 1 beats naive RW benchmark

🔍 What to look for

⚠️ Pro Tip: What to Avoid

Student says

"Bias is near zero, so the forecast is accurate."

Why this is wrong

Zero bias means errors cancel ON AVERAGE. The forecast can still be wildly off period-by-period (high RMSE/MAE). Bias is necessary but not sufficient.

Correct interpretation

Report bias + RMSE + Theil U₂ jointly. Low bias + low RMSE + U₂ < 1 is the win condition.

📝 Mini-quiz

📋 Key Takeaways

MetricUse when
RMSELarge errors are costly
MAEAll errors equally costly
MAPEScale-free comparison (NO zeros)
Theil U₂Compare vs naive RW benchmark
BiasDetect systematic over/under-prediction