GribStream

Forecast Model Accuracy (URMA benchmark)

This dashboard benchmarks NOAA forecast models against the URMA analysis for North America. Pick a city to see local behavior. You can switch the variable between 2 m temperature and 10 m wind speed. Models compared side by side: NBM, GFS, HRRR, RRFS 2D fields, NAM CONUS Nest, and AIGFS Surface.

What you are seeing

Key metric definitions

MAE (Mean Absolute Error): the average of |forecast − URMA| over the window. Lower is better.

Bias: the average of forecast − URMA. Positive means the model runs high, negative low.

Max error: the largest miss in the window. Useful to spot outliers and regime changes.

How to read the panels

Data sources and docs: NBM · HRRR · GFS · RRFS 2D fields · NAM CONUS Nest · AIGFS Surface · URMA. Requests use the GribStream shared-parameter catalog so each model is queried for the same physical signal with consistent output units. Forecast values are selected as they were known at the start of the 36-hour window and compared with later URMA analysis through 24 hours ago.