Models are trained on the historical data to forecast time steps in the forecast {{ 'horizon' | plurify: numberOfHorizonsInTest}} .
Metrics are computed on the evaluation time steps (i.e. time steps in the {{ 'gap' | plurify: numberOfHorizonsInTest}} are ignored).