Forecasting Pipeline¶

This page describes the main execution flow for a typical foreblocks forecasting workload.

End-to-end flow¶

flowchart TD
    A([Raw series\nT × D]) -->|fit_transform| B[TimeSeriesHandler]
    B --> C[X: N×T×F\ny: N×H×D]
    C -->|create_dataloaders| D[(DataLoader\ntrain / val / test)]

    D --> E[ForecastingModel\nhead + backbone]
    E --> F[Trainer]
    F -->|train loop| G{converged?}
    G -->|no| F
    G -->|yes| H[ModelEvaluator]

    H --> I[metrics\nMAE · RMSE · MAPE]
    H --> J[CV windows\nrolling evaluation]
    H --> K[learning curve\nplots]

    E -.->|optional| L[ConformalPredictionEngine\ncalibrate → predict intervals]
    E -.->|optional| M[DARTSSearcher\narchitecture search]

Pipeline stages¶

Data preparation — raw series or pre-windowed arrays
Dataloader construction — create_dataloaders or manual TimeSeriesDataset
Model assembly — ForecastingModel with head + optional backbone
Training — Trainer with early stopping, scheduler, and optional MLTracker
Evaluation — ModelEvaluator for metrics, CV, and plots
Optional branches — conformal intervals, DARTS search, synthetic data

Data preparation¶

You can start from either:

pre-windowed tensors or NumPy arrays ([N, T, F] + [N, H, D])
a raw multivariate time series shaped [T, D]

If you start from raw series data, TimeSeriesHandler handles:

windowing and horizon slicing
scaling (StandardScaler / RobustScaler)
outlier handling and imputation
optional time features

Model assembly¶

The central abstraction is ForecastingModel. Common paths:

DirectSeq2Seq (recurrent)TransformerHybrid Mamba

A single head module maps from flattened input to the forecast horizon. Simplest path.

LSTM or GRU encoder produces a context vector; a separate decoder generates horizon steps.

TransformerEncoder + TransformerDecoder with patching, attention-type selection, MoE feedforward, and skip connections.

HybridMambaBlock or HybridMamba2Block as backbone — SSM dynamics with optional sliding-window attention.

Training¶

Trainer orchestrates:

optimiser and scheduler stepping
validation loop and early stopping
optional MLTracker integration (auto_track=True by default)
optional NAS hooks for DARTS
optional conformal machinery during training

Tip

Pass auto_track=False for local smoke tests to skip MLTracker initialisation.

Evaluation¶

ModelEvaluator provides batched prediction, rolling cross-validation, and plots. See Evaluation & Metrics for the full API.

Optional branches¶

Conformal prediction intervals

After training, use ConformalPredictionEngine to attach coverage-guaranteed prediction bands to any model, using only a held-out calibration set. See Uncertainty Quantification.

Neural architecture search

Replace the manual model-assembly step with DARTSSearcher. The searcher handles zero-cost screening, bilevel search, and final retraining. See DARTS Guide.

Synthetic datasets

Use foretools.tsgen to generate controlled AR, seasonal, trend, and noise series for benchmarking or ablation studies. See Time Series Generator.