Benders Decomposition

Overview

Stochastic Dual Dynamic Programming (SDDP) solves large multistage stochastic optimization problems by decomposing them into smaller single-stage subproblems. This decomposition is rooted in Benders decomposition (also known as the L-shaped method), which splits a monolithic problem into a master problem and subproblems connected through linear inequalities called cuts.

In the hydrothermal dispatch context, a direct formulation over all stages, scenarios, and equipment would produce an LP with millions of variables and constraints. Benders decomposition breaks this into $T$ stage subproblems, each solved independently for a given incoming state and scenario realization.

The Stage Subproblem

Each stage $t$ solves an LP of the form:

$x_{t}, θ_{t} min c_{t}^{⊤} x_{t} + θ_{t}$

$s.t. A_{t} x_{t} = b_{t} - E_{t} x_{t - 1}, x_{t} \in X_{t}, θ_{t} \geq \underline{V}_{t + 1} (x_{t})$

where $x_{t - 1}$ is the incoming state (reservoir volumes, AR inflow lags) fixed from the previous stage, $c_{t}^{⊤} x_{t}$ captures the immediate cost (thermal generation, penalties, regularization), and $θ_{t}$ approximates the expected future cost from stages $t + 1$ through $T$ .

The full LP structure — objective terms, constraint families, slack variables, and variable bounds — is described in LP Formulation.

How Cuts Approximate Future Cost

The true future cost function $V_{t + 1} (x_{t})$ is convex but unknown. SDDP builds a piecewise-linear lower approximation $\underline{V}_{t + 1}$ through iterative cut generation:

$\underline{V}_{t + 1} (x_{t}) = i \in K max {α_{i} + π_{i}^{⊤} x_{t}}$

Each cut $(α_{i}, π_{i})$ is a supporting hyperplane derived from dual variables of the backward pass. When the stage $t + 1$ subproblem is solved at a trial state $\overset{x}{^}_{t}$ under scenario $ω$ , the optimal dual multipliers $π^{*}$ of the state-linking constraints yield:

Slope: $π = E_{t + 1}^{⊤} π^{*}$ (sensitivity of future cost to incoming state)
Intercept: $α = V_{t + 1} (\overset{x}{^}_{t}, ω) - π^{⊤} \overset{x}{^}_{t}$

Notation note: The matrix form $E_{t + 1}^{⊤} π^{*}$ above is the standard Benders decomposition notation from the stochastic programming literature. In Cobre’s formal specs, the cut coefficients are expressed directly in terms of LP dual variables: $π_{h}^{v} = π_{h}^{f i x}$ (the storage fixing constraint dual) and $π_{h, ℓ}^{l a g}$ (the lag-fixing constraint dual), without explicit technology matrices. Both conventions are mathematically equivalent — the matrix form is compact for general derivations, while the direct-dual form is more practical for implementation. See Cut Coefficient Derivation below.

In the hydrothermal problem, the state variables are reservoir storage volumes $v_{h}$ and AR inflow lags $a_{h, ℓ}$ , so the cut coefficients come from the storage fixing dual $π_{h}^{f i x}$ and the lag-fixing constraint dual $π_{h, ℓ}^{l a g}$ .

Cut Coefficient Derivation

The key insight is how dual variables map to cut coefficients. Cobre introduces an explicit incoming-state LP variable for each state dimension, fixed to the trial value via an equality constraint:

$v_{h}^{in} = \overset{v}{^}_{h}, a_{h, ℓ} = \overset{a}{^}_{h, ℓ}$

By the LP envelope theorem, the dual of each fixing constraint equals the total derivative of the optimal objective with respect to the fixed value, accounting for all downstream effects:

Fixing constraint	Dual	Cut coefficient
$v_{h}^{in} = \overset{v}{^}_{h}$	$π_{h}^{f i x}$	$π_{h}^{v} = π_{h}^{f i x}$
$a_{h, ℓ} = \overset{a}{^}_{h, ℓ}$	$π_{h, ℓ}^{l a g}$	$π_{h, ℓ}^{l a g} = π_{h, ℓ}^{l a g}$

For storage, the dual $π_{h}^{f i x}$ captures contributions from the water balance, FPHA hyperplane constraints, and any generic constraints that reference incoming storage — all aggregated automatically by the LP solver through the single incoming-storage variable. No manual dual combination is required.

The cut intercept ensures the cut passes exactly through the trial point:

$α_{t} = Q_{t} (\overset{x}{^}_{t - 1}, ω) - h \sum π_{t, h}^{v} \cdot \overset{v}{^}_{h} - h, ℓ \sum π_{t, h, ℓ}^{l a g} \cdot \overset{a}{^}_{h, ℓ}$

where $Q_{t} (\overset{x}{^}_{t - 1}, ω)$ is the optimal stage $t$ objective value.

Single-Cut Aggregation

In the backward pass, each trial state is evaluated against all $∣Ω∣$ scenario realizations. The resulting per-scenario cuts are aggregated into one cut per trial point using probability-weighted expectation:

$\overset{α}{ˉ}_{t - 1} = ω \in Ω \sum p (ω) \cdot α_{t} (ω)$

$\overset{π}{ˉ}_{t - 1, h}^{v} = ω \in Ω \sum p (ω) \cdot π_{t, h}^{v} (ω)$

$\overset{π}{ˉ}_{t - 1, h, ℓ}^{l a g} = ω \in Ω \sum p (ω) \cdot π_{t, h, ℓ}^{l a g} (ω)$

where $p (ω)$ is the probability of scenario $ω$ (uniform for the fixed opening tree: $p (ω) = 1/∣Ω∣$ ). The aggregated cut $(\overset{α}{ˉ}, \overset{π}{ˉ}^{v}, \overset{π}{ˉ}^{l a g})$ is added to stage $t - 1$ ’s cut pool. This single-cut formulation keeps the LP compact: after $K$ iterations the LP has $K$ cuts, regardless of the number of scenarios per iteration.

The Forward-Backward Iteration

SDDP alternates between two passes:

Forward pass: Simulate scenario paths from stage 1 to $T$ , solving each stage LP with the current cut approximation. This produces trial states $\overset{x}{^}_{t}$ and a statistical upper bound on optimal cost.
Backward pass: Traverse from stage $T$ back to stage 1. At each stage, solve subproblems for all scenario realizations at each trial state, extract dual multipliers, aggregate them into new cuts, and add each cut to the stage’s cut pool. The first-stage objective (including $θ_{1}$ ) provides a deterministic lower bound.

Cuts accumulate over iterations, progressively tightening $\underline{V}_{t + 1}$ toward the true $V_{t + 1}$ . The algorithm converges when the gap between lower and upper bounds falls below a tolerance.

Why the LP Formulation Matters

The quality of Benders cuts depends directly on the LP formulation:

Recourse slacks (deficit, excess) guarantee feasibility for every scenario, ensuring duals always exist. Without relatively complete recourse, the backward pass would encounter infeasible subproblems.
Penalty magnitudes shape the value function. Correct priority ordering ensures that cuts propagate economically meaningful signals across stages.
State variable identification determines which dual multipliers contribute to cut coefficients. Every constraint linking the current stage to the incoming state produces a component of $π$ .

LP Formulation — complete stage subproblem: objective, constraints, penalties, and Benders cut interface
Cut Management — cut selection, deletion, and storage strategies
SDDP Theory — the iterative algorithm structure
Forward-Backward Pass — detailed mechanics of the forward simulation and backward cut generation passes

Cobre Methodology Reference