Chapter 5: Duality

Duality is a central concept in optimization theory. It associates with every optimization problem (the primal) a companion dual problem whose optimal value provides a lower bound on the primal optimal value. Under convexity and suitable regularity conditions, the two optimal values coincide (strong duality), giving rise to powerful optimality conditions known as the KKT conditions.


5.1 Motivation: From Constrained to Unconstrained

Consider the single-inequality problem

$$p^\star = \min_x f(x) \quad \text{s.t.} \quad g(x) \le 0.$$

Constrained problems are difficult to optimize directly. The key idea is to absorb the constraint into the objective using an indicator function:

$$I_{g \le 0}(x) = \begin{cases} 0, & g(x) \le 0, \\ +\infty, & g(x) > 0. \end{cases}$$

Then the problem becomes unconstrained:

$$p^\star = \min_x \bigl( f(x) + I_{g \le 0}(x) \bigr).$$

5.1.1 Linearizing the Indicator

Key Claim

$$I_{g \le 0}(x) = \max_{\lambda \ge 0} \lambda\, g(x).$$

Explanation: If $g(x) \le 0$, then $\lambda g(x) \le 0$ for all $\lambda \ge 0$, and the supremum $0$ is attained at $\lambda = 0$. If $g(x) > 0$, then $\lambda g(x) \to +\infty$ as $\lambda \to \infty$, so the supremum is $+\infty$.

Thus the indicator function can be expressed as a supremum of functions that are linear in $\lambda$.

5.1.2 Min–Max Reformulation

Substituting, we obtain

$$f(x) + I_{g \le 0}(x) = f(x) + \max_{\lambda \ge 0} \lambda g(x) = \max_{\lambda \ge 0} \bigl( f(x) + \lambda g(x) \bigr).$$

Hence the original problem becomes an unconstrained min–max problem:

$$p^\star = \min_x \max_{\lambda \ge 0} \bigl( f(x) + \lambda g(x) \bigr).$$

Game interpretation: Player A (the minimizer) chooses $x$; Player B (the maximizer) chooses $\lambda \ge 0$. The quantity being optimized is

$$L(x, \lambda) = f(x) + \lambda g(x).$$

The order of play matters—this motivates the fundamental inequality of duality.


5.2 The Weak Max–Min Inequality

Theorem (Weak Max–Min Inequality)

For any function $\varphi(x, y)$,

$$\max_y \min_x \varphi(x, y) \;\le\; \min_x \max_y \varphi(x, y).$$

Proof sketch: For any fixed $y_0$ and $x_0$,

$$\min_x \varphi(x, y_0) \;\le\; \varphi(x_0, y_0) \;\le\; \max_y \varphi(x_0, y).$$

Taking the maximum over $y_0$ on the left-hand side and then the minimum over $x_0$ on the right-hand side establishes the inequality.

Applying this inequality to the Lagrangian $L(x, \lambda) = f(x) + \lambda g(x)$ yields:

$$\max_{\lambda \ge 0} \min_x L(x, \lambda) \;\le\; \min_x \max_{\lambda \ge 0} L(x, \lambda) = p^\star.$$
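The weak max–min inequality is easy to see numerically in the finite case. The following sketch uses a hypothetical 2×2 payoff matrix (the data and the row/column player roles are invented for illustration) and checks that playing second is never worse for either player:

```python
import numpy as np

# A tiny matrix "game": phi(i, j) = Phi[i, j]; the row player minimizes
# over i, the column player maximizes over j. (Hypothetical payoff matrix.)
Phi = np.array([[3.0, 1.0],
                [0.0, 2.0]])

# max_j min_i phi(i, j): the maximizer commits first, the minimizer responds.
max_min = Phi.min(axis=0).max()   # min over rows per column, then max

# min_i max_j phi(i, j): the minimizer commits first, the maximizer responds.
min_max = Phi.max(axis=1).min()   # max over columns per row, then min

# Weak max-min inequality: max_min <= min_max (here the gap is strict).
assert max_min <= min_max
print(max_min, min_max)
```

For this matrix the two values differ ($1 \ne 2$), showing that the inequality can be strict: the order of play matters.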

5.3 Definitions: Lagrangian, Dual Function, Dual Problem

For the single-inequality problem, we define:

Definition (Lagrangian, Single-Inequality Case)

$$L(x, \lambda) = f(x) + \lambda g(x), \qquad \lambda \ge 0.$$

$\lambda$ is called the Lagrange multiplier (or dual variable).

Definition (Dual Function)

$$q(\lambda) = \min_x L(x, \lambda).$$

Definition (Dual Problem)

$$d^\star = \max_{\lambda \ge 0} q(\lambda).$$

Definition (Weak Duality)

$$d^\star \le p^\star.$$

The difference $p^\star - d^\star$ is called the duality gap (always $\ge 0$).

Definition (Strong Duality)

$p^\star = d^\star$. This holds for convex problems under appropriate regularity conditions (e.g., Slater's condition).

Example: Unit Circle Constraint

Consider

$$\min_{x_1, x_2}\; x_1 + x_2 \quad \text{s.t.} \quad x_1^2 + x_2^2 \le 1.$$
  • Lagrangian: $L(x, \lambda) = x_1 + x_2 + \lambda (x_1^2 + x_2^2 - 1)$, $\lambda \ge 0$.

  • Dual function: For $\lambda > 0$, minimizing over $x$ gives $x_1 = x_2 = -\frac{1}{2\lambda}$, so
    $$q(\lambda) = \min_x L(x, \lambda) = -\frac{1}{2\lambda} - \lambda.$$

  • Dual problem:
    $$\max_{\lambda \ge 0} q(\lambda) = \max_{\lambda \ge 0} \Bigl( -\frac{1}{2\lambda} - \lambda \Bigr) = -\sqrt{2}, \qquad \lambda^\star = \frac{1}{\sqrt{2}}.$$

  • Verification: The primal solution is $x^\star = \bigl( -\tfrac{1}{\sqrt{2}}, -\tfrac{1}{\sqrt{2}} \bigr)$ on the unit circle, giving $p^\star = -\sqrt{2} = d^\star$. Strong duality holds.
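The example above is easy to check numerically. This sketch hard-codes the dual function $-\frac{1}{2\lambda} - \lambda$ derived above and verifies both the zero duality gap at $\lambda^\star = 1/\sqrt{2}$ and the weak-duality lower bound at other multipliers:

```python
import numpy as np

# Dual function for the unit-circle example, valid for lam > 0 (derived above).
def q(lam):
    return -1 / (2 * lam) - lam

lam_star = 1 / np.sqrt(2)

# Primal optimum: x* = (-1/sqrt(2), -1/sqrt(2)) lies on the unit circle.
x_star = -np.ones(2) / np.sqrt(2)
p_star = x_star.sum()                      # = -sqrt(2)

assert np.isclose(p_star, -np.sqrt(2))
assert np.isclose(q(lam_star), p_star)     # zero duality gap at lam* = 1/sqrt(2)

# Weak duality: every lam > 0 yields a lower bound on p*.
for lam in np.linspace(0.1, 5.0, 50):
    assert q(lam) <= p_star + 1e-12
```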


5.4 The General Optimization Problem

Extend the idea to the standard form:

$$\min_x f(x) \quad \text{s.t.} \quad g_i(x) \le 0, \; i = 1, \dots, m, \qquad h_j(x) = 0, \; j = 1, \dots, p.$$
  • Inequality constraints require $\lambda_i \ge 0$ (nonnegative multipliers).
  • Equality constraints carry no sign restriction: $\nu_j \in \mathbb{R}$.

Definition (Lagrangian, General Case)

Let $\lambda \in \mathbb{R}_+^m$ and $\nu \in \mathbb{R}^p$. The Lagrangian is

$$L(x, \lambda, \nu) = f(x) + \sum_{i=1}^m \lambda_i g_i(x) + \sum_{j=1}^p \nu_j h_j(x) = f(x) + \lambda^\top g(x) + \nu^\top h(x).$$

Definition (Lagrange Dual Function)

$$g(\lambda, \nu) = \inf_{x \in \mathcal{D}} L(x, \lambda, \nu) = \inf_{x \in \mathcal{D}} \Bigl( f(x) + \sum_{i=1}^m \lambda_i g_i(x) + \sum_{j=1}^p \nu_j h_j(x) \Bigr),$$

where $\mathcal{D}$ is the domain of the primal problem.

Properties:

  • $g(\lambda, \nu)$ is a concave function of $(\lambda, \nu)$, even when the primal is nonconvex.
  • $g$ can be $-\infty$ for some $(\lambda, \nu)$; the points where it is finite form its effective domain.
  • Lower bound property: If $\lambda \ge 0$, then $g(\lambda, \nu) \le p^\star$.

Proof of lower bound property: If $\tilde{x}$ is feasible and $\lambda \ge 0$, then $g_i(\tilde{x}) \le 0$ and $h_j(\tilde{x}) = 0$, so

$$g(\lambda, \nu) = \inf_{x \in \mathcal{D}} L(x, \lambda, \nu) \;\le\; L(\tilde{x}, \lambda, \nu) \;\le\; f(\tilde{x}).$$

Minimizing over all feasible $\tilde{x}$ gives $g(\lambda, \nu) \le p^\star$.

Definition (Lagrange Dual Problem)

$$\max_{\lambda, \nu}\; g(\lambda, \nu) \quad \text{s.t.} \quad \lambda \ge 0.$$
  • Always a convex optimization problem (maximization of a concave function), even when the primal is nonconvex.
  • Optimal value denoted $d^\star$.
  • $(\lambda, \nu)$ is dual feasible if $\lambda \ge 0$ and $(\lambda, \nu) \in \operatorname{dom} g$.
  • $d^\star = -\infty$ if the dual problem is infeasible; $d^\star = +\infty$ if it is unbounded above.

Why the Dual Problem is Always Convex

$$g(\lambda, \nu) = \inf_x \Bigl( f(x) + \sum_{i=1}^m \lambda_i g_i(x) + \sum_{j=1}^p \nu_j h_j(x) \Bigr).$$

For each fixed $x$, the expression inside the infimum is affine in $(\lambda, \nu)$, so $g$ is a pointwise infimum of affine functions and hence concave. The constraint $\lambda \ge 0$ is convex. Thus the dual is the maximization of a concave function over a convex set, i.e., a convex optimization problem.
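To see the concavity claim concretely, here is a small numerical check on a deliberately nonconvex one-dimensional primal (the choices $f(x) = x^4 - 3x^2$ and constraint $x - 1 \le 0$ are hypothetical, invented for this sketch). The dual function, approximated by minimizing over a grid of $x$ values, still passes a midpoint-concavity test:

```python
import numpy as np

# Nonconvex 1-D primal (hypothetical): min f(x) s.t. g(x) <= 0 with
# f(x) = x**4 - 3*x**2 and g(x) = x - 1. Grid approximation of the dual.
xs = np.linspace(-3.0, 3.0, 2001)
f = xs**4 - 3 * xs**2
gcon = xs - 1

def q(lam):
    """Dual function q(lam) = min_x (f(x) + lam * g(x)), on the grid."""
    return (f + lam * gcon).min()

# Midpoint concavity: q((a+b)/2) >= (q(a) + q(b)) / 2 for sampled lam >= 0.
# This holds exactly, since q is a pointwise min of finitely many affine
# functions of lam (one per grid point).
lams = np.linspace(0.0, 10.0, 41)
for a in lams:
    for b in lams:
        assert q((a + b) / 2) >= 0.5 * (q(a) + q(b)) - 1e-9
```

Even though the primal objective is nonconvex (two global minima), every sampled midpoint test passes, consistent with concavity of $g$.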


5.5 Weak and Strong Duality

Theorem (Weak Duality)

$$d^\star \le p^\star.$$

This holds always—for convex and nonconvex problems alike. It can be used to find nontrivial lower bounds for difficult problems.

For example, solving the SDP dual of the two-way partitioning problem provides a lower bound on its (NP-hard) primal.

Theorem (Strong Duality)

$d^\star = p^\star$ does not hold in general, but it (usually) holds for convex problems. Sufficient conditions that guarantee strong duality for convex problems are called constraint qualifications.


5.6 Slater's Constraint Qualification

Consider a convex problem:

$$\min_x f_0(x) \quad \text{s.t.} \quad f_i(x) \le 0, \; i = 1, \dots, m, \qquad Ax = b.$$

Definition (Slater's Condition)

The problem is strictly feasible, i.e., there exists $x \in \operatorname{int} \mathcal{D}$ such that

$$f_i(x) < 0, \; i = 1, \dots, m, \qquad Ax = b.$$

Theorem (Slater's Theorem)

If the primal is convex and Slater's condition holds, then:

  1. Strong duality holds: $p^\star = d^\star$.
  2. If $p^\star > -\infty$, the dual optimum is attained: there exist dual optimal $\lambda^\star, \nu^\star$.

Refinements:

  • $\operatorname{int} \mathcal{D}$ can be replaced with $\operatorname{relint} \mathcal{D}$ (relative interior).
  • Affine inequality constraints do not need to hold with strict inequality.
  • Many other constraint qualifications exist.

5.7 Complementary Slackness

Assume $x$ satisfies the primal constraints and $\lambda \ge 0$. Then

$$g(\lambda, \nu) = \inf_{\tilde{x} \in \mathcal{D}} \Bigl( f_0(\tilde{x}) + \sum_{i=1}^m \lambda_i f_i(\tilde{x}) + \sum_{i=1}^p \nu_i h_i(\tilde{x}) \Bigr) \;\le\; f_0(x) + \sum_{i=1}^m \lambda_i f_i(x) + \sum_{i=1}^p \nu_i h_i(x) \;\le\; f_0(x).$$

Equality $f_0(x) = g(\lambda, \nu)$ holds if and only if both inequalities hold with equality:

  1. First inequality: $x$ minimizes $L(\tilde{x}, \lambda, \nu)$ over $\tilde{x} \in \mathcal{D}$.
  2. Second inequality: $\lambda_i f_i(x) = 0$ for $i = 1, \dots, m$.

Definition (Complementary Slackness)

$$\lambda_i^\star > 0 \implies f_i(x^\star) = 0, \qquad f_i(x^\star) < 0 \implies \lambda_i^\star = 0.$$

Equivalently, $\lambda_i^\star f_i(x^\star) = 0$ for all $i = 1, \dots, m$.

The name reflects that the two inequalities $\lambda_i^\star \ge 0$ and $f_i(x^\star) \le 0$ cannot both be slack (hold strictly) at the optimum: at least one of $\lambda_i^\star$ and $f_i(x^\star)$ must equal zero.


5.8 KKT (Karush–Kuhn–Tucker) Conditions

Assume strong duality holds, $x^\star$ is primal optimal, and $(\lambda^\star, \nu^\star)$ is dual optimal. Then the following necessary conditions hold:

  1. Primal feasibility: $f_i(x^\star) \le 0$ for $i = 1, \dots, m$ and $h_i(x^\star) = 0$ for $i = 1, \dots, p$.
  2. Dual feasibility: $\lambda^\star \ge 0$.
  3. Complementary slackness: $\lambda_i^\star f_i(x^\star) = 0$ for $i = 1, \dots, m$.
  4. Stationarity: $x^\star$ is a minimizer of $L(\cdot, \lambda^\star, \nu^\star)$.

Conversely, these four conditions imply optimality of $x^\star$ and $(\lambda^\star, \nu^\star)$, and strong duality.

If the problem is convex and the functions $f_i$, $h_i$ are differentiable, condition 4 can be written as a gradient condition:

Definition (KKT Conditions for Differentiable Problems)

A point $(x^\star, \lambda^\star, \nu^\star)$ satisfies the KKT conditions if:

  1. Primal feasibility: $g_i(x^\star) \le 0$, $h_j(x^\star) = 0$
  2. Dual feasibility: $\lambda_i^\star \ge 0$
  3. Complementary slackness: $\lambda_i^\star g_i(x^\star) = 0$
  4. Stationarity: $\nabla f(x^\star) + \sum_{i=1}^m \lambda_i^\star \nabla g_i(x^\star) + \sum_{j=1}^p \nu_j^\star \nabla h_j(x^\star) = 0$

Role of the KKT Conditions

Theorem (KKT Necessity and Sufficiency)

Necessity: If $x^\star$ is a primal optimal solution and Slater's condition holds, then there exist $(\lambda^\star, \nu^\star)$ such that $(x^\star, \lambda^\star, \nu^\star)$ satisfies all KKT conditions.

Sufficiency: If the problem is convex and $(x^\star, \lambda^\star, \nu^\star)$ satisfies the KKT conditions, then $x^\star$ is a globally optimal solution.

In summary: for a convex problem satisfying Slater's condition,

$x^\star$ is optimal $\iff$ there exist $\lambda^\star, \nu^\star$ satisfying KKT conditions 1–4 (with condition 4 in either the minimizer form or, for differentiable problems, the gradient form).

5.9 Dual of Standard Problem Classes

5.9.1 Linear Program (LP) — Standard Form

Primal:

$$\min_x\; c^\top x \quad \text{s.t.} \quad Ax = b, \; x \ge 0.$$

Lagrangian: $L(x, \lambda, \nu) = c^\top x + \nu^\top (Ax - b) - \lambda^\top x = -b^\top \nu + (c + A^\top \nu - \lambda)^\top x$.

Since $L$ is affine in $x$, the dual function is finite only when $c + A^\top \nu - \lambda = 0$:

$$g(\lambda, \nu) = \begin{cases} -b^\top \nu, & A^\top \nu - \lambda + c = 0, \\ -\infty, & \text{otherwise}. \end{cases}$$

Dual problem (eliminating $\lambda = A^\top \nu + c \ge 0$):

$$\max_\nu\; -b^\top \nu \quad \text{s.t.} \quad A^\top \nu + c \ge 0.$$

From Slater's condition: $p^\star = d^\star$ if there exists $\tilde{x} \ge 0$ with $A\tilde{x} = b$. In fact, for LP, $p^\star = d^\star$ holds except when both the primal and the dual are infeasible.
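As a sanity check, both the standard-form primal and the dual derived above can be solved with scipy.optimize.linprog on a tiny instance (the data $c$, $A$, $b$ below are hypothetical, invented for this sketch); the two optimal values coincide, as LP duality predicts:

```python
import numpy as np
from scipy.optimize import linprog

# Standard-form LP (hypothetical data): min c^T x  s.t.  Ax = b, x >= 0.
c = np.array([1.0, 2.0])
A = np.array([[1.0, 1.0]])
b = np.array([1.0])

primal = linprog(c, A_eq=A, b_eq=b, bounds=[(0, None)] * 2, method="highs")

# Dual: max -b^T nu  s.t.  A^T nu + c >= 0.  As a linprog (which minimizes):
# min b^T nu  s.t.  -A^T nu <= c, with nu free; then d* = -(optimal value).
dual = linprog(b, A_ub=-A.T, b_ub=c, bounds=[(None, None)], method="highs")

p_star = primal.fun        # expect 1.0, attained at x = (1, 0)
d_star = -dual.fun         # expect 1.0, attained at nu = -1
print(p_star, d_star)
```

Here the primal vertex $(1, 0)$ and the dual point $\nu = -1$ both give the value $1$, so there is no duality gap.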

5.9.2 Linear Program (LP) — Inequality Form

Primal:

$$\min_x\; c^\top x \quad \text{s.t.} \quad Ax \le b.$$

Lagrangian: $L(x, \lambda) = c^\top x + \lambda^\top (Ax - b) = (c + A^\top \lambda)^\top x - b^\top \lambda$, $\lambda \ge 0$.

Dual function:

$$g(\lambda) = \inf_x \bigl( (c + A^\top \lambda)^\top x - b^\top \lambda \bigr) = \begin{cases} -b^\top \lambda, & A^\top \lambda + c = 0, \\ -\infty, & \text{otherwise}. \end{cases}$$

Dual problem:

$$\max_\lambda\; -b^\top \lambda \quad \text{s.t.} \quad A^\top \lambda + c = 0, \; \lambda \ge 0.$$

From Slater's condition: $p^\star = d^\star$ if $A\tilde{x} < b$ for some $\tilde{x}$.

5.9.3 Quadratic Program (QP)

Assume $P \in \mathbb{S}_{++}^n$ (positive definite).

Primal:

$$\min_x\; x^\top P x \quad \text{s.t.} \quad Ax \le b.$$

Lagrangian: $L(x, \lambda) = x^\top P x + \lambda^\top (Ax - b)$, $\lambda \ge 0$.

Dual function: Minimizing over $x$ (set the gradient to zero: $2Px + A^\top \lambda = 0 \Rightarrow x = -\tfrac{1}{2} P^{-1} A^\top \lambda$) gives

$$g(\lambda) = -\tfrac{1}{4} \lambda^\top A P^{-1} A^\top \lambda - b^\top \lambda.$$

Dual problem:

$$\max_\lambda\; -\tfrac{1}{4} \lambda^\top A P^{-1} A^\top \lambda - b^\top \lambda \quad \text{s.t.} \quad \lambda \ge 0.$$

From Slater's condition: $p^\star = d^\star$ if $A\tilde{x} < b$ for some $\tilde{x}$. In fact, for this QP, $p^\star = d^\star$ always holds.
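The closed-form dual function above is easy to exercise on a tiny QP chosen so the primal optimum is known by inspection. The data below are hypothetical: $P = I$, $A = -I$, $b = -\mathbf{1}$ encode $\min x^\top x$ s.t. $x \ge 1$, whose optimum is $x^\star = (1, 1)$ with $p^\star = 2$:

```python
import numpy as np

# Tiny QP (hypothetical data): min x^T x  s.t.  x >= 1.
P = np.eye(2)
A = -np.eye(2)
b = -np.ones(2)
p_star = 2.0                       # optimum x* = (1, 1), by inspection

def g(lam):
    """Dual function g(lam) = -1/4 lam^T A P^{-1} A^T lam - b^T lam."""
    return -0.25 * lam @ A @ np.linalg.inv(P) @ A.T @ lam - b @ lam

# Maximizing g over lam >= 0 by calculus gives lam* = (2, 2).
lam_star = 2 * np.ones(2)
assert np.isclose(g(lam_star), p_star)      # strong duality: d* = p*

# Weak duality: any lam >= 0 is a lower bound.
rng = np.random.default_rng(0)
for lam in rng.uniform(0, 5, size=(100, 2)):
    assert g(lam) <= p_star + 1e-9
```

Here $g(\lambda) = -\tfrac{1}{4}\|\lambda\|^2 + \mathbf{1}^\top \lambda$, whose coordinatewise maximum over $\lambda \ge 0$ is $1$ per coordinate, giving $d^\star = 2 = p^\star$.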

5.9.4 Equality-Constrained QP

Primal:

$$\min_x\; \tfrac{1}{2} x^\top Q x + c^\top x \quad \text{s.t.} \quad Ax = b,$$

where $Q \succeq 0$ and $A$ has full row rank ($m \le n$).

KKT system:

$$\begin{bmatrix} Q & A^\top \\ A & 0 \end{bmatrix} \begin{bmatrix} x^\star \\ \nu^\star \end{bmatrix} = \begin{bmatrix} -c \\ b \end{bmatrix}.$$

Since the problem is convex, the KKT solution $(x^\star, \nu^\star)$ is globally optimal.
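The KKT system can be solved as one linear solve. A minimal sketch with hypothetical data $Q$, $c$, $A$, $b$ (invented for illustration):

```python
import numpy as np

# Hypothetical data: Q positive definite, A with full row rank.
Q = np.array([[2.0, 0.5],
              [0.5, 1.0]])
c = np.array([1.0, -1.0])
A = np.array([[1.0, 1.0]])
b = np.array([1.0])

n, m = 2, 1
# Assemble and solve the KKT system  [Q A^T; A 0] [x; nu] = [-c; b].
K = np.block([[Q, A.T],
              [A, np.zeros((m, m))]])
rhs = np.concatenate([-c, b])
sol = np.linalg.solve(K, rhs)
x, nu = sol[:n], sol[n:]

# Check the two KKT equations directly.
assert np.allclose(Q @ x + A.T @ nu, -c)   # stationarity
assert np.allclose(A @ x, b)               # primal feasibility
```

The block system is symmetric but indefinite, so in larger settings a symmetric-indefinite factorization is the usual choice; plain `solve` suffices for this sketch.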

5.9.5 SDP Connection (Two-Way Partitioning)

Primal (nonconvex):

$$\min_x\; x^\top W x \quad \text{s.t.} \quad x_i^2 = 1, \; i = 1, \dots, n.$$

The feasible set is $\{-1, 1\}^n$ ($2^n$ discrete points). The cost can be written as $x^\top W x = \sum_i W_{ii} + 2 \sum_{i > j} W_{ij} x_i x_j$, so moving $i$ and $j$ from the same set to different sets changes the cost by $-4 W_{ij}$.

Lagrangian: $L(x, \nu) = x^\top W x + \sum_{i=1}^n \nu_i (x_i^2 - 1) = x^\top (W + \operatorname{diag}(\nu)) x - \mathbf{1}^\top \nu$.

Dual function:

$$g(\nu) = \begin{cases} -\mathbf{1}^\top \nu, & W + \operatorname{diag}(\nu) \succeq 0, \\ -\infty, & \text{otherwise}. \end{cases}$$

Dual problem (SDP):

$$\max_\nu\; -\mathbf{1}^\top \nu \quad \text{s.t.} \quad W + \operatorname{diag}(\nu) \succeq 0.$$

Lower bound: $p^\star \ge -\mathbf{1}^\top \nu$ whenever $W + \operatorname{diag}(\nu) \succeq 0$. For example, choosing $\nu = -\lambda_{\min}(W)\,\mathbf{1}$ yields $p^\star \ge n\,\lambda_{\min}(W)$.
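For small $n$ the eigenvalue bound can be compared against the exact primal optimum by brute force over all $2^n$ sign vectors; the random symmetric $W$ below is hypothetical:

```python
import numpy as np
from itertools import product

rng = np.random.default_rng(1)
n = 8
W = rng.standard_normal((n, n))
W = (W + W.T) / 2                      # symmetric cost matrix (hypothetical)

# Brute-force primal: enumerate all 2^n points of {-1, 1}^n.
p_star = min(x @ W @ x
             for x in (np.array(s) for s in product([-1.0, 1.0], repeat=n)))

# Dual bound from nu = -lambda_min(W) * 1:  p* >= n * lambda_min(W).
bound = n * np.linalg.eigvalsh(W).min()
assert p_star >= bound - 1e-9
print(p_star, bound)
```

The bound holds because $x^\top W x \ge \lambda_{\min}(W)\,\|x\|^2 = n\,\lambda_{\min}(W)$ for every feasible $x$; solving the full dual SDP generally gives a tighter bound than this single choice of $\nu$.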


5.10 Geometric Interpretation of Duality

The relationship between the primal and dual problems can be understood geometrically as a min–max exchange. The primal problem minimizes over x the pointwise maximum (over λ0) of the Lagrangian; the dual maximizes over λ0 the pointwise minimum (over x) of the Lagrangian:

$$\max_{\lambda \ge 0} \min_x L(x, \lambda) = d^\star \;\le\; \min_x \max_{\lambda \ge 0} L(x, \lambda) = p^\star.$$

The duality gap $p^\star - d^\star \ge 0$ measures the "wedge" between these two saddle-point problems. Strong duality ($p^\star = d^\star$) occurs precisely when a saddle point of $L(x, \lambda)$ exists, i.e., there is a pair $(x^\star, \lambda^\star)$ such that

$$L(x^\star, \lambda) \;\le\; L(x^\star, \lambda^\star) \;\le\; L(x, \lambda^\star)$$

for all $x$ and $\lambda \ge 0$. This is the essence of the KKT conditions.

Epigraph Interpretation

Define the set

$$\mathcal{A} = \bigl\{ (u, t) \in \mathbb{R}^{m+1} \;\big|\; \exists x : f_i(x) \le u_i \; (i = 1, \dots, m), \; f_0(x) \le t \bigr\}.$$

The primal optimal value is $p^\star = \inf\{ t \mid (0, t) \in \mathcal{A} \}$. The dual problem can be viewed as finding the best affine support (hyperplane) to $\mathcal{A}$ from below, parametrized by $\lambda \ge 0$:

$$g(\lambda) = \inf_{(u, t) \in \mathcal{A}} \bigl( t + \lambda^\top u \bigr).$$

When $\mathcal{A}$ is convex (as in convex problems), supporting-hyperplane theorems explain why strong duality tends to hold under mild regularity conditions.


5.11 Sensitivity Analysis

Dual variables carry economic meaning: they measure the sensitivity of the optimal value to perturbations in the constraints.

5.11.1 Single Perturbation

Consider perturbing a single inequality constraint $g(x) \le 0$ to $g(x) \le u$:

$$p^\star(u) = \min_x \{\, f(x) \mid g(x) \le u \,\}.$$

Global Lower Bound

$$p^\star(u) \ge p^\star - \lambda^\star u,$$

where $\lambda^\star$ is the optimal dual variable for the original ($u = 0$) problem, assuming strong duality holds.

  • If $\lambda^\star$ is large and $u < 0$ (tightening the constraint): $p^\star(u)$ increases greatly.
  • If $\lambda^\star$ is small and $u > 0$ (loosening the constraint): $p^\star(u)$ does not decrease much.

Local Sensitivity

$$\left.\frac{d\,p^\star(u)}{du}\right|_{u=0} = -\lambda^\star.$$

For small $|u|$, $p^\star(u) \approx p^\star - \lambda^\star u$.

Economic interpretation: $\lambda^\star$ is the shadow price of the constraint. If the market price for an additional unit of the constrained resource is less than $\lambda^\star$, it pays to buy more (loosen the constraint); if the price exceeds $\lambda^\star$, it is advantageous to sell capacity (tighten the constraint).

5.11.2 General Case (Multiple Perturbations)

Perturb both inequality and equality constraints:

$$p^\star(u, v) = \min_x f(x) \quad \text{s.t.} \quad g_i(x) \le u_i, \; i = 1, \dots, m, \qquad h_i(x) = v_i, \; i = 1, \dots, p.$$

Global Lower Bound

$$p^\star(u, v) \ge q(\lambda^\star, \nu^\star) - u^\top \lambda^\star - v^\top \nu^\star = p^\star(0, 0) - u^\top \lambda^\star - v^\top \nu^\star.$$

Interpretation of dual variables:

| Dual variable | Effect of perturbation |
| --- | --- |
| Large $\lambda_i^\star$ | $p^\star$ increases greatly if $u_i < 0$ (tighten) |
| Small $\lambda_i^\star$ | $p^\star$ decreases little if $u_i > 0$ (loosen) |
| Large positive $\nu_i^\star$ | $p^\star$ increases greatly if $v_i < 0$ |
| Large negative $\nu_i^\star$ | $p^\star$ increases greatly if $v_i > 0$ |
| Small $|\nu_i^\star|$ | $p^\star$ changes little under perturbations of $v_i$ |

Local Sensitivity (General Case)

$$\left.\frac{\partial p^\star(u, v)}{\partial u_i}\right|_{(0,0)} = -\lambda_i^\star, \qquad \left.\frac{\partial p^\star(u, v)}{\partial v_i}\right|_{(0,0)} = -\nu_i^\star.$$
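Both sensitivity statements can be checked on the unit-circle example from earlier in the chapter, where the perturbed problem $\min x_1 + x_2$ s.t. $x_1^2 + x_2^2 - 1 \le u$ has the closed form $p^\star(u) = -\sqrt{2(1+u)}$ and $\lambda^\star = 1/\sqrt{2}$:

```python
import numpy as np

# Perturbed unit-circle problem: p*(u) = -sqrt(2 * (1 + u)) for u > -1.
def p(u):
    return -np.sqrt(2 * (1 + u))

lam_star = 1 / np.sqrt(2)

# Global lower bound: p*(u) >= p*(0) - lam* * u for all u.
for u in np.linspace(-0.9, 2.0, 60):
    assert p(u) >= p(0) - lam_star * u - 1e-12

# Local sensitivity: dp*/du at u = 0 equals -lam* (finite-difference check).
h = 1e-6
deriv = (p(h) - p(-h)) / (2 * h)
assert np.isclose(deriv, -lam_star, atol=1e-5)
```

The global bound here is the concavity inequality $\sqrt{1+u} \le 1 + u/2$ in disguise, and the derivative at $u = 0$ is exactly $-1/\sqrt{2} = -\lambda^\star$.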

5.12 Worked Examples

5.12.1 Least-Norm Solution of Linear Equations

Primal:

$$\min_x\; x^\top x \quad \text{s.t.} \quad Ax = b.$$

Lagrangian: $L(x, \nu) = x^\top x + \nu^\top (Ax - b)$.

Set the gradient to zero: $\nabla_x L = 2x + A^\top \nu = 0 \Rightarrow x = -\tfrac{1}{2} A^\top \nu$.

Plug into $L$:

$$g(\nu) = L\bigl( -\tfrac{1}{2} A^\top \nu, \nu \bigr) = -\tfrac{1}{4} \nu^\top A A^\top \nu - b^\top \nu.$$

This is a concave function of ν. By the lower bound property:

$$p^\star \ge -\tfrac{1}{4} \nu^\top A A^\top \nu - b^\top \nu \quad \text{for all } \nu.$$
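A numerical check with a random hypothetical $A$ and $b$: the closed-form primal solution $x^\star = A^\top (A A^\top)^{-1} b$ attains the same value as the dual maximum, while arbitrary $\nu$ give lower bounds:

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((3, 5))       # hypothetical wide matrix, full row rank
b = rng.standard_normal(3)

# Primal optimum in closed form: x* = A^T (A A^T)^{-1} b.
x_star = A.T @ np.linalg.solve(A @ A.T, b)
p_star = x_star @ x_star

def g(nu):
    """Dual function g(nu) = -1/4 nu^T A A^T nu - b^T nu."""
    return -0.25 * nu @ A @ A.T @ nu - b @ nu

# Maximizing g: gradient -1/2 A A^T nu - b = 0 gives nu* = -2 (A A^T)^{-1} b.
nu_star = -2 * np.linalg.solve(A @ A.T, b)
assert np.isclose(g(nu_star), p_star)          # zero duality gap

for nu in rng.standard_normal((100, 3)):       # weak duality for arbitrary nu
    assert g(nu) <= p_star + 1e-9
```

Both values reduce algebraically to $b^\top (A A^\top)^{-1} b$, which is why the gap is exactly zero.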

5.12.2 Water-Filling (Power Allocation)

Primal (reformulated):

$$\min_x\; -\sum_{i=1}^n \ln(x_i + \alpha_i) \quad \text{s.t.} \quad x \ge 0, \; \mathbf{1}^\top x = 1,$$

where $\alpha_i > 0$.

Lagrangian: $L(x, \lambda, \nu) = -\sum_i \ln(x_i + \alpha_i) - \lambda^\top x + \nu (\mathbf{1}^\top x - 1)$.

KKT conditions: $x$ is optimal iff there exist $\lambda \in \mathbb{R}^n$, $\nu \in \mathbb{R}$ such that:

  1. $x \ge 0$, $\mathbf{1}^\top x = 1$
  2. $\lambda \ge 0$
  3. $\lambda_i x_i = 0$, $i = 1, \dots, n$
  4. Stationarity: $-\dfrac{1}{x_i + \alpha_i} - \lambda_i + \nu = 0$, $i = 1, \dots, n$.

Solution:

  • If $\nu < 1/\alpha_i$: $\lambda_i = 0$ and $x_i = 1/\nu - \alpha_i$.
  • If $\nu \ge 1/\alpha_i$: $x_i = 0$ and $\lambda_i = \nu - 1/\alpha_i$.

These combine to

$$x_i^\star = \max\Bigl\{ 0, \frac{1}{\nu} - \alpha_i \Bigr\}, \qquad \lambda_i^\star = \max\Bigl\{ 0, \nu - \frac{1}{\alpha_i} \Bigr\}.$$

Determine $\nu$ from $\mathbf{1}^\top x = 1$:

$$\sum_{i=1}^n \max\Bigl\{ 0, \frac{1}{\nu} - \alpha_i \Bigr\} = 1.$$

Interpretation: Think of $n$ patches with ground level $\alpha_i$. Flood the region with a unit amount of water. The resulting water level is $1/\nu$; patch $i$ receives water of depth $x_i = \max\{0, 1/\nu - \alpha_i\}$.
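The equation determining $\nu$ can be solved by bisection on the water level $w = 1/\nu$, since the total amount of allocated water is monotone increasing in $w$. A minimal sketch (the function name and the bracketing choice are this sketch's own):

```python
import numpy as np

def water_filling(alpha, total=1.0):
    """Solve sum_i max(0, w - alpha_i) = total by bisection on the
    water level w = 1/nu; returns the allocation x and the level w."""
    alpha = np.asarray(alpha, dtype=float)
    lo, hi = alpha.min(), alpha.max() + total   # bracket: sum(lo)=0, sum(hi)>=total
    for _ in range(100):
        w = (lo + hi) / 2
        if np.maximum(0.0, w - alpha).sum() > total:
            hi = w
        else:
            lo = w
    x = np.maximum(0.0, w - alpha)
    return x, w

alpha = np.array([0.5, 1.0, 2.0])
x, w = water_filling(alpha)
assert np.isclose(x.sum(), 1.0, atol=1e-8)     # budget constraint 1^T x = 1
assert (x >= -1e-12).all()
print(x, w)
```

For these levels the water settles at $w = 1.25$: the deepest patch gets $0.75$, the middle one $0.25$, and the tallest stays dry.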

5.12.3 Projection onto the 1-Norm Ball

Primal:

$$\min_x\; \tfrac{1}{2} \|x - a\|_2^2 \quad \text{s.t.} \quad \|x\|_1 \le 1.$$

KKT conditions:

  1. $\|x\|_1 \le 1$
  2. $\lambda \ge 0$
  3. $\lambda (1 - \|x\|_1) = 0$
  4. $x$ minimizes $L(\tilde{x}, \lambda) = \tfrac{1}{2} \|\tilde{x} - a\|_2^2 + \lambda (\|\tilde{x}\|_1 - 1) = \sum_{k=1}^n \bigl( \tfrac{1}{2} (\tilde{x}_k - a_k)^2 + \lambda |\tilde{x}_k| \bigr) - \lambda$.

The problem is separable across coordinates. For $\lambda \ge 0$, the minimizer is the soft-thresholding operator:

$$x_k^\star = \begin{cases} a_k - \lambda, & a_k \ge \lambda, \\ 0, & -\lambda \le a_k \le \lambda, \\ a_k + \lambda, & a_k \le -\lambda. \end{cases}$$

Hence $\|x^\star\|_1 = \sum_k |x_k^\star| = \sum_k \max\{0, |a_k| - \lambda\}$.

  • If $\|a\|_1 \le 1$, the solution is $\lambda = 0$, $x^\star = a$.
  • Otherwise, solve the piecewise-linear equation in $\lambda$:
    $$\sum_{k=1}^n \max\{0, |a_k| - \lambda\} = 1.$$
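The same bisection idea used for water-filling solves the projection: $\lambda = 0$ works when $a$ is already inside the ball, and otherwise $\lambda$ is found from the monotone piecewise-linear equation above. A minimal sketch (the function name is this sketch's own):

```python
import numpy as np

def project_l1_ball(a, radius=1.0):
    """Project a onto {x : ||x||_1 <= radius} by bisection on the
    soft-threshold level lambda (one standard approach; a sketch)."""
    a = np.asarray(a, dtype=float)
    if np.abs(a).sum() <= radius:
        return a.copy()                       # already inside: lambda = 0
    lo, hi = 0.0, np.abs(a).max()             # at hi the thresholded sum is 0
    for _ in range(100):
        lam = (lo + hi) / 2
        if np.maximum(0.0, np.abs(a) - lam).sum() > radius:
            lo = lam                          # threshold too small
        else:
            hi = lam
    # Soft-thresholding with the final level lam.
    return np.sign(a) * np.maximum(0.0, np.abs(a) - lam)

x = project_l1_ball(np.array([2.0, -1.0, 0.25]))
assert np.isclose(np.abs(x).sum(), 1.0, atol=1e-8)   # lands on the ball surface
print(x)
```

For this input the bisection converges to $\lambda = 1$, so only the largest coordinate survives the threshold.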

5.13 Summary

| Concept | Key idea |
| --- | --- |
| Lagrangian | Weighted sum of objective and constraints |
| Dual function | Infimum of Lagrangian over primal variables; always concave |
| Dual problem | Maximize dual function over $\lambda \ge 0$; always convex |
| Weak duality | $d^\star \le p^\star$ (always holds) |
| Strong duality | $d^\star = p^\star$ (holds for convex problems under constraint qualifications) |
| Slater's condition | Existence of a strictly feasible point guarantees strong duality |
| Complementary slackness | $\lambda_i^\star g_i(x^\star) = 0$: at the optimum, at least one of $\lambda_i^\star$, $g_i(x^\star)$ is zero |
| KKT conditions | Necessary and sufficient optimality conditions for convex problems under Slater |
| Sensitivity | $\partial p^\star / \partial u_i = -\lambda_i^\star$: dual variables as shadow prices |