Chapter 4: Convex Optimization Problems

4.1 Optimization Problem in Standard Form

An optimization problem in standard form is written as

min_x f_0(x)
s.t. f_i(x) ≤ 0, i = 1, …, m
     h_j(x) = 0, j = 1, …, p

where:

  • x ∈ R^n is the optimization variable.
  • f_0 : R^n → R is the objective (cost) function.
  • f_i : R^n → R are the inequality constraint functions.
  • h_j : R^n → R are the equality constraint functions.

Optimal Value

The optimal value p* of the problem is defined as

p* = inf { f_0(x) : f_i(x) ≤ 0, i = 1, …, m,  h_j(x) = 0, j = 1, …, p }

  • p* = +∞: the problem is infeasible.
  • p* = −∞: the problem is unbounded below.

Feasible, Optimal, and Locally Optimal Points

  • x is feasible if it satisfies all constraints.

  • x is optimal if x is feasible and f_0(x) = p*.

  • x is locally optimal if there is an R > 0 such that x is optimal for

    min_z f_0(z)
    s.t. f_i(z) ≤ 0, i = 1, …, m
         h_j(z) = 0, j = 1, …, p
         ‖z − x‖_2 ≤ R

  • If x is feasible and f_0(x) ≤ p* + ε, then x is called ε-suboptimal.

Examples (with n = 1, m = p = 0):

  • f_0(x) = log x, dom f_0 = R_++: p* = −∞
  • f_0(x) = x log x, dom f_0 = R_++: p* = −1/e, x* = 1/e
  • f_0(x) = x³ − 3x, dom f_0 = R: p* = −∞, local optimum at x = 1
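A quick numerical sanity check of the second example above (a sketch, assuming numpy is available; the grid resolution is arbitrary):

```python
import numpy as np

# Check that f0(x) = x*log(x) on R++ attains its minimum p* = -1/e at x* = 1/e
# by brute-force evaluation on a fine grid.
x = np.linspace(1e-6, 2.0, 200001)
f = x * np.log(x)
i = np.argmin(f)
x_star, p_star = x[i], f[i]
print(x_star, p_star)  # approx 0.3679 and -0.3679
```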

Implicit Constraints and Domain

The implicit domain constraint is

x ∈ D = ∩_{i=0}^m dom f_i ∩ ∩_{j=1}^p dom h_j

  • D is the domain of the problem.
  • Explicit constraints: f_i(x) ≤ 0, h_j(x) = 0.
  • The problem is unconstrained if m=p=0.

Example:

min_x f_0(x) = −Σ_{i=1}^k log(b_i − a_i^T x)

is an unconstrained problem with implicit constraints a_i^T x < b_i.

Feasibility Problem

A feasibility problem asks whether any feasible point exists:

find x
s.t. f_i(x) ≤ 0, i = 1, …, m
     h_j(x) = 0, j = 1, …, p

This can be considered a special case of the general problem with f_0(x) ≡ 0:

  • p* = 0 if the constraints are feasible (any feasible x is optimal).
  • p* = +∞ if infeasible.

4.2 Convex Optimization Problem

A problem is a convex optimization problem if it is of the form

min_x f_0(x)
s.t. f_i(x) ≤ 0, i = 1, …, m
     a_i^T x = b_i, i = 1, …, p

Definition. An optimization problem is convex if:

  1. f0 is convex.
  2. f_1, …, f_m are convex.
  3. h_1, …, h_p are affine: h_i(x) = a_i^T x − b_i.

Key Property. The feasible set of a convex optimization problem is convex: it is the intersection of m sublevel sets (of convex functions) and p hyperplanes, which is convex.


4.3 Local and Global Optima

Theorem (Local = Global). For a convex optimization problem, every locally optimal point is also globally optimal.

Proof. Suppose x is locally optimal and there exists a feasible y with f_0(y) < f_0(x). For z = θy + (1−θ)x with small θ > 0, convexity gives

f_0(z) ≤ θ f_0(y) + (1−θ) f_0(x) < f_0(x)

But local optimality requires f_0(x) ≤ f_0(z) for all feasible z sufficiently near x, a contradiction.

Uniqueness. If f0 is strictly convex, then a convex optimization problem has at most one global minimizer (uniqueness).


4.4 First-Order Optimality Condition

For a convex problem

min_x f(x) s.t. x ∈ C

Theorem (First-Order Optimality). A feasible point x is optimal if and only if

⟨∇f(x), y − x⟩ ≥ 0  for all y ∈ C

Geometric interpretation:

  • ∇f(x) makes a non-obtuse (≤ 90°) angle with every feasible direction y − x at an optimal x.
  • −∇f(x) defines a supporting hyperplane to the feasible set C at x.
  • ∇f(x) is normal to the tangent plane of the feasible set at x.

Special Cases

Unconstrained optimization:

∇f_0(x) = 0

Equality-constrained minimization:

min_x f_0(x) s.t. Ax = b   ⇔   ∃ν: ∇f_0(x) + A^T ν = 0

Minimization over the nonnegative orthant:

min_x f_0(x) s.t. x ⪰ 0   ⇔   ∇f_0(x) ⪰ 0,  x_i (∇f_0(x))_i = 0, i = 1, …, n

Example: Unconstrained Quadratic Minimization

Consider minimizing the quadratic function:

f(x) = (1/2) x^T Q x + b^T x + c

where Q ⪰ 0. The first-order condition yields

∇f(x) = Qx + b = 0

  • If Q ≻ 0, there is a unique solution: x* = −Q⁻¹ b.
  • If Q is singular and b ∉ R(Q), there is no solution (min_x f(x) = −∞).
  • If Q is singular and b ∈ R(Q), there are infinitely many solutions: x* = −Q⁺ b + z, z ∈ N(Q), where Q⁺ is the pseudoinverse of Q.
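A small numerical sketch of the first and third cases (the matrices below are made up for illustration):

```python
import numpy as np

# Case Q > 0: unique minimizer x* = -Q^{-1} b.
Q = np.array([[2.0, 0.0], [0.0, 1.0]])
b = np.array([1.0, -1.0])
x_unique = -np.linalg.solve(Q, b)

# Case Q singular, b in R(Q): minimizers x* = -Q^+ b + z with z in N(Q).
Qs = np.array([[1.0, 0.0], [0.0, 0.0]])   # singular
bs = np.array([1.0, 0.0])                 # lies in R(Qs)
x_part = -np.linalg.pinv(Qs) @ bs         # one particular minimizer
z = np.array([0.0, 3.0])                  # any z in N(Qs) gives another
grad = Qs @ (x_part + z) + bs             # gradient vanishes at both
print(x_unique, x_part, grad)
```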

4.5 Equivalent Transformations

Two problems are (informally) equivalent if their solutions can be readily converted into each other. Common transformations that preserve equivalence:

Transformations and Change of Variables

If h : R → R is a monotone increasing transformation, then

min_{x∈C} f(x)   ⇔   min_{x∈C} h(f(x))

This can be used to reveal "hidden convexity" of a problem.

If φ : R^m → R^n is one-to-one and its image covers the feasible set C, then we can change variables:

min_x f(x) s.t. x ∈ C   ⇔   min_y f(φ(y)) s.t. φ(y) ∈ C

Example: Log-likelihood transformation. In maximum likelihood estimation (MLE), given independent samples x_1, …, x_n with likelihood

L(θ) = Π_{i=1}^n p(x_i; θ)

applying the strictly increasing log(·) transformation yields

ℓ(θ) = log L(θ) = Σ_{i=1}^n log p(x_i; θ)

with the key property: argmax_θ L(θ) = argmax_θ ℓ(θ).
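A numerical illustration of this key property (a sketch with made-up Gaussian samples of known unit variance; both maximizers coincide with the sample mean):

```python
import numpy as np

samples = np.array([1.0, 2.0, 0.5, 1.5, 2.5])
thetas = np.linspace(-2.0, 5.0, 7001)

# Unnormalized Gaussian likelihood of each candidate theta.
def likelihood(theta):
    return np.prod(np.exp(-0.5 * (samples[:, None] - theta) ** 2), axis=0)

L = likelihood(thetas)
ll = np.log(L)
theta_L = thetas[np.argmax(L)]    # maximizer of L
theta_ll = thetas[np.argmax(ll)]  # maximizer of log L -- same point
print(theta_L, theta_ll)
```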

Eliminating Equality Constraints

Given:

min_x f_0(x)
s.t. f_i(x) ≤ 0, i = 1, …, m
     Ax = b

Any feasible point can be expressed as x = My + x_0, where A x_0 = b and R(M) = N(A). The problem is equivalent to

min_y f_0(My + x_0)
s.t. f_i(My + x_0) ≤ 0, i = 1, …, m
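Constructing such an M and x_0 can be sketched with numpy/scipy (A and b below are made up; `null_space` gives an orthonormal basis of N(A)):

```python
import numpy as np
from scipy.linalg import null_space

A = np.array([[1.0, 1.0, 0.0], [0.0, 1.0, 1.0]])
b = np.array([1.0, 2.0])

x0 = np.linalg.lstsq(A, b, rcond=None)[0]  # one particular solution of Ax = b
M = null_space(A)                          # columns span N(A)

y = np.array([0.7])                        # any y gives a feasible point
x = M @ y + x0
print(A @ x - b)
```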

Introducing Slack Variables

Given:

min_x f_0(x)
s.t. f_i(x) ≤ 0, i = 1, …, m
     Ax = b

the inequality constraints can be transformed via slack variables s_i ≥ 0:

min_{x,s} f_0(x)
s.t. s_i ≥ 0, i = 1, …, m
     f_i(x) + s_i = 0, i = 1, …, m
     Ax = b

Note. The transformed problem is no longer convex unless the f_i are affine, since f_i(x) + s_i = 0 is not an affine equality constraint in general.

Epigraph Formulation

The standard form problem

min_x f_0(x)
s.t. f_i(x) ≤ 0, i = 1, …, m
     Ax = b

is equivalent to its epigraph form:

min_{x,t} t
s.t. f_0(x) − t ≤ 0
     f_i(x) ≤ 0, i = 1, …, m
     Ax = b

Relaxation

Given an optimization problem

min_x f(x) s.t. x ∈ C

we can always take an enlarged constraint set C̃ ⊇ C and consider

min_x f(x) s.t. x ∈ C̃

This is called a relaxation; its optimal value is always less than or equal to that of the original problem (for minimization).

Example: Continuous relaxation of integer constraints.

min_x c^T x s.t. Ax ⪯ b, x ∈ {0,1}^n   →   x ∈ [0,1]^n

  • The feasible set becomes convex (a polytope).
  • The problem reduces to a linear program (LP).
  • It provides a lower bound on the integer optimum.
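The lower-bound property can be checked on a tiny made-up instance (a sketch using scipy's LP solver; the data are arbitrary):

```python
import numpy as np
from scipy.optimize import linprog

# min c^T x  s.t.  Ax <= b,  x in {0,1}^2, relaxed to x in [0,1]^2.
c = np.array([-2.0, -3.0])
A = np.array([[1.0, 2.0]])
b = np.array([2.0])

res = linprog(c, A_ub=A, b_ub=b, bounds=[(0, 1), (0, 1)])
relaxed_opt = res.fun

# Brute-force the integer problem to compare.
integer_opt = min(
    c @ np.array(v)
    for v in [(0, 0), (0, 1), (1, 0), (1, 1)]
    if (A @ np.array(v) <= b).all()
)
print(relaxed_opt, integer_opt)  # relaxation gives a lower bound
```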

Example: Relaxing nonconvex sets. x² + y² = 1 (unit circle) → x² + y² ≤ 1 (unit disk). For non-affine equality constraints h_j(x) = 0 where the h_j are convex, replace them with h_j(x) ≤ 0.

Converting a Nonconvex Problem to a Convex One

Via observation:

Nonconvex problem:

min_{x_1,x_2} f_0(x) = x_1² + x_2²
s.t. f_1(x) = x_1 / (1 + x_2²) ≤ 0
     h_1(x) = (x_1 + x_2)² = 0

f_0 is convex, but f_1 is not convex and h_1 is not affine. The equivalent convex formulation is:

min_{x_1,x_2} x_1² + x_2²
s.t. x_1 ≤ 0
     x_1 + x_2 = 0

Via substitution:

Nonconvex problem: min_x (|x| − 1)². Let t = |x|, with t ≥ 0. The equivalent convex problem is min_t (t − 1)² subject to t ≥ 0.

Example: Geometric program (monomial case). The original nonconvex problem

min_{x≻0} f_0(x) = c_0 x_1^{a_01} x_2^{a_02} ⋯ x_n^{a_0n}
s.t. f_i(x) = c_i x_1^{a_i1} x_2^{a_i2} ⋯ x_n^{a_in} ≤ 1, i = 1, …, m

where c_i > 0, becomes convex via the monotone transformation y_i = log x_i (x_i = e^{y_i}) and taking logarithms:

min_{y∈R^n} log c_0 + a_01 y_1 + ⋯ + a_0n y_n
s.t. log c_i + a_i1 y_1 + ⋯ + a_in y_n ≤ 0, i = 1, …, m

4.6 Linear Programming (LP)

Definition (Linear Program). An LP minimizes a linear objective subject to linear inequality and equality constraints:

min_x c^T x
s.t. Dx ⪯ d, Ax = b

An LP is a convex problem with affine objective and constraint functions. It is the most fundamental problem class in convex optimization.

Inequality and Standard Forms

Inequality form:

min_x c^T x s.t. Ax ⪯ b

Standard form:

min_x c^T x s.t. Ax = b, x ⪰ 0

Conversion between forms:

  • Replace a_i^T x ≤ b_i with a_i^T x + s_i = b_i, s_i ≥ 0 (slack variables).
  • Replace a free variable x_j with x_j = x_j⁺ − x_j⁻, where x_j⁺, x_j⁻ ≥ 0.

Example: Diet Problem

Find the cheapest combination of foods that satisfies certain nutritional requirements:

min_x c^T x
s.t. Dx ⪰ d
     x ⪰ 0

where cj is the per-unit cost of food j, di is the minimum required intake of nutrient i, Dij is the content of nutrient i per unit of food j, and xj is the units of food j in the diet.

Example: Piecewise-linear Minimization

Consider minimizing the piecewise-linear function:

f(x) = max_{i=1,…,m} (a_i^T x + b_i)

This can be transformed to an equivalent LP via its epigraph form:

min_{t,x} t
s.t. a_i^T x + b_i ≤ t, i = 1, …, m

This is an LP (in inequality form) with variables x and t.
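The epigraph LP above can be sketched in one variable with scipy (the coefficients a_i, b_i are made up; variables are (x, t)):

```python
import numpy as np
from scipy.optimize import linprog

# f(x) = max(-x, 0.5x + 1, 2x - 1); minimize via  min t  s.t. a_i x + b_i <= t.
a = np.array([-1.0, 0.5, 2.0])
b = np.array([0.0, 1.0, -1.0])

# Variables z = (x, t); rows encode a_i x - t <= -b_i.
A_ub = np.column_stack([a, -np.ones_like(a)])
b_ub = -b
c = np.array([0.0, 1.0])  # minimize t

res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=[(None, None), (None, None)])
x_opt, t_opt = res.x
print(x_opt, t_opt)  # t_opt equals the minimum of the piecewise-linear max
```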

Example: Chebyshev Center of a Polyhedron

Given a polyhedron P = {x ∈ R^n : Ax ⪯ b}, find the largest Euclidean ball that fits inside it:

B(x_c, r) = {x ∈ R^n : ‖x − x_c‖_2 ≤ r} = {x_c + u : ‖u‖_2 ≤ r}

The problem is formulated as

max_{x_c,r} r
s.t. a_i^T x_c + ‖a_i‖_2 r ≤ b_i, i = 1, …, m

  • ‖a_i‖_2 is the Euclidean norm of the normal vector of the i-th face.
  • Each constraint ensures the ball does not cross the corresponding face.
  • This is a linear program (LP) in x_c and r.
  • Applications: emergency facility placement, sensor/Wi-Fi placement, urban planning.
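A minimal sketch of this LP for the unit square {x : 0 ⪯ x ⪯ 1}, whose Chebyshev center is (0.5, 0.5) with radius 0.5 (the polyhedron is made up for illustration):

```python
import numpy as np
from scipy.optimize import linprog

# Unit square as Ax <= b.
A = np.array([[1.0, 0.0], [-1.0, 0.0], [0.0, 1.0], [0.0, -1.0]])
b = np.array([1.0, 0.0, 1.0, 0.0])

norms = np.linalg.norm(A, axis=1)
# Variables: (xc_1, xc_2, r); maximize r == minimize -r.
A_ub = np.column_stack([A, norms])
c = np.array([0.0, 0.0, -1.0])

res = linprog(c, A_ub=A_ub, b_ub=b,
              bounds=[(None, None), (None, None), (0, None)])
xc, r = res.x[:2], res.x[2]
print(xc, r)  # center (0.5, 0.5), radius 0.5
```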

Example: Basis Pursuit

Consider an underdetermined linear system Ax = b, A ∈ R^{m×n}, m < n. The goal is to find the sparsest solution (minimize the number of nonzero entries).

  • Exact sparsity minimization (the ℓ_0 "norm") is NP-hard.
  • Relaxation: minimize the ℓ_1 norm instead: min_x ‖x‖_1 s.t. Ax = b.
  • The ℓ_1 norm encourages sparsity while remaining convex.

By introducing nonnegative variables u_i ≥ 0 to represent |x_i| (−u_i ≤ x_i ≤ u_i), the LP formulation is

min_{x,u} Σ_{i=1}^n u_i
s.t. Ax = b
     −u_i ≤ x_i ≤ u_i, i = 1, …, n
     u_i ≥ 0, i = 1, …, n
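This LP can be sketched with scipy on a small made-up system (random A, 1-sparse ground truth; the stacked variable is z = (x, u)):

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)
m, n = 3, 6
A = rng.standard_normal((m, n))
x_true = np.zeros(n)
x_true[1] = 1.5                      # 1-sparse ground truth
b = A @ x_true

# Objective: sum of u; equality: Ax = b (u unconstrained by A_eq).
c = np.concatenate([np.zeros(n), np.ones(n)])
A_eq = np.hstack([A, np.zeros((m, n))])
# Encode  x - u <= 0  and  -x - u <= 0.
I = np.eye(n)
A_ub = np.vstack([np.hstack([I, -I]), np.hstack([-I, -I])])
b_ub = np.zeros(2 * n)

res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b,
              bounds=[(None, None)] * n + [(0, None)] * n)
x_hat = res.x[:n]
print(x_hat)  # feasible, with l1 norm no larger than that of x_true
```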

4.7 Quadratic Programming (QP)

Definition (Quadratic Program). A QP minimizes a quadratic objective subject to linear constraints:

min_x (1/2) x^T Q x + c^T x
s.t. Dx ⪯ d, Ax = b

If Q ⪰ 0 (positive semidefinite), the problem is convex. Convex QPs have globally optimal solutions and can be solved efficiently.

Example: Least Squares

Least squares problem:

min_x ‖Ax − b‖_2²

Least squares with bounds:

min_x ‖Ax − b‖_2² s.t. ℓ ⪯ x ⪯ u

QP formulation:

min_x (1/2) x^T (2A^T A) x − (2A^T b)^T x = (1/2) x^T Q x + c^T x

where Q = 2A^T A ⪰ 0, c = −2A^T b.
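The QP stationarity condition Qx + c = 0 then reproduces the least-squares normal equations A^T A x = A^T b, which can be checked numerically (A and b below are made up):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((8, 3))
b = rng.standard_normal(8)

x_lstsq = np.linalg.lstsq(A, b, rcond=None)[0]  # standard least squares

Q = 2 * A.T @ A
c = -2 * A.T @ b
x_qp = np.linalg.solve(Q, -c)   # QP stationary point x* = -Q^{-1} c
print(np.max(np.abs(x_qp - x_lstsq)))
```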

Example: Distance Between Two Polyhedra

Given polyhedra P_1 = {x ∈ R^n : A_1 x ⪯ b_1} and P_2 = {y ∈ R^n : A_2 y ⪯ b_2}, the distance between them is

min_{x,y} ‖x − y‖_2²
s.t. A_1 x ⪯ b_1, A_2 y ⪯ b_2

This is a convex QP.

Example: Linear Program with Random Cost

Let the cost be c^T x, where c is random with mean c̄ = E[c]. A risk-averse objective is

min_x c̄^T x + λ Var(c^T x)

Expanding the variance:

Var(c^T x) = E[(c^T x − c̄^T x)²] = x^T E[(c − c̄)(c − c̄)^T] x = x^T Σ x

The QP formulation is

min_x c̄^T x + λ x^T Σ x
s.t. Dx ⪯ d, x ⪰ 0

where Σ is the covariance matrix of c.

Example: LASSO (Basis Pursuit with Noise)

LASSO trades off data fidelity and sparsity.

Penalty formulation:

min_x (1/2)‖Ax − b‖_2² + λ‖x‖_1

Constrained formulation:

min_x ‖Ax − b‖_2² s.t. ‖x‖_1 ≤ k

Key characteristics:

  • Handles noisy data (LASSO → BP when noise-free).
  • Sparsity is controlled by λ (or k).
  • Can be written as a QP.

4.8 Quadratically Constrained Quadratic Programming (QCQP)

Definition (QCQP). A QCQP minimizes a quadratic objective subject to quadratic inequality and linear equality constraints:

min_x (1/2) x^T P_0 x + q_0^T x + r_0
s.t. (1/2) x^T P_i x + q_i^T x + r_i ≤ 0, i = 1, …, m
     Ax = b

If P_0, P_1, …, P_m ⪰ 0, the problem is convex.

QCQP generalizes QP by allowing quadratic inequality constraints, and in turn is a special case of SOCP.


4.9 Second-Order Cone Programming (SOCP)

Definition (SOCP). An SOCP is of the form:

min_x c^T x
s.t. ‖D_i x + d_i‖_2 ≤ e_i^T x + f_i, i = 1, …, m
     Ax = b

Second-Order (Lorentz) Cone

The second-order cone (also called the Lorentz cone or ice-cream cone) is

Q = {(x, t) ∈ R^{n+1} : ‖x‖_2 ≤ t}

The constraint ‖D_i x + d_i‖_2 ≤ e_i^T x + f_i is equivalent to (D_i x + d_i, e_i^T x + f_i) ∈ Q.

Why is Q convex? It is the epigraph of the Euclidean norm ‖x‖_2, which is a convex function.

QP → SOCP

A QP can be reformulated as an SOCP:

min_x (1/2) x^T Q x + c^T x s.t. Dx ⪯ d, Ax = b
⇔ min_{x,t} c^T x + (1/2) t s.t. Dx ⪯ d, x^T Q x ≤ t, Ax = b

with x^T Q x ≤ t ⇔ ‖Q^{1/2} x‖_2² ≤ (1/4)(1+t)² − (1/4)(1−t)², i.e. ‖(Q^{1/2} x, (1−t)/2)‖_2 ≤ (1+t)/2, which is an SOC constraint.

Example: Robust Linear Program (Deterministic)

In LP, there may be uncertainty in the data c, ai, or bi. With uncertainty in ai, two approaches:

Deterministic (worst-case guarantee): constraints must hold for all ai in an ellipsoidal uncertainty set

E_i = {ā_i + P_i u : ‖u‖_2 ≤ 1},  ā_i ∈ R^n, P_i ∈ R^{n×n}

Robust LP:

min_x c^T x
s.t. a_i^T x ≤ b_i for all a_i ∈ E_i, i = 1, …, m

Equivalent SOCP:

min_x c^T x
s.t. ā_i^T x + ‖P_i^T x‖_2 ≤ b_i, i = 1, …, m

since sup_{‖u‖_2 ≤ 1} (ā_i + P_i u)^T x = ā_i^T x + ‖P_i^T x‖_2.
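A numerical check of this supremum identity (all data made up; the supremum is attained at u* = P^T x / ‖P^T x‖_2):

```python
import numpy as np

rng = np.random.default_rng(2)
a_bar = rng.standard_normal(4)
P = rng.standard_normal((4, 4))
x = rng.standard_normal(4)

closed_form = a_bar @ x + np.linalg.norm(P.T @ x)

u_star = P.T @ x / np.linalg.norm(P.T @ x)
attained = (a_bar + P @ u_star) @ x          # equals closed_form

# Random unit vectors never exceed the closed form.
U = rng.standard_normal((10000, 4))
U /= np.linalg.norm(U, axis=1, keepdims=True)
sampled_max = ((a_bar + U @ P.T) @ x).max()
print(closed_form, attained, sampled_max)
```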

Example: Robust Linear Program (Stochastic)

Assume a_i ∼ N(ā_i, Σ_i) is Gaussian. Then a_i^T x ∼ N(ā_i^T x, x^T Σ_i x), and

Pr(a_i^T x ≤ b_i) = Φ((b_i − ā_i^T x) / ‖Σ_i^{1/2} x‖_2)

where Φ is the CDF of N(0,1). The robust LP with probability constraints

min_x c^T x
s.t. Pr(a_i^T x ≤ b_i) ≥ η, i = 1, …, m

is equivalent to the SOCP (for η ≥ 1/2):

min_x c^T x
s.t. ā_i^T x + Φ⁻¹(η) ‖Σ_i^{1/2} x‖_2 ≤ b_i, i = 1, …, m

The terms ‖P_i^T x‖_2 and Φ⁻¹(η) ‖Σ_i^{1/2} x‖_2 are interpreted as the margin budgeted for robustness.


4.10 Semidefinite Programming (SDP)

Definition (SDP). A semidefinite program is of the form:

min_x c^T x
s.t. x_1 F_1 + x_2 F_2 + ⋯ + x_n F_n + F_0 ⪯ 0
     Ax = b

where F_i ∈ S^k (symmetric k×k matrices). The inequality constraint x_1 F_1 + ⋯ + x_n F_n + F_0 ⪯ 0 is called a linear matrix inequality (LMI).

LP → SDP

An LP can be expressed as an SDP by using a diagonal matrix:

min_x c^T x s.t. Ax ⪯ b   ⇔   min_x c^T x s.t. diag(Ax − b) ⪯ 0

SOCP → SDP

The SOC constraint x2t is equivalent to the LMI

‖x‖_2 ≤ t   ⇔   [ tI   x ]
                [ x^T  t ] ⪰ 0

This follows from the Schur Complement Theorem: for symmetric A, C with C ≻ 0,

[ A    B ]
[ B^T  C ] ⪰ 0   ⇔   A − B C⁻¹ B^T ⪰ 0

Thus an SOCP can be rewritten as an SDP:

min_x c^T x s.t. ‖D_i x + d_i‖_2 ≤ e_i^T x + f_i
⇔ min_x c^T x s.t. [ (e_i^T x + f_i) I   D_i x + d_i  ]
                   [ (D_i x + d_i)^T    e_i^T x + f_i ] ⪰ 0

Example: Eigenvalue Minimization

min_x λ_max(A(x))

where A(x) = A_0 + x_1 A_1 + ⋯ + x_n A_n (with given A_i ∈ S^k). Equivalent SDP:

min_{x,t} t
s.t. A(x) ⪯ tI

since λ_max(A) ≤ t ⇔ A ⪯ tI.
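The final equivalence can be checked numerically: tI − A is positive semidefinite iff its smallest eigenvalue is nonnegative (A below is made up):

```python
import numpy as np

A0 = np.array([[2.0, 1.0], [1.0, 0.0]])
lam_max = np.linalg.eigvalsh(A0)[-1]   # = 1 + sqrt(2)

def is_psd(M, tol=1e-9):
    return np.linalg.eigvalsh(M)[0] >= -tol

above = is_psd((lam_max + 0.1) * np.eye(2) - A0)  # t >  lambda_max
at    = is_psd(lam_max * np.eye(2) - A0)          # t == lambda_max
below = is_psd((lam_max - 0.1) * np.eye(2) - A0)  # t <  lambda_max
print(above, at, below)  # True True False
```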


4.11 Hierarchy of Problem Classes

The canonical convex optimization problem classes form a nested hierarchy:

LP ⊂ QP ⊂ QCQP ⊂ SOCP ⊂ SDP ⊂ Conic Program

Each class is a special case of the next, with increasing modeling expressiveness and computational cost.

Class | Objective                              | Constraints                                   | Key Structure
LP    | Linear c^T x                           | Linear inequalities Ax ⪯ b                    | Polyhedral feasible set
QP    | Quadratic (1/2) x^T Q x + c^T x, Q ⪰ 0 | Linear inequalities                           | LP with quadratic objective
QCQP  | Quadratic, P_0 ⪰ 0                     | Quadratic inequalities, P_i ⪰ 0               | Quadratic in both objective and constraints
SOCP  | Linear                                 | ‖D_i x + d_i‖_2 ≤ e_i^T x + f_i               | Second-order cone constraints
SDP   | Linear                                 | Linear matrix inequality Σ x_i F_i + F_0 ⪯ 0  | Semidefinite constraints

Conic Program

At the top of the hierarchy is the conic program:

Definition (Conic Program).

min_x c^T x
s.t. A(x) ∈ K
     Ax = b

where A : R^n → X is an affine map and K is a convex cone.

All the previous classes are special cases of conic programs, depending on the choice of cone K:

Problem | Cone K
LP      | K = R^n_+ = {x ∈ R^n : x_1, …, x_n ≥ 0} (nonnegative orthant)
SOCP    | K = Q = {(x, t) ∈ R^{n+1} : ‖x‖_2 ≤ t} (second-order cone)
SDP     | K = S^n_+ = {X ∈ S^n : X ⪰ 0} (positive semidefinite cone)

4.12 Comparison: LP vs. QP in Practice

Aspect      | LP              | QP
Objective   | min c^T x       | min (1/2) x^T Q x + c^T x
Constraints | Dx ⪯ d, Ax = b  | Dx ⪯ d, Ax = b

  • LP ⊂ QP (QP recovers LP when Q = 0).
  • Often, solving an LP in theory amounts to solving a QP in practice.

Diet problem:

  • In theory: the cost c is deterministic → LP.
  • In practice: c is a random vector → QP.

Basis pursuit:

  • In theory: samples are noise-free (Ax = b) → LP.
  • In practice: samples are noisy (Ax ≈ b) → QP (LASSO).

Deterministic vs. stochastic robustness:

  • In theory: the problem data are deterministic → LP.
  • In practice: robust LP under uncertainty → SOCP.

4.13 Practical Methods for Establishing Convexity

Establishing Convexity of a Set C

  1. Apply the definition: for all x_1, x_2 ∈ C and 0 ≤ θ ≤ 1, θx_1 + (1−θ)x_2 ∈ C.
  2. Show C is obtained from simple convex sets by operations that preserve convexity:
    • Intersection
    • Affine function
    • Perspective function
    • Linear-fractional function
  3. Show C is a sublevel set of a convex function.
  4. Show C is the epigraph of a convex function.

Establishing Convexity of a Function f

  1. Apply the definition: f(θx + (1−θ)y) ≤ θf(x) + (1−θ)f(y).
  2. Use equivalent conditions (e.g., ∇²f(x) ⪰ 0).
  3. Show f is obtained from simple convex functions by operations that preserve convexity:
    • Nonnegative weighted sum: g(x) = Σ_{i=1}^m α_i f_i(x), α_i ≥ 0.
    • Composition with affine function: g(x)=f(Ax+b).
    • Pointwise maximum: g(x)=max{f1(x),,fm(x)}.
    • Pointwise supremum.
    • Composition with scalar functions.
    • Minimization.
    • Perspective.

4.14 Summary of Key Theorems

  1. The feasible set of a convex optimization problem is convex.
  2. Every local optimum of a convex problem is a global optimum.
  3. For strictly convex f0, the optimal solution (if it exists) is unique.
  4. First-order optimality condition: x is optimal for min_{x∈C} f(x) iff ⟨∇f(x), y − x⟩ ≥ 0 for all y ∈ C.
  5. Problems can be made equivalent through monotone transformations, change of variables, slack variables, epigraph formulation, and relaxation.
  6. The hierarchy LP ⊂ QP ⊂ QCQP ⊂ SOCP ⊂ SDP ⊂ Conic Program provides progressively richer modeling frameworks, each with convex structure guaranteeing global optimality.