Skip to content

Chapter 2: Convex Sets

2.1 Mathematical Background

2.1.1 Inner Product of Vectors

Definition (Inner Product). For vectors x,yRn, the inner product (dot product) is

x,y=xy=i=1nxiyi

Geometric Interpretation: The inner product measures the alignment between two vectors:

  • θ=0 vectors are in the same direction.
  • θ=90 vectors are orthogonal (perpendicular).

Example:

x=[12],y=[31]x,y=13+21=5

2.1.2 Angle Between Vectors

The angle θ between x and y is defined by:

cosθ=x,yxy,x=x,x

Example:

x=[12],y=[31]θ=cos1(5510)

2.1.3 Inner Product of Matrices

Definition (Frobenius Inner Product). For matrices A,BRm×n, the Frobenius inner product is

A,BF=i=1mj=1nAijBij=trace(AB)

Frobenius Norm:

AF=A,AF=i,jAij2

Angle Between Matrices:

cosθ=A,BFAFBF

Example:

A=[1203],B=[2110]A,BF=12+21+01+30=4

2.1.4 Common Vector Norms

  1. Euclidean Norm (2 norm):

    x2=i=1nxi2
  2. 1-Norm (1 norm):

    x1=i=1n|xi|
  3. Chebyshev Norm ( norm):

    x=maxi|xi|
  4. p-Norm (general p norm, p1):

    xp=(i=1n|xi|p)1/p

2.1.5 Symmetric Matrices

Definition. A square matrix ARn×n is symmetric if

A=Aor equivalentlyAij=Aji for all i,j

Key Properties:

  • All eigenvalues are real.
  • Eigenvectors corresponding to distinct eigenvalues are orthogonal.
  • Symmetric matrices are diagonalizable by an orthogonal matrix.

2.1.6 Eigenvalue Decomposition

Theorem (Eigenvalue Decomposition). If A is symmetric, it admits

A=QΛQ=i=1nλiqiqi

where:

  • Q=[q1qn] is orthonormal: QQ=I
  • Λ=diag(λ1,,λn) with λiR
  • Each term λiqiqi is a rank-1 symmetric matrix.

2.1.7 Positive Semidefinite (PSD) Matrices

Definition. A symmetric matrix ARn×n is positive semidefinite (PSD) if

xAx0xRn

(denoted as A0)

Key Properties:

  • All eigenvalues of A are nonnegative: λi0.
  • PSD matrices are symmetric, hence diagonalizable by orthogonal matrices.
  • If A0, then A=BB for some BRn×n (Cholesky factorization or matrix square root).
  • The set of PSD matrices forms a convex cone.

2.1.8 Gradient and Hessian of Multivariate Functions

Gradient: For f:RnR, the gradient (direction of steepest increase) is

f(x)=[fx1fxn]Rn

Hessian: The Hessian (local curvature) is the matrix of second-order partial derivatives:

2f(x)=[2fx122fx1xn2fxnx12fxn2]Rn×n

Quadratic Example: Let f(x)=xAx+bx+c, with ARn×n symmetric, bRn, cR. Then

f(x)=2Ax+b,2f(x)=2A

2.2 Lines, Line Segments, and Rays

Let x,yRn and define the direction vector d:=yx.

Line through x and y:

x,y={x+θdθR}

Line segment from x to y:

[x,y]={x+θdθ[0,1]}={(1θ)x+θyθ[0,1]}

Ray starting at x going through y:

xy={x+θdθ0}

2.3 Definition of Convex Sets

Definition (Convex Set). A set CRn is convex if for any x,yC and any θ[0,1],

z=(1θ)x+θyC

z is called a convex combination of x and y.

Intuition: Every line segment between two points in C lies entirely inside C.

Examples of convex sets: line, line segment, ray, single point, empty set, subspace, affine space, hyperplane, halfspace, convex cone, positive semidefinite cone, Euclidean balls and ellipsoids, norm balls and norm cones, polyhedra.

Why convex sets are important: In convex optimization, the feasible region of a convex optimization problem is a convex set. This property guarantees that any local optimum is also a global optimum.


2.4 Subspaces and Affine Sets

2.4.1 Subspace

Definition (Subspace). A subset WRn is a subspace if:

  1. 0W (contains the zero vector)
  2. x,yWx+yW (closed under addition)
  3. xW,αRαxW (closed under scaling)

Examples:

  • span{v1,,vk}={i=1kαivi|αiR} for v1,,vkRn
  • null(M)={xRnMx=0} for MRm×n
  • Any line through the origin in Rn

2.4.2 Affine Space

Definition (Affine Space). A subset ARn is an affine space if for any x,yA and any θR,

(1θ)x+θyA

Intuition: Affine spaces are like subspaces, but they need not pass through the origin. They are "shifted subspaces."

Examples:

  • Any line in R2 (not necessarily through the origin)
  • Any plane in R3 (not necessarily through the origin)
  • More generally: A=x0+W, where x0Rn and W is a subspace

2.4.3 Affine Space vs. Hyperplane

  • A hyperplane in Rn has dimension n1.
  • Affine spaces can have any dimension 0,1,,n.

Examples:

  1. A line in R3: A={(θ,1,0)θR}, dim(A)=1. Not a hyperplane (hyperplanes in R3 have dimension 2).
  2. A plane in R4: A={x0+θ1v1+θ2v2θ1,θ2R}, dim(A)=2. Not a hyperplane (hyperplanes in R4 have dimension 3).

2.5 Hyperplanes and Halfspaces

Definition (Hyperplane). A hyperplane in Rn is a set of the form

H={xRnax=b}

where aRn{0} and bR.

Intuition:

  • a is a normal vector to the hyperplane.
  • b shifts the hyperplane; if b=0, H is a subspace.
  • Hyperplanes have dimension n1.

Examples:

  • In R2: a line (e.g., x1+2x2=3)
  • In R3: a plane (e.g., 2x1x2+x3=5)

Definition (Halfspace). A closed halfspace is defined by

Ha,b={xRnaxb},a0

It contains all points on one side of the hyperplane H.

A hyperplane divides Rn into two halfspaces.


2.6 Convex Cones and Polar Cones

Definition (Cone). A set KRn is a cone if

xK,α0αxK

Definition (Convex Cone). A set KRn is a convex cone if it is a cone and is convex, equivalently:

x,yK,α,β0αx+βyK

αx+βy is called a conic combination of x,y with α,β0.

Example: K={(x1,x2)R2x10,x20} (the first quadrant).

Definition (Polar Cone). The polar cone of K is

K={yRnyx0xK}

It contains all vectors forming a nonpositive inner product with all elements of K.

Example: The polar of the first quadrant (R+2) is the third quadrant (R2).

Properties of Polar Cones:

  • K is always a closed, convex cone, even if K is not convex.
  • If K is closed and convex, (K)=K (bipolar property).
  • K1K2K2K1.

Examples:

  • If K=R+n, then K=Rn.
  • If K is a subspace L, then K=L (the orthogonal complement).

Types of Combinations

TypeFormCoefficient constraintsResulting set
Linear combinationi=1kθixiθiRSubspace (span{xi})
Conic combinationi=1kθixiθi0Convex cone
Affine combinationi=1kθixii=1kθi=1Affine space
Convex combinationi=1kθixiθi0,i=1kθi=1Convex set

2.7 Tangent Cone and Normal Cone

Let CRn be a set and xC.

Definition (Tangent Cone). The tangent cone of C at x is

TC(x)={dRntk0,dkd with x+tkdkC}
  • Represents directions in which you can "leave x while staying in C infinitesimally."
  • Describes feasible directions from x.

Definition (Normal Cone). The normal cone of C at x is

NC(x)={yRnyd0dTC(x)}
  • Contains vectors/directions perpendicular (obtuse) to the tangent cone at x.
  • Often used in optimality conditions and convex analysis.

2.8 Separating Hyperplane Theorem

Theorem (Separating Hyperplane). Let C,DRn be two nonempty, disjoint, convex sets. Then there exists a nonzero vector aRn and a scalar bR such that

axbayfor all xC,yD

This means there exists a hyperplane {xax=b} that separates C and D (i.e., CHa,b, DHa,b+).

Simple interpretation: There is a hyperplane that can separate any point outside a convex set from the set itself.

Examples:

  1. The nonnegative orthant in Rn: R+n={xRnxi0,i=1,,n}

    • Separating hyperplane: a=e1, b=0
  2. Positive semidefinite matrices S+n:

    • Separating hyperplane: a=q1q1, b=0

Note: Convexity is required; non-convex sets may not be separable.


2.9 Generalized Inequalities

Let KRn be a proper cone (a convex cone that is closed, pointed, and has nonempty interior).

Definition (Generalized Inequality). For x,yRn:

xKyyxK

Strict version:

xKyyxint(K)

Examples:

  • K=R+n: xKyxiyi for all i (componentwise inequality).
  • K=S+n (PSD cone): XKYYX0.

Note: We can have xKy and yKx at the same time (incomparable). Generalized inequalities are partial orders in general.


2.10 Minimum and Minimal Elements

Let XRn and K be a proper cone.

Definition (Minimum Element). xX is the minimum if

xKxxX

It is the smallest element; unique if it exists. Equivalently, Xx+K.

Definition (Minimal Element). xX is a minimal if there is no yX such that

yKxandyx

There may be multiple minimal elements. Equivalently, (xK)X={x}.


2.11 Dual Cone

Definition (Dual Cone). Let KRn be a cone. The dual cone of K is defined as

K={yRnyx0xK}

Properties:

  • K is always a closed, convex cone.
  • If K is closed and convex, (K)=K (bipolar property).
  • K1K2K2K1.
  • Relation to polar cone: K=K.

Examples:

  • K=R+nK=R+n (self-dual).
  • K=S+n (PSD cone) K=S+n (self-dual).

2.12 Positive Semidefinite (PSD) Cone

  • Sn={XRn×nX=X} is the set of n×n symmetric matrices.
  • S+n={XSnX0} is the positive semidefinite cone.
    • zXz0 for all zRn.
    • Equivalently, all eigenvalues 0.
  • S++n={XSnX0} is the positive definite cone.

2.13 Operations That Preserve Convexity

Practical methods for establishing convexity of a set C:

  1. Apply the definition of convexity:

    x1,x2C,0θ1θx1+(1θ)x2C
  2. Show that C is obtained from simple convex sets (hyperplanes, halfspaces, norm balls, etc.) by operations that preserve convexity:

    • Intersection
    • Affine function
    • Perspective function
    • Linear-fractional function

2.13.1 Intersection of Convex Sets

Property. The intersection of any (finite or infinite) collection of convex sets is convex:

C=iICi is convex if each Ci is convex.

Example:

C1={xx10},C2={xx20}C1C2=R+2

Interpretation: Adding constraints (intersections) preserves convexity.

2.13.2 Affine Mapping

Property. If C is convex and A is a matrix, b a vector, then the image under an affine map is convex:

{yy=Ax+b,xC} is convex.

Example: Scaling and translating a convex set preserves convexity:

C={xx21}AC+b={yy=Ax+b,x21} is convex.

Interpretation: Linear transformations and translations do not "bend" the set — they preserve convexity.