Cheatsheet: Convexity, Smoothness & Strong Convexity Equivalences
1 Convexity Equivalences
For a twice-differentiable
- Basic Convexity:
, . - First-Order Condition (FOC):
, . - Gradient Monotonicity (GM):
, . - Second-Order Condition (SOC):
(Hessian PSD).
Convexity FOC
(
Take
(
Multiply by
FOC GM
(
(
Subtract the tangent at
By GM, integrand is
GM SOC
(
(
2 L-Smoothness Equivalences
For a twice-differentiable
- Lipschitz Continuous Gradient:
, . - Alternative (Convexity):
is convex. - Descent Lemma:
. - Hessian Bound:
, .
Lipschitz Gradient Hessian Bound
(
Taking norms and using Lipschitz:
(
Hessian Bound Alternative Definition
Alternative Descent Lemma
If
Rearrange:
Direct: Lipschitz Gradient Descent Lemma
By FTC:
Apply Cauchy-Schwarz and Lipschitz:
3 Gradient Descent Convergence
Gradient descent update:
3.1 Smooth Functions (L-smooth, step size )
Descent Lemma:
Convergence to stationary point (
3.2 Smooth & Convex Functions ( )
Convergence rate (
Derivation sketch: From convexity FOC,
Telescoping sum yields the bound.
3.3 Smooth & Strongly Convex Functions ( , )
Linear convergence:
Key inequalities:
- Descent Lemma:
- Gradient Lower Bound (from
-strong convexity): - Substitute:
- Condition number
; smaller gives faster convergence.
4 Strong Convexity Equivalences
For a twice-differentiable
- Alternative Definition:
is convex. - Strong FOC:
. - Strong GM:
. - Strong SOC:
, .
Alternative Strong SOC
Define
Alternative Strong FOC
(
Substitute
Rearrange quadratic terms:
(
By strong FOC, the bracket
Strong FOC Strong GM
(
Add and cancel
(
Add/subtract
Apply strong GM with
Divide by