Proposition 1.4.2.
In the M = 0 case of Theorem 1.4.1, an optimal (order-wise) choice of the smoothing parameter μ,
the corresponding Lipschitz constant L_μ, and the iteration count N as a function of accuracy
ε > 0 is given by
μ = ε / (2 D2), L_μ = (D2 / (2 σ2)) * (‖A‖_{1,2}^2 / ε), and an integer N with
(4 ‖A‖_{1,2} / ε) * sqrt(D1 D2 / (σ1 σ2)) ≤ N + 1 (equation (4.8), up to rounding).
Definition 1.4.1.1.
The book's standard simplex agrees with mathlib's stdSimplex on Fin n → ℝ.
Definition 1.4.1.1.
The matrix-game saddle-point value
min_{x ∈ Δ_n} max_{u ∈ Δ_m} {⟪A x, u⟫_2 + ⟪c, x⟫_1 + ⟪b, u⟫_2} (4.9).
Equations
- One or more equations did not get rendered due to their size.
Instances For
The standard simplex is nonempty when n > 0.
Weak duality: the dual value is bounded above by the saddle-point value.
Definition 1.4.1.2.
The saddle-point value (4.9) equals the primal minimization in (4.10) and upper-bounds the
dual maximization in (4.10_dual) over the standard simplices Δ_n and Δ_m.
A coordinate of a point in the standard simplex lies in [0, 1].
On [0,1], x ↦ x * log x is 1-strongly convex.
The entropy sum x ↦ ∑ i, x i * log (x i) is 1-strongly convex on the standard simplex.
Adding a constant to a strongly convex function preserves strong convexity.
The entropy prox-function has prox-diameter bounded by log n on the standard simplex.
Lemma 1.4.1.1.
In the setting of Definition 1.4.1.2, with ℓ1 norms
‖x‖₁ = ∑ i |x i| and ‖u‖₁ = ∑ j |u j|, and entropy prox-functions on Δ_n and Δ_m,
d1(x) = ln n + ∑ i x^{(i)} ln x^{(i)} and d2(u) = ln m + ∑ j u^{(j)} ln u^{(j)},
the strong convexity parameters satisfy σ1 = σ2 = 1 and the prox-diameters are
D1 = ln n and D2 = ln m.