A saddle point with bounded images forces the minimax equality by showing both sides are zero.
A VI solution yields a saddle point for the gap payoff, hence the minimax equality.
Reformulate the inner max by flipping the sign.
Proposition 1.4.3.3.
A dual max-structure representation of (4.19) comes from
min_{w ∈ Q} max_{v ∈ Q} ⟪B(v), w - v⟫_1 = max_{v ∈ Q} min_{w ∈ Q} ⟪B(v), w - v⟫_1 = - min_{v ∈ Q} max_{w ∈ Q} ⟪B(v), v - w⟫_1.
Equivalently, take E1 = E2 = E, Q1 = Q2 = Q, d1 = d2 = d, A = ℬ,
fhat(x) = ⟪b, x⟫_1 + ⟪ℬ x, x⟫_1, and phihat(u) = ⟪b, u⟫_1. Then
f_μ(x) = max_{u ∈ Q} {⟪ℬ x, u⟫_1 - μ d(u) - ⟪b, u⟫_1}.
In this dual representation M = ‖ℬ‖_{1,2}, and the complexity bound corresponding to (4.21)
is N ≤ (4 D1 ‖ℬ‖_{1,2})/(σ1 ε) + sqrt(D1 ‖ℬ‖_{1,2}/(σ1 ε)).
Definition 1.4.4.1.
Let a_j ∈ E1^* and b^{(j)} ∈ ℝ for j = 1, ..., m. Consider the piecewise linear
minimization problem min_{x ∈ Q1} f(x), where
f(x) := max_{1 ≤ j ≤ m} |⟪a_j, x⟫_1 - b^{(j)}| (4.22).
Equations
Instances For
Proposition 1.4.4.1.
In the setting of Definition 1.4.4.1, take E1 = ℝ^n with Euclidean norm, the prox-function
d1(x) = (1/2) ‖x‖_1^2, and rbar = max_{x ∈ Q1} ‖x‖_1. Let E2 = ℝ^{2m} with simplex
Δ_{2m} and define hatA = (A; -A) and hatb = (b; -b) from rows a_j and scalars
b^{(j)}. Then f(x) = max_{u ∈ Δ_{2m}} {⟪hatA x, u⟫_2 - ⟪hatb, u⟫_2} (4.19). With the entropy
prox-function d2(u) = ln(2m) + ∑ u^{(j)} ln u^{(j)}, we have σ1 = σ2 = 1,
D1 = (1/2) rbar^2, D2 = ln(2m), and ‖hatA‖_{1,2} = max_{1≤j≤m} ‖a_j‖_{1,*}. Therefore the
complexity estimate (4.4) gives
N ≤ 2*sqrt 2 rbar (max_{1≤j≤m} ‖a_j‖_{1,*}) sqrt(ln(2m)) / ε iterations for (4.22), and the smooth
approximation has the explicit form
bar f_μ(x) = μ ln((1/m) ∑_{j=1}^m ξ((1/μ) [⟪a_j, x⟫_1 + b^{(j)}])) with
ξ(τ) = (1/2) (e^τ + e^{-τ}) (4.22_smooth).
Definition 1.4.4.2.
Let a_j ∈ E1^* and b^{(j)} ∈ ℝ for j = 1, ..., m. Consider minimizing the sum of
absolute values (4.23):
min_{x ∈ Q1} f(x) with f(x) := ∑_{j=1}^m |⟪a_j, x⟫_1 - b^{(j)}|.
Equations
Instances For
Unfold the smoothed max for the sum-of-abs objective into a separable quadratic sum.
Split the separable quadratic supremum over Q2 into coordinatewise suprema.
Express the quadratic supremum on [-1,1] via the scaled ψμ on [0,1].
A crude operator-norm bound using the supremum norm on Fin m → ℝ.
Proposition 1.4.4.2.
In the setting of Definition 1.4.4.2, let A be the matrix with rows a_j and take
E2 = ℝ^m, Q2 = {u : |u^{(j)}| ≤ 1}, and
d2(u) = (1/2) ‖u‖_2^2 = (1/2) ∑ ‖a_j‖_{1,*} (u^{(j)})^2. Then the smoothed approximation
has the form
f_μ(x) = max_{u ∈ Q2} {⟪A x - b, u⟫_2 - μ d2(u)} = ∑_{j=1}^m ‖a_j‖_{1,*} ψ_μ(|⟪a_j,x⟫_1 - b^{(j)}| / ‖a_j‖_{1,*}),
where ψ_μ is the function in (4.17). Moreover, with D = ∑ ‖a_j‖_{1,*} we have
D2 = (1/2) D, σ2 = 1, and ‖A‖_{1,2} ≤ D^{1/2}. Therefore, in the case M = 0,
N ≤ (2/ε) * sqrt(2 D1/σ1) * ∑ ‖a_j‖_{1,*} (equation (4.24)).