Adding a convex function preserves strong convexity.
Convexity of the affine sum term in F_k.
Quadratic lower bound at a minimizer of F_k.
Lemma 1.3.1.
Let {α_k}_{k≥0} be positive numbers with α_0 ∈ (0,1] and α_{k+1}^2 ≤ A_{k+1} for all
k ≥ 0, where A_{k+1} = ∑_{i=0}^{k+1} α_i (equation (3.6)). Assume (R_k) holds for some
k ≥ 0. Define τ_k = α_{k+1} / A_{k+1} (equation (tau_def)) and update
x_{k+1} = τ_k z_k + (1-τ_k) y_k, y_{k+1} = T_Q(x_{k+1}) (equation (3.7)).
Then (R_{k+1}) holds.
Lemma 1.3.2.
For k ≥ 0, define α_k = (k+1)/2 (equation (alpha_k_def)). Then
τ_k = 2/(k+3) and A_k = (k+1)(k+2)/4 (equation (3.10)), and the conditions (3.6)
(α_0 ∈ (0,1] and α_{k+1}^2 ≤ A_{k+1}) are satisfied.
Algorithm 1.3.1.
Assume f satisfies (grad_lipschitz) on Q, and let d be a prox-function for Q with
parameter σ. Initialize at a prox-center x0 of Q and set y0 = T_Q(x0). For k ≥ 0,
iterate:
- compute
f(x_k)and∇ f(x_k); - compute
y_k = T_Q(x_k); - compute
z_kas an argmin of(L/σ) d(x) + ∑_{i=0}^k (i+1)/2 [ f(x_i) + ⟪∇ f(x_i), x - x_i⟫ ]overx ∈ Q(equation (alg311:zk)); - set
x_{k+1} = (2/(k+3)) z_k + ((k+1)/(k+3)) y_k(equation (3.11)).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Definition 1.3.6.
Define a modified selection rule: given y_{k-1}, set \tilde y_k = T_Q(x_k) and choose
y_k ∈ argmin { f x : x ∈ {y_{k-1}, x_k, \tilde y_k} } (equation (3.14)).
Equations
- One or more equations did not get rendered due to their size.
Instances For
Proposition 1.3.5.
If {y_k} is chosen by the rule (3.14), then
f(y_k) ≤ f(y_{k-1}) ≤ ⋯ ≤ f(x_0) (equation (3.15)).
The infimum defining ψ_k is bounded above by evaluation at any feasible point.
Upper bound ψ_k by evaluating the affine model at a point x★ ∈ Q.
Convert R_k and an upper bound on ψ_k into a gap estimate.
The invariant R_k holds for all iterates of the optimal scheme.
Theorem 1.3.1.
Let sequences {x_k} ⊆ Q and {y_k} ⊆ Q be generated by Algorithm 1.3.1. Then for any k ≥ 0,
`((k+1)(k+2)/4) f(y_k) ≤ min_{x ∈ Q} { (L/σ) d x + ∑_{i=0}^k (i+1)/2 [ f(x_i)
- ⟪∇ f(x_i), x - x_i⟫ ] }
(equation (3.12)). Ifx* ∈ argmin_{x ∈ Q} f x, thenf(y_k) - f(x*) ≤ 4 L d(x*) / (σ (k+1)(k+2))` (equation (3.13)).