Documentation

Papers.SmoothMinimization_Nesterov_2004.Sections.section04_part3

theorem algorithm311_duality_gap_bound {E1 : Type u_1} {E2 : Type u_2} [NormedAddCommGroup E1] [NormedSpace E1] [FiniteDimensional E1] [NormedAddCommGroup E2] [NormedSpace E2] [FiniteDimensional E2] (Q1 : Set E1) (Q2 : Set E2) (A : E1 →L[] E2 →L[] ) (fhat : E1) (phihat : E2) (d1 : E1) (d2 : E2) (σ1 σ2 M D1 : ) (xSeq ySeq : Q1) ( : E1E2) (N : ) (hatu : Q2) :
have A' := { toFun := fun (x : E1) => (A x), map_add' := , map_smul' := }; have D2 := sSup (d2 '' Q2); have μ := 2 * OperatorNormDef A' / (N + 1) * (D1 / (σ1 * σ2 * D2)); have fbar := SmoothedObjective Q2 A phihat d2 μ fhat; have f := fun (x : E1) => fhat x + sSup ((fun (u : E2) => (A x) u - phihat u) '' Q2); have φ := AdjointFormPotential Q1 Q2 A fhat phihat; LipschitzOnWith M.toNNReal (fun (x : E1) => fderiv fhat x) Q10 < σ10 < σ2StrongConvexOn Q1 σ1 d1StrongConvexOn Q2 σ2 d2IsProxDiameterBound Q1 d1 D1OptimalSchemeAlgorithm Q1 fbar d1 (M + 1 / (μ * σ2) * OperatorNormDef A' ^ 2) σ1 xSeq ySeq(∀ (x : E1), IsSmoothedMaximizer Q2 A phihat d2 μ x ( x))hatu = iFinset.range (N + 1), (2 * (i + 1) / ((N + 1) * (N + 2))) (xSeq i)ConvexOn Q1 fhatDifferentiableOn fhat Q1ConvexOn Q2 phihat(∀ (x : E1), fderiv (SmoothedMaxFunction Q2 A phihat d2 μ) x = LinearMap.toContinuousLinearMap ((AdjointOperator A') ( x)))0 M0 D10 < M + 1 / (μ * σ2) * OperatorNormDef A' ^ 2IsClosed Q1IsOpen Q1ConvexOn Q1 fbarDifferentiableOn fbar Q10 μ(∀ uQ2, 0 d2 u)(∀ (x : E1), BddAbove ((fun (u : E2) => (A x) u - phihat u) '' Q2))BddAbove (d2 '' Q2)Q2.Nonempty(∀ (u : E2), BddBelow ((fun (x : E1) => (A x) u + fhat x) '' Q1))0 f (ySeq N) - φ hatu f (ySeq N) - φ hatu 4 * OperatorNormDef A' / (N + 1) * (D1 * D2 / (σ1 * σ2)) + 4 * M * D1 / (σ1 * (N + 1) ^ 2) ε > 0, N 4 * OperatorNormDef A' / ε * (D1 * D2 / (σ1 * σ2)) + 2 * (M * D1 / (σ1 * ε)) → f (ySeq N) - φ hatu ε

Theorem 1.4.1. Assume the structural model (2.2) and that fhat is differentiable with M-Lipschitz gradient on Q1. Let d1 be a prox-function of Q1 with parameter σ1 > 0 and prox-diameter D1 as in (4.3_D1). Let d2 be a prox-function of Q2 with strong convexity parameter σ2 > 0 and define D2 := max_{u ∈ Q2} d2 u as in Proposition 2.7. Apply Algorithm 3.11 to the smoothed problem (4.1) with μ = μ(N) = (2‖A‖_{1,2}/(N+1)) * sqrt(D1/(σ1 σ2 D2)) (equation (thm3_muN)). After N iterations define \hat x := y_N and \hat u := ∑_{i=0}^N 2(i+1)/((N+1)(N+2)) u_μ(x_i) (4.2). Then 0 ≤ f(\hat x) - φ(\hat u) and the duality-gap bound (4.3) holds, and consequently the ε-solution bound (4.4) holds.