Jump to content
Main menu
Main menu
move to sidebar
hide
Navigation
Main page
Recent changes
Random page
freem
Search
Search
Appearance
Create account
Log in
Personal tools
Create account
Log in
Pages for logged out editors
learn more
Contributions
Talk
Editing
Openai/693e3ce6-229c-8008-97dc-ab720cb1f95a
(section)
Add languages
Page
Discussion
English
Read
Edit
Edit source
View history
Tools
Tools
move to sidebar
hide
Actions
Read
Edit
Edit source
View history
General
What links here
Related changes
Special pages
Page information
Appearance
move to sidebar
hide
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
=== Assistant: Yes — you can make the margin γ\gammaγ arbitrarily small even 0 for an ℓ1\ell1ℓ1-regularized PageRank instance by choo… === Yes — you can make the margin γ\gammaγ arbitrarily small (even 0) for an ℓ1\ell_1ℓ1-regularized PageRank instance by choosing (α,ρ)(\alpha,\rho)(α,ρ) and a “bad” graph so that you sit right at (or extremely near) a support-change breakpoint of the regularization path. This makes ∣∇f(x⋆)i∣|\nabla f(x^\star)_i|∣∇f(x⋆)i∣ almost equal to the threshold λi=ραdi\lambda_i=\rho\alpha\sqrt{d_i}λi=ραdi for some inactive coordinate, hence γ=λi−∣∇f(x⋆)i∣\gamma=\lambda_i-|\nabla f(x^\star)_i|γ=λi−∣∇f(x⋆)i∣ becomes tiny. (This is exactly what strict complementarity rules out.) Below is a fully explicit, connected graph example where you can compute γ\gammaγ in closed form and force γ<ρ\gamma < \sqrt{\rho}γ<ρ by a wide margin. ==== Take a star graph with: ==== * one center node ccc connected to mmm leaf nodes ℓ1,…,ℓm\ell_1,\dots,\ell_mℓ1,…,ℓm, * seed s=ecs=e_cs=ec, * teleportation α∈(0,1)\alpha\in(0,1)α∈(0,1), * regularization parameter ρ>0\rho>0ρ>0. Degrees: * dc=md_c=mdc=m, * dℓ=1d_{\ell}=1dℓ=1 for each leaf. For the ℓ1\ell_1ℓ1-regularized PPR objective (your equation (3)), the weighted ℓ1\ell_1ℓ1 penalty is * λc=ραm\lambda_c = \rho\alpha\sqrt{m}λc=ραm at the center, * λℓ=ρα\lambda_{\ell} = \rho\alphaλℓ=ρα at each leaf. open-problem-fountoulakis22a Because of symmetry, in a certain parameter regime the optimizer has the form: * xc⋆>0x_c^\star > 0xc⋆>0, * xℓ⋆=0x_{\ell}^\star = 0xℓ⋆=0 for every leaf. ===== In this regime (all leaves are zero), the center coordinate satisfies ===== (Qx⋆−αD−1/2s)c+λc=0,(Qx^\star - \alpha D^{-1/2}s)_c + \lambda_c = 0,(Qx⋆−αD−1/2s)c+λc=0, which gives xc⋆ = 2α(1−ρm)(1+α)m.x_c^\star \;=\; \frac{2\alpha(1-\rho m)}{(1+\alpha)\sqrt{m}}.xc⋆=(1+α)m2α(1−ρm). This is positive as long as ρ<1/m\rho < 1/mρ<1/m. ===== A leaf ℓ\ellℓ stays at zero iff its KKT inequality holds: ===== ∣(Qx⋆−αD−1/2s)ℓ∣≤λℓ=ρα.|(Qx^\star - \alpha D^{-1/2}s)_\ell| \le \lambda_\ell = \rho\alpha.∣(Qx⋆−αD−1/2s)ℓ∣≤λℓ=ρα. For the star, this inequality reduces to a very clean threshold: ρ ≥ ρ0:=1−α2m.\rho \;\ge\; \rho_0 := \frac{1-\alpha}{2m}.ρ≥ρ0:=2m1−α. So for ρ∈[1−α2m, 1m) \boxed{\;\rho \in \Big[\frac{1-\alpha}{2m},\;\frac{1}{m}\Big)\;}ρ∈[2m1−α,m1) the unique optimum has support S⋆={c}S^\star=\{c\}S⋆={c} (so ∣S⋆∣=1≤1/ρ|S^\star|=1\le 1/\rho∣S⋆∣=1≤1/ρ automatically). ===== All inactive coordinates are leaves and are symmetric, so the margin is attained on any leaf: ===== γ=λℓ−∣(Qx⋆−αD−1/2s)ℓ∣.\gamma = \lambda_\ell - |(Qx^\star - \alpha D^{-1/2}s)_\ell|.γ=λℓ−∣(Qx⋆−αD−1/2s)ℓ∣. A direct substitution yields the closed form: γ = α1+α(2ρ−1−αm) = 2α1+α(ρ−ρ0). \boxed{\; \gamma \;=\; \frac{\alpha}{1+\alpha}\Big(2\rho - \frac{1-\alpha}{m}\Big) \;=\; \frac{2\alpha}{1+\alpha}\big(\rho-\rho_0\big). \;}γ=1+αα(2ρ−m1−α)=1+α2α(ρ−ρ0). Two key takeaways: * At the breakpoint ρ=ρ0\rho=\rho_0ρ=ρ0: γ=0⇒γ<ρ (trivially).\gamma = 0 \quad\Rightarrow\quad \gamma < \sqrt{\rho}\ \text{(trivially)}.γ=0⇒γ<ρ (trivially). * If you choose ρ=ρ0+δ\rho=\rho_0+\deltaρ=ρ0+δ with δ>0\delta>0δ>0 tiny, then γ=2α1+αδ,\gamma = \frac{2\alpha}{1+\alpha}\delta,γ=1+α2αδ, so you can make γ\gammaγ as small as you want, hence certainly <ρ<\sqrt{\rho}<ρ. ==== Pick typical PageRank teleportation, say: ==== * α=0.15\alpha = 0.15α=0.15, * star size m=100m = 100m=100. Then ρ0=1−α2m=0.85200=0.00425.\rho_0 = \frac{1-\alpha}{2m}=\frac{0.85}{200}=0.00425.ρ0=2m1−α=2000.85=0.00425. Take ρ=0.004251\rho = 0.004251ρ=0.004251 (so δ=10−6\delta = 10^{-6}δ=10−6). Then γ=2α1+αδ=0.31.15⋅10−6≈2.61×10−7.\gamma = \frac{2\alpha}{1+\alpha}\delta = \frac{0.3}{1.15}\cdot 10^{-6} \approx 2.61\times 10^{-7}.γ=1+α2αδ=1.150.3⋅10−6≈2.61×10−7. But ρ≈0.004251≈6.52×10−2.\sqrt{\rho} \approx \sqrt{0.004251} \approx 6.52\times 10^{-2}.ρ≈0.004251≈6.52×10−2. So γ≈2.6⋅10−7 ≪ 6.5⋅10−2 = ρ.\gamma \approx 2.6\cdot 10^{-7} \;\ll\; 6.5\cdot 10^{-2} \;=\; \sqrt{\rho}.γ≈2.6⋅10−7≪6.5⋅10−2=ρ. This is a clean “bad” case: connected graph, ∣S⋆∣=1≤1/ρ|S^\star|=1 \le 1/\rho∣S⋆∣=1≤1/ρ, yet γ\gammaγ is orders of magnitude smaller than ρ\sqrt{\rho}ρ because ρ\rhoρ is chosen near a support-change breakpoint. ===== Step size ttt, momentum β\betaβ, etc. do not change γ\gammaγ: γ\gammaγ is a property of the problem instance (graph + seed + α,ρ\alpha,\rhoα,ρ). What the algorithm parameters can do is affect how long you linger in the transient, but they don’t alter the KKT slack at the true minimizer. ===== If you want, I can also give a path graph construction where ρ0\rho_0ρ0 is small even with moderate mmm, but the star example is the simplest explicit one where you can compute γ(ρ)\gamma(\rho)γ(ρ) in closed form.
Summary:
Please note that all contributions to freem are considered to be released under the Creative Commons Attribution-ShareAlike 4.0 (see
Freem:Copyrights
for details). If you do not want your writing to be edited mercilessly and redistributed at will, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource.
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)