Gaussian Soap Bubble

  • In high dimensions, gaussian distribution are indistinguishable from uniform distribution on the unit sphere.

  • To see how low dimensional analogy will extend to high dimension, normalization gaussian and then checking the effects can be good technique.

So introduce this problem in mathematical form:

Let's have d-dimensional standard guassian RV. X=[x1,x2,...xd]X= [x_1, x_2, ...x_d]where xiN(0,1)x_i \sim \mathcal{N}(0,1).

Now our intuition will say that probability distribution of XXwill have most of its mass around mean = 0, which is the case in 1 and 2 dimensional gaussian. But in high dimension it's not like this. Not of the mass i.e if many points were to sample from XX, most of them will lie at a certain radius determined by dd.

Let consider the euclidean norm of RV XXand call it Y=X=ixi2Y = \sqrt{X} = \sqrt{\sum_i x_i^2}, now we know that YχkY \sim \chi_kwhere kkis dimensian of standard gaussian RV XX.

Now

E[Y]=μY=2Γ(k+12)Γ(k2)limkinfk12E[Y] = \mu_Y = \sqrt{2}\frac{ \Gamma(\frac{k+1}{2})}{ \Gamma(\frac{k}{2})} \\ \approx \lim_{k \rightarrow \inf} \sqrt{k-\frac{1}{2}}

which means if we have 10000 dimensional standard normal, then the mean of the norm is around 100. This means that most of the samples lie on the sphere of radius 100.

Histrogram of norm of 1000 samples of 10000 dimensional standard normal random variable

Last updated