Processing math: 57%

Hypergeometric Distribution

The hypergeometric random variable with parameters (M,n,N) counts the number of “good “objects in a sample of size N chosen without replacement from a population of M objects where n is the number of “good “objects in the total population.

p(k;N,n,M)=(nk)(MnNk)(MN)N(Mn)kmin

where (defining m=M-n )

\begin{eqnarray*} g\left(N,n,M\right) & = & m^{3}-m^{5}+3m^{2}n-6m^{3}n+m^{4}n+3mn^{2}\\ & & -12m^{2}n^{2}+8m^{3}n^{2}+n^{3}-6mn^{3}+8m^{2}n^{3}\\ & & +mn^{4}-n^{5}-6m^{3}N+6m^{4}N+18m^{2}nN\\ & & -6m^{3}nN+18mn^{2}N-24m^{2}n^{2}N-6n^{3}N\\ & & -6mn^{3}N+6n^{4}N+6m^{2}N^{2}-6m^{3}N^{2}-24mnN^{2}\\ & & +12m^{2}nN^{2}+6n^{2}N^{2}+12mn^{2}N^{2}-6n^{3}N^{2}.\end{eqnarray*}

Implementation: scipy.stats.hypergeom