Inverse-gamma distribution

From Wikipedia, the free encyclopedia

Inverse-gamma

[Plots: probability density function; cumulative distribution function]

Parameters: $\alpha > 0$ shape (real); $\beta > 0$ scale (real)
Support: $x \in (0, \infty)$
PDF: ${\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}x^{-\alpha -1}\exp \left(-{\frac {\beta }{x}}\right)$
CDF: ${\frac {\Gamma (\alpha ,\beta /x)}{\Gamma (\alpha )}}$
Mean: ${\frac {\beta }{\alpha -1}}$ for $\alpha > 1$
Mode: ${\frac {\beta }{\alpha +1}}$
Variance: ${\frac {\beta ^{2}}{(\alpha -1)^{2}(\alpha -2)}}$ for $\alpha > 2$
Skewness: ${\frac {4{\sqrt {\alpha -2}}}{\alpha -3}}$ for $\alpha > 3$
Excess kurtosis: ${\frac {6(5\alpha -11)}{(\alpha -3)(\alpha -4)}}$ for $\alpha > 4$
Entropy: $\alpha +\ln(\beta \Gamma (\alpha ))-(1+\alpha )\psi (\alpha )$ (see digamma function)
MGF: Does not exist.
CF: ${\frac {2\left(-i\beta t\right)^{\frac {\alpha }{2}}}{\Gamma (\alpha )}}K_{\alpha }\left({\sqrt {-4i\beta t}}\right)$

In probability theory and statistics, the inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is the distribution of the reciprocal of a variable distributed according to the gamma distribution. Perhaps the chief use of the inverse gamma distribution is in Bayesian statistics, where the distribution arises as the marginal posterior distribution for the unknown variance of a normal distribution, if an uninformative prior is used, and as an analytically tractable conjugate prior, if an informative prior is required.

However, it is common among Bayesians to consider an alternative parametrization of the normal distribution in terms of the precision, defined as the reciprocal of the variance, which allows the gamma distribution to be used directly as a conjugate prior. Other Bayesians prefer to parametrize the inverse gamma distribution differently, as a scaled inverse chi-squared distribution.
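
The conjugate-prior use mentioned above can be illustrated with a minimal sketch. It assumes the simplest textbook setting, which is not spelled out in this article: observations are i.i.d. normal with a known mean `mu`, and the unknown variance has an Inv-Gamma(alpha, beta) prior. Under those assumptions the posterior is again inverse gamma, with shape alpha + n/2 and scale beta plus half the sum of squared deviations. The function name and the data are hypothetical.

```python
def update_inverse_gamma(alpha, beta, data, mu):
    """Posterior (shape, scale) for the variance of a normal with known mean mu.

    Assumes a prior sigma^2 ~ Inv-Gamma(alpha, beta) and i.i.d. observations
    x_i ~ Normal(mu, sigma^2). The known-mean setting is an assumption of
    this sketch, not something stated in the article.
    """
    n = len(data)
    sum_sq = sum((x - mu) ** 2 for x in data)
    return alpha + n / 2.0, beta + sum_sq / 2.0


# Illustrative numbers only.
post_alpha, post_beta = update_inverse_gamma(2.0, 1.0, [1.0, 2.0, 3.0], mu=2.0)
# Posterior mean of the variance, beta / (alpha - 1), valid because post_alpha > 1.
posterior_mean = post_beta / (post_alpha - 1.0)
```

With these inputs the posterior parameters are (3.5, 2.0). Working with the precision instead would lead to the analogous gamma update described in the paragraph above.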

Characterization

Probability density function

The inverse gamma distribution's probability density function is defined over the support $x>0$ as

$f(x;\alpha ,\beta )={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}(1/x)^{\alpha +1}\exp \left(-\beta /x\right)$

with shape parameter $\alpha$ and scale parameter $\beta$.[1] Here $\Gamma (\cdot )$ denotes the gamma function.

Unlike in the rate parametrization of the gamma distribution, which contains a somewhat similar exponential term, $\beta$ here is a scale parameter, because the density function satisfies:

$f(x;\alpha ,\beta )={\frac {f(x/\beta ;\alpha ,1)}{\beta }}$
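
As a rough numerical illustration of the density and of the scale identity, the following sketch evaluates the pdf in log space with the standard library's `math.lgamma`. The function name `invgamma_pdf` and the test point are choices made for this example only.

```python
import math


def invgamma_pdf(x, alpha, beta):
    """Density f(x; alpha, beta) of the inverse gamma distribution."""
    if x <= 0.0:
        return 0.0  # the support is x > 0
    # log f = alpha*ln(beta) - ln Gamma(alpha) - (alpha + 1)*ln(x) - beta/x
    log_f = (alpha * math.log(beta) - math.lgamma(alpha)
             - (alpha + 1.0) * math.log(x) - beta / x)
    return math.exp(log_f)


# Scale property: f(x; alpha, beta) should equal f(x / beta; alpha, 1) / beta.
x, alpha, beta = 2.0, 3.0, 4.0
lhs = invgamma_pdf(x, alpha, beta)
rhs = invgamma_pdf(x / beta, alpha, 1.0) / beta
```

Both sides agree up to floating-point rounding. Evaluating in log space is a design choice that avoids overflow in $\beta^{\alpha}$ and $\Gamma(\alpha)$ for large shape values.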

Cumulative distribution function

The cumulative distribution function is the regularized gamma function

$F(x;\alpha ,\beta )={\frac {\Gamma \left(\alpha ,{\frac {\beta }{x}}\right)}{\Gamma (\alpha )}}=Q\left(\alpha ,{\frac {\beta }{x}}\right)$

where the numerator is the upper incomplete gamma function and the denominator is the gamma function. Many math packages allow direct computation of $Q$, the regularized gamma function.
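
Where such a package is not at hand, $Q$ can be approximated directly. The sketch below is one simple way to do it, not a reference implementation: it computes the lower regularized function $P$ from its power series and returns $Q = 1 - P$. This is adequate for moderate values of $\beta/x$ but loses accuracy or overflows when $\beta/x$ is very large. A library routine such as SciPy's `scipy.special.gammaincc` would normally be preferable. The function names are hypothetical.

```python
import math


def regularized_upper_gamma(a, z, tol=1e-15, max_terms=100_000):
    """Q(a, z) = Gamma(a, z) / Gamma(a), computed as 1 - P(a, z).

    P(a, z) uses the series
        gamma(a, z) = z^a e^{-z} * sum_{n>=0} z^n / (a (a+1) ... (a+n)),
    which is only a sketch-level method: fine for moderate z, poor for huge z.
    """
    if z <= 0.0:
        return 1.0
    term = 1.0 / a          # n = 0 term of the series
    total = term
    for n in range(1, max_terms):
        term *= z / (a + n)
        total += term
        if term < tol * total:
            break
    p = total * math.exp(a * math.log(z) - z - math.lgamma(a))
    return max(0.0, 1.0 - p)  # clamp tiny negative values caused by rounding


def invgamma_cdf(x, alpha, beta):
    """F(x; alpha, beta) = Q(alpha, beta / x) for x > 0, and 0 otherwise."""
    if x <= 0.0:
        return 0.0
    return regularized_upper_gamma(alpha, beta / x)
```

For $\alpha = 1$ the result can be checked by hand, because $Q(1, z) = e^{-z}$ and therefore $F(x; 1, \beta) = e^{-\beta/x}$.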

Moments

For a positive integer $n$ with $\alpha > n$, the n-th moment of the inverse gamma distribution is given by[2]

$\mathrm {E} [X^{n}]={\frac {\beta ^{n}}{(\alpha -1)\cdots (\alpha -n)}}.$
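
The moment formula can be turned into a few lines of code, shown here as a small sketch. It also recovers the variance as the second raw moment minus the squared mean. The formula is valid only when $\alpha > n$, so the hypothetical helper below raises an error otherwise.

```python
def invgamma_raw_moment(n, alpha, beta):
    """E[X^n] = beta^n / ((alpha - 1) ... (alpha - n)), valid only for alpha > n."""
    if alpha <= n:
        raise ValueError("the n-th moment exists only for alpha > n")
    denom = 1.0
    for k in range(1, n + 1):
        denom *= alpha - k
    return beta ** n / denom


alpha, beta = 5.0, 2.0
mean = invgamma_raw_moment(1, alpha, beta)                    # beta / (alpha - 1)
variance = invgamma_raw_moment(2, alpha, beta) - mean ** 2    # second moment minus mean^2
```

For these values the mean is 1/2 and the variance is 1/12, which matches $\beta^{2}/((\alpha-1)^{2}(\alpha-2))$.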

Characteristic function

$K_{\alpha }(\cdot )$ in the expression of the characteristic function is the modified Bessel function of the second kind.

Properties

For $\alpha >0$ and $\beta >0$,

$\mathbb {E} [\ln(X)]=\ln(\beta )-\psi (\alpha )$

$\mathbb {E} [X^{-1}]={\frac {\alpha }{\beta }}$

The information entropy is

${\begin{aligned}\operatorname {H} (X)&=\operatorname {E} [-\ln(p(X))]\\&=\operatorname {E} \left[-\alpha \ln(\beta )+\ln(\Gamma (\alpha ))+(\alpha +1)\ln(X)+{\frac {\beta }{X}}\right]\\&=-\alpha \ln(\beta )+\ln(\Gamma (\alpha ))+(\alpha +1)\ln(\beta )-(\alpha +1)\psi (\alpha )+\alpha \\&=\alpha +\ln(\beta \Gamma (\alpha ))-(\alpha +1)\psi (\alpha ).\end{aligned}}$

where $\psi (\alpha )$ is the digamma function.
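
The two expectations and the entropy can be cross-checked numerically. The sketch below is only a sanity check under stated assumptions: the digamma function is approximated with the standard recurrence plus an asymptotic series (the Python standard library has no digamma), and the expectations are computed by the trapezoidal rule after the substitution $x = e^{u}$. The integration limits and step count are ad hoc choices that suit moderate $\alpha$ and $\beta$.

```python
import math


def digamma(x):
    """psi(x) for x > 0: recurrence psi(x) = psi(x + 1) - 1/x, then an asymptotic series."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x
        x += 1.0
    inv2 = 1.0 / (x * x)
    # ln x - 1/(2x) - 1/(12 x^2) + 1/(120 x^4) - 1/(252 x^6)
    return (result + math.log(x) - 0.5 / x
            - inv2 * (1.0 / 12.0 - inv2 * (1.0 / 120.0 - inv2 / 252.0)))


def log_pdf(x, alpha, beta):
    """Log of the inverse gamma density at x > 0."""
    return (alpha * math.log(beta) - math.lgamma(alpha)
            - (alpha + 1.0) * math.log(x) - beta / x)


def expectation(g, alpha, beta, lo=-12.0, hi=40.0, steps=20_000):
    """E[g(X)] by the trapezoidal rule in u = ln(x); limits are ad hoc."""
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        u = lo + i * h
        x = math.exp(u)
        weight = 0.5 if i in (0, steps) else 1.0
        # dx = x du, so the integrand in u is g(x) * f(x) * x
        total += weight * g(x) * math.exp(log_pdf(x, alpha, beta) + u)
    return total * h


alpha, beta = 3.0, 2.0
mean_log = expectation(math.log, alpha, beta)                       # ln(beta) - psi(alpha)
mean_inv = expectation(lambda x: 1.0 / x, alpha, beta)              # alpha / beta
entropy = expectation(lambda x: -log_pdf(x, alpha, beta), alpha, beta)
entropy_closed = alpha + math.log(beta) + math.lgamma(alpha) - (alpha + 1.0) * digamma(alpha)
```

The numerical values should agree with the closed forms to several decimal places for parameters of this size.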

The Kullback-Leibler divergence of Inverse-Gamma($\alpha_p$, $\beta_p$) from Inverse-Gamma($\alpha_q$, $\beta_q$) is the same as the KL-divergence of Gamma($\alpha_p$, $\beta_p$) from Gamma($\alpha_q$, $\beta_q$):

$D_{\mathrm {KL} }(\alpha _{p},\beta _{p};\alpha _{q},\beta _{q})=\mathbb {E} \left[\log {\frac {\rho (X)}{\pi (X)}}\right]=\mathbb {E} \left[\log {\frac {\rho (1/Y)}{\pi (1/Y)}}\right]=\mathbb {E} \left[\log {\frac {\rho _{G}(Y)}{\pi _{G}(Y)}}\right],$

where $\rho ,\pi$ are the pdfs of the Inverse-Gamma distributions, $\rho _{G},\pi _{G}$ are the pdfs of the Gamma distributions, and $Y$ is Gamma($\alpha_p$, $\beta_p$) distributed. In closed form,

$D_{\mathrm {KL} }(\alpha _{p},\beta _{p};\alpha _{q},\beta _{q})=(\alpha _{p}-\alpha _{q})\psi (\alpha _{p})-\log \Gamma (\alpha _{p})+\log \Gamma (\alpha _{q})+\alpha _{q}(\log \beta _{p}-\log \beta _{q})+\alpha _{p}{\frac {\beta _{q}-\beta _{p}}{\beta _{p}}}.$
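
The closed-form divergence is straightforward to evaluate. The following sketch transcribes it term by term. The digamma helper is an approximation (recurrence plus asymptotic series) included only to keep the example self-contained, and the function names are hypothetical.

```python
import math


def digamma(x):
    """psi(x) for x > 0: recurrence psi(x) = psi(x + 1) - 1/x, then an asymptotic series."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x
        x += 1.0
    inv2 = 1.0 / (x * x)
    return (result + math.log(x) - 0.5 / x
            - inv2 * (1.0 / 12.0 - inv2 * (1.0 / 120.0 - inv2 / 252.0)))


def kl_inverse_gamma(alpha_p, beta_p, alpha_q, beta_q):
    """KL divergence of Inv-Gamma(alpha_p, beta_p) from Inv-Gamma(alpha_q, beta_q)."""
    return ((alpha_p - alpha_q) * digamma(alpha_p)
            - math.lgamma(alpha_p) + math.lgamma(alpha_q)
            + alpha_q * (math.log(beta_p) - math.log(beta_q))
            + alpha_p * (beta_q - beta_p) / beta_p)


same = kl_inverse_gamma(3.0, 2.0, 3.0, 2.0)        # identical distributions
shifted = kl_inverse_gamma(2.0, 1.0, 2.0, 2.0)     # same shape, different scale
```

Two quick checks follow from the formula: the divergence of a distribution from itself is zero, and with equal shapes $\alpha = 2$, $\beta_p = 1$, $\beta_q = 2$ it reduces to $2 - 2\ln 2$.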

Related distributions

If $X\sim {\mbox{Inv-Gamma}}(\alpha ,\beta )$ then $kX\sim {\mbox{Inv-Gamma}}(\alpha ,k\beta )$ for $k>0$
If $X\sim {\mbox{Inv-Gamma}}(\alpha ,{\tfrac {1}{2}})$ then $X\sim {\mbox{Inv-}}\chi ^{2}(2\alpha )$ (inverse-chi-squared distribution)
If $X\sim {\mbox{Inv-Gamma}}({\tfrac {\alpha }{2}},{\tfrac {1}{2}})$ then $X\sim {\mbox{Scaled Inv-}}\chi ^{2}(\alpha ,{\tfrac {1}{\alpha }})$ (scaled-inverse-chi-squared distribution)
If $X\sim {\textrm {Inv-Gamma}}({\tfrac {1}{2}},{\tfrac {c}{2}})$ then $X\sim {\textrm {Levy}}(0,c)$ (Lévy distribution)
If $X\sim {\textrm {Inv-Gamma}}(1,c)$ then ${\tfrac {1}{X}}\sim {\textrm {Exp}}(c)$ (exponential distribution)
If $X\sim {\mbox{Gamma}}(\alpha ,\beta )$ (gamma distribution with rate parameter $\beta$) then ${\tfrac {1}{X}}\sim {\mbox{Inv-Gamma}}(\alpha ,\beta )$ (see the derivation in the next section for details)
Note that if $X\sim {\mbox{Gamma}}(k,\theta )$ (gamma distribution with scale parameter $\theta$) then ${\tfrac {1}{X}}\sim {\mbox{Inv-Gamma}}(k,\theta ^{-1})$
The inverse gamma distribution is a special case of the type 5 Pearson distribution.
A multivariate generalization of the inverse-gamma distribution is the inverse-Wishart distribution.
For the distribution of a sum of independent inverted Gamma variables see Witkovsky (2001).
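
Two of the relations listed above can be checked pointwise on the densities. The sketch below compares Inv-Gamma(1/2, c/2) with the Lévy(0, c) density, and the density of $kX$ with that of Inv-Gamma($\alpha$, $k\beta$). The helper names and the test point are arbitrary choices for this illustration.

```python
import math


def invgamma_pdf(x, alpha, beta):
    """Inverse gamma density for x > 0, evaluated in log space."""
    log_f = (alpha * math.log(beta) - math.lgamma(alpha)
             - (alpha + 1.0) * math.log(x) - beta / x)
    return math.exp(log_f)


def levy_pdf(x, c):
    """Levy(0, c) density: sqrt(c / (2*pi)) * exp(-c / (2*x)) / x**1.5, for x > 0."""
    return math.sqrt(c / (2.0 * math.pi)) * math.exp(-c / (2.0 * x)) / x ** 1.5


def scaled_pdf(y, k, alpha, beta):
    """Density of k*X at y when X ~ Inv-Gamma(alpha, beta): f_X(y / k) / k."""
    return invgamma_pdf(y / k, alpha, beta) / k


x, c = 0.7, 1.3
levy_gap = abs(invgamma_pdf(x, 0.5, c / 2.0) - levy_pdf(x, c))
scale_gap = abs(scaled_pdf(x, 2.5, 3.0, 1.2) - invgamma_pdf(x, 3.0, 2.5 * 1.2))
```

Both gaps should be at the level of floating-point rounding. A pointwise check like this does not prove the identities, but it catches parametrization mistakes such as confusing $c$ with $c/2$.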

Derivation from Gamma distribution

Let $X\sim {\mbox{Gamma}}(\alpha ,\beta )$, and recall that the pdf of the gamma distribution is

$f_{X}(x)={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}x^{\alpha -1}e^{-\beta x}$, $x>0$.

Note that $\beta$ is the rate parameter from the perspective of the gamma distribution.

Define the transformation $Y=g(X)={\tfrac {1}{X}}$. Then, the pdf of $Y$ is

${\begin{aligned}f_{Y}(y)&=f_{X}\left(g^{-1}(y)\right)\left|{\frac {d}{dy}}g^{-1}(y)\right|\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left({\frac {1}{y}}\right)^{\alpha -1}\exp \left({\frac {-\beta }{y}}\right){\frac {1}{y^{2}}}\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left({\frac {1}{y}}\right)^{\alpha +1}\exp \left({\frac {-\beta }{y}}\right)\\[6pt]&={\frac {\beta ^{\alpha }}{\Gamma (\alpha )}}\left(y\right)^{-\alpha -1}\exp \left({\frac {-\beta }{y}}\right)\end{aligned}}$

Note that $\beta$ is the scale parameter from the perspective of the inverse gamma distribution.
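
The derivation also suggests a simple way to simulate inverse gamma variates: draw from a gamma distribution with rate $\beta$ and take the reciprocal. The sketch below does this with the standard library. Note that `random.gammavariate` is parameterized by shape and scale, so the rate $\beta$ is passed as scale $1/\beta$. The seed, sample size and check point are arbitrary, and the closed-form CDF used for comparison holds for $\alpha = 3$ only.

```python
import math
import random


def sample_inverse_gamma(alpha, beta, rng):
    """Draw one Inv-Gamma(alpha, beta) variate as the reciprocal of a gamma draw."""
    # gammavariate takes (shape, scale); scale = 1/beta corresponds to rate beta.
    return 1.0 / rng.gammavariate(alpha, 1.0 / beta)


rng = random.Random(12345)  # fixed seed so the run is reproducible
alpha, beta = 3.0, 2.0
draws = [sample_inverse_gamma(alpha, beta, rng) for _ in range(100_000)]

sample_mean = sum(draws) / len(draws)
exact_mean = beta / (alpha - 1.0)

# For alpha = 3, Q(3, z) = exp(-z) * (1 + z + z^2 / 2), so F(1) uses z = beta / 1.
z = beta / 1.0
exact_cdf_at_1 = math.exp(-z) * (1.0 + z + z * z / 2.0)
empirical_cdf_at_1 = sum(1 for d in draws if d <= 1.0) / len(draws)
```

The sample mean and the empirical CDF should be close to the exact values, with the usual Monte Carlo error. The sample mean converges slowly when $\alpha$ is only slightly above 2 because of the heavy right tail.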

References

^ "InverseGammaDistribution—Wolfram Language Documentation". reference.wolfram.com. Retrieved 9 April 2018.
^ John D. Cook (Oct 3, 2008). "InverseGammaDistribution" (PDF). Retrieved 3 Dec 2018.

Hoff, P. (2009). "A First Course in Bayesian Statistical Methods". Springer.
Witkovsky, V. (2001). "Computing the Distribution of a Linear Combination of Inverted Gamma Variables". Kybernetika. 37 (1): 79–90. MR 1825758. Zbl 1263.62022.
