A Gentle Introduction to Markov Chain Monte Carlo (2012)

Applying probabilistic models to data usually involves integrating a complex, multi-dimensional probability distribution. For example, calculating the expectation/mean of a model distribution involves such an integration. Many (most) times, these integrals are not calculable due to the high dimensionality of the distribution or because there is no closed-form expression for the integral available using calculus. Markov Chain Monte Carlo (MCMC) is a method that allows one to approximate complex integrals using stochastic sampling routines. As MCMC’s name indicates, the method is composed of two components, the Markov chain and Monte Carlo integration .

Monte Carlo integration is a powerful technique that exploits stochastic sampling of the distribution in question in order to approximate the difficult integration. However, in order to use Monte Carlo integration it is necessary to be able to sample from the probability distribution in question, which may be difficult or impossible to do directly. This is where the second component of MCMC, the Markov chain, comes in. A Markov chain is a sequential model that transitions from one state to another in a probabilistic fashion, where the next state that the chain takes is conditioned on the previous state. Markov chains are useful in that if they are constructed properly, and allowed to run for a long time, the states that a chain will take also sample from a target probability distribution. Therefore we can construct Markov chains to sample from the distribution whose integral we would like to approximate, then use Monte Carlo integration to perform the approximation.

Here I introduce a series of posts where I describe the basic concepts underlying MCMC, starting off by describing Monte Carlo Integration , then giving a brief introduction of Markov chains and how they can be constructed to sample from a target probability distribution. Given these foundation principles, we can then discuss MCMC techniques such as theMetropolisand Metropolis-Hastings algorithms, theGibbs sampler, and the Hybrid Monte Carloalgorithm.

As always, each post has a somewhat formal/mathematical introduction, along with an example and simple Matlab implementations of the associated algorithms.

Recommend

CVE-2017-11176: A step-by-step Linux Kernel exploitation (part 1/4)

GitHub - 4Catalyzer/astroturf: An "artificial" css-in-js for those tha...

GitHub - jsonmc/jsonmc: JSON Movie Collection

TCP协议详解

米粉因小米不守“与雷总聚餐”承诺起诉赔偿2.4万元 - Xiaomi 小米科技 - cnBeta.COM

联想超炫酷舰式主机亮相：支持第九代酷睿自带投影/1.5万起 - Lenovo 联想 - cnBeta.C...

索菲·特纳：《权力的游戏》结局可能会在观众中引发分歧 - 美剧 - cnBeta.COM

从差点被恒生指数“开除”到股价大涨46%，联想发生了什么？ - Lenovo 联想 - cnBeta.COM

小米孙昌旭评价iPhone XS系列：不值这么多钱 - Xiaomi 小米科技 - cnBeta.COM

腾讯架构大调整: 追赶阿里云狙击头条系 - Tencent 腾讯 - cnBeta.COM

About Joyk