Stochastic Processes & Markov Chains

Fri 27 March 2026

by Daniel Melichar

in Crash Course

tagged statistics, stochastic processes, markov chains, machine learning

Part of the Statistics for Machine Learning series.

Stochastic Processes

Stochastic Processes

:::Remark

For these notes we assume that space is discrete finite or discrete infinite countable. The more general case can be read here.

:::

:::Definition (Stochastic process).

A sequence of random variables $(X_n){n \geq 0}$ is a discrete stochastic process if $n \in \mathbb{N}$ and a continuous stochastic process if $n \in \mathbb{R}^+$ with the initial state $X_0$. We have that each $X_i$ is independent and identically distributed, but there is some dependence on the previous $X$.

:::

:::Definition (Discrete-time Markov Chain).

A discrete-time Markov Chain consists of a tuple $(\lambda, P)$ over some probability space $(\Omega, \mathcal{E}, P)$ where

the initial distribution is a probability measure $\lambda: S \to [0, 1]$ with $\sum_i \lambda_i = 1$ over a finite or countable state space $S$
and the transition matrix $P = (p(x, y)){x,y \in S}$ with finite or continuous indices and with $p(x, y) \in [0,1]$ is called stochastic if and only if the sum of each row is one $\sum p(x, y) = 1$ for a fixed $x \in S$. We call $P$ homogenous if it doesn't change and inhomogenous $P_n$ if it changes at time $n$

is a stochastic process if and only if the following properties hold

the initial state follows $P(X_0 = x) = \lambda(x)$ for some $x \in S$
a state only depends on its predecessor, i.e. the so called Markov property holds $P(X_n = x_n \mid X_{n-1}=x_{n-1}) = p(x_{n-1}, x_n)$ for all $x_i \in S$

We then write $X = (X_n)_{n \geq 0} \sim \text{Markov}(\lambda, P)$.

:::

:::Proposition

The following proposition tells us the probability of states

$$P(X_0 = x_0, X_1 = x_1, \cdots, X_n=x_n) = \lambda(x_0) \prod_{i=1}^n p(x_{i-1}, x_i)$$

:::

:::Example (Time-n transition probability).

In the following a linear algebra perspective is useful. Suppose we wish to know what the probability of a state will be after $n$ steps.

:::