Maximum Likelihood Estimation¶
Context¶
Given a random vector \(\mathbf{Y}\), whose probability distribution is expected to be modeled by a given function \(f_{\mathbf{Y}}(\mathbf{y} ; \mathbf{\theta})\), where
- \(\mathbf{y}\) is a sample of \(\mathbf{Y}\)
- \(\mathbf{\theta}\) is a set of parameters controlling the probability distribution.
A sample \(\mathbf{y}\) is available, while the actual values of \(\mathbf{\theta}\) are not. Without additional information about the probability distribution of \(\mathbf{\theta}\), we want to have some good estimate of \(\mathbf{\theta}\) based on the sample \(\mathbf{y}\).
Problem¶
Given the probability distribution \(f_{\mathbf{Y}}(\mathbf{y} ; \mathbf{\theta})\) and a sample \(\mathbf{y}\), how to make a good estimate of \(\mathbf{\theta}\)?
Solution¶
Find the values of the parameters that maximize the probability function for the given sample \(\mathbf{y}\). In other words, find \(\hat{\theta}\) such that
where \(\mathbf{\Theta}\) is the parameter space consisting of all the possible values of \(\mathbf{\theta}\)
Applications¶
Decoding Signals¶
Given that
- A transmitter sends out a sequence of signal \((x_1, x_2, x_3)\) where \(x_i \in \{0,1\}\)
- \(x_i\) are i.i.d
- The receiver receives a sequence of noisy signal \((0.9, 0.2, 0.8)\)
- the noisy channel can be modeled as \(y_i = x_i + n_i\), where \(n_i \sim \mathcal{N}(0, \sigma^2)\)
- \(n_i\) are i.i.d
The problem of decoding the received signal can be solved by the maximum likelihood estimator whose unknown parameters are
Since both \(x_i\) and \(n_i\) are i.i.d, the probability function of \(\mathbf{Y}\) can be written as
And the solution to this maximum likelihood estimation problem is