In signal processing and communications, one often has to deal with unknown signals. For example, the wireless channel (the air medium) distorts the transmitted signal in many ways (multipath fading, inter-symbol interference (ISI), etc.), so the receiver has to use its own methods to recover the signal. Generally, such signals are modeled as stochastic processes, and statistical methods are used to "demystify" the received waveform. The air medium can be considered a time-varying FIR filter, and the receiver must estimate the properties of this wireless medium. Such techniques are called channel estimation methods.
I have begun a series of articles in which I'll explain the various methods of channel estimation. In this post, I'll explain a method called Maximum Likelihood Estimation (MLE). I wrote it in LaTeX and converted it to PDF. You can download the PDF here.
Please comment if you have anything to ask!
For simplicity, let us assume that the channel is linear and time-invariant. Later, we'll extend the model to a time-varying channel.
Let the input sequence of information bits be s[n], and let v[n] be additive noise. The output y[n] at the receiver can be modelled as
$$y[n] = \sum_{k=0}^{L-1} h[k]\, s[n-k] + v[n] \tag{1}$$
where n = 0, 1, …, N − 1. We have assumed that the channel impulse response is h[k], k = 0, 1, …, L − 1, that is, the channel has L taps.
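To make the signal model concrete, here is a minimal Python sketch. It assumes Eq. (1) is the usual FIR convolution y[n] = Σ h[k] s[n − k] + v[n], that samples s[n − k] with n − k < 0 are zero, and that the noise is white Gaussian. The function name `channel_output`, the symbol values and the tap values are all made up for illustration.

```python
import random

def channel_output(s, h, noise_std=0.0, rng=None):
    """Return y[n] = sum_k h[k] * s[n - k] + v[n] for n = 0..N-1.

    Assumption: input samples before s[0] are treated as zero.
    """
    rng = rng or random.Random(0)
    N, L = len(s), len(h)
    y = []
    for n in range(N):
        # Convolution sum over the L channel taps.
        acc = sum(h[k] * s[n - k] for k in range(L) if n - k >= 0)
        # Additive white Gaussian noise (skipped when noise_std == 0).
        v = rng.gauss(0.0, noise_std) if noise_std > 0 else 0.0
        y.append(acc + v)
    return y

s = [1, -1, 1, 1, -1]            # hypothetical BPSK-like symbols
h = [1.0, 0.5, 0.25]             # hypothetical 3-tap channel
y_clean = channel_output(s, h)   # noise-free received samples
y_noisy = channel_output(s, h, noise_std=0.1)
```

Comparing `y_clean` with `y_noisy` shows what the estimator has to work with: the receiver only sees the noisy samples and has to infer h from them.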
The channel estimator is defined as any statistic (a known function of observable random variables that is itself a random variable) whose values are used to estimate τ(𝜃), where τ(⋅) is some function of an unknown parameter 𝜃.
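The channel vector is given by
$$\mathbf{h} = \big[\, h[0],\; h[1],\; \ldots,\; h[L-1] \,\big]^T \tag{2}$$
where $\{\cdot\}^T$ denotes the transpose of a vector. Suppose that we have received N samples of the output
$$\mathbf{y} = \big[\, y[0],\; y[1],\; \ldots,\; y[N-1] \,\big]^T \tag{3}$$
Then we can rewrite Eq. (1) as
$$\mathbf{y} = \mathbf{S}\,\mathbf{h} + \mathbf{v} \tag{4}$$
where $\mathbf{v} = \big[\, v[0],\; v[1],\; \ldots,\; v[N-1] \,\big]^T$ is the noise vector.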
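The matrix S is an N × L Toeplitz matrix consisting of the samples of the input sequence, with s[n − k] in row n and column k. It is given by
$$\mathbf{S} = \begin{bmatrix} s[0] & s[-1] & \cdots & s[-L+1] \\ s[1] & s[0] & \cdots & s[-L+2] \\ \vdots & \vdots & \ddots & \vdots \\ s[N-1] & s[N-2] & \cdots & s[N-L] \end{bmatrix} \tag{5}$$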
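The matrix form can be sketched in a few lines of Python. This assumes that the entry in row n and column k of S is s[n − k] and, as a further assumption, that input samples before s[0] are zero. The helper names `toeplitz_input_matrix` and `matvec` and the numbers are hypothetical.

```python
def toeplitz_input_matrix(s, L):
    """Build the N x L matrix with S[n][k] = s[n - k].

    Assumption: samples with a negative index are taken as zero.
    """
    N = len(s)
    return [[s[n - k] if n - k >= 0 else 0 for k in range(L)]
            for n in range(N)]

def matvec(S, h):
    """Plain matrix-vector product S h."""
    return [sum(a * b for a, b in zip(row, h)) for row in S]

s = [1, -1, 1, 1, -1]        # hypothetical input symbols
h = [1.0, 0.5, 0.25]         # hypothetical 3-tap channel
S = toeplitz_input_matrix(s, len(h))
y = matvec(S, h)             # noise-free y = S h

for row in S:
    print(row)
```

Each descending diagonal of the printed matrix holds one constant value, which is what makes it Toeplitz, and `y` contains the same values a direct convolution of s with h would give for n = 0, …, N − 1.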
Let 𝜃 be the vector of unknown parameters, which may contain the channel vector h and possibly all or part of the input vector s. Assume that the joint probability distribution of the noise vector v and the input vector s is known. We can then obtain the probability density function (pdf) of the observation vector y.
The joint pdf of the observation, $f_{\mathbf{y}}(\mathbf{y};\theta)$, is called the likelihood function. For simplicity, assume that 𝜃 is a single parameter. If the observation samples y[0], y[1], …, y[N − 1] are independent of each other, the likelihood function can be written as the product of the individual densities:
$$f_{\mathbf{y}}(\mathbf{y};\theta) = \prod_{n=0}^{N-1} f_{y[n]}\big(y[n];\theta\big) \tag{6}$$
Thus, the maximum likelihood estimator is the solution of the equation
$$\frac{d}{d\theta}\, f_{\mathbf{y}}(\mathbf{y};\theta) = 0 \tag{7}$$
Generally, it is easier to find the maximum of the logarithm of the likelihood, $\ln f_{\mathbf{y}}(\mathbf{y};\theta)$. Because the logarithm is monotonically increasing, both $f_{\mathbf{y}}(\mathbf{y};\theta)$ and $\ln f_{\mathbf{y}}(\mathbf{y};\theta)$ have their maxima at the same value of 𝜃. If, instead of a single unknown 𝜃, there are multiple unknowns, the ordinary derivative d∕d𝜃 in Eq. (7) is replaced by the partial derivative ∂∕∂𝜃 with respect to each unknown.
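A small numerical experiment illustrates why working with the logarithm is harmless. The sketch below assumes independent Gaussian observations with unit variance and an unknown mean 𝜃 (this density is chosen only for illustration), and searches a grid of candidate values with both the likelihood and the log-likelihood. The data and the grid are made up.

```python
import math

def likelihood(y, theta):
    """Product of unit-variance Gaussian densities with mean theta."""
    return math.prod(
        math.exp(-0.5 * (yn - theta) ** 2) / math.sqrt(2 * math.pi)
        for yn in y
    )

def log_likelihood(y, theta):
    """Sum of the logarithms of the same densities."""
    return sum(
        -0.5 * math.log(2 * math.pi) - 0.5 * (yn - theta) ** 2
        for yn in y
    )

y = [0.9, 1.4, 0.7, 1.2, 0.8]              # hypothetical observations
grid = [i / 100 for i in range(0, 201)]    # candidate theta in [0, 2]

# Grid value that maximises each criterion.
theta_lik = max(grid, key=lambda t: likelihood(y, t))
theta_log = max(grid, key=lambda t: log_likelihood(y, t))
print(theta_lik, theta_log)
```

Both searches pick the same grid point. The log form also turns the product into a sum, which avoids numerical underflow when N is large.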
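Assume that the channel impulse response h = h[0], h[1], …, h[N − 1] is a Gaussian random process, that is, each sample has the pdf
$$f\big(h[n];\mu,\sigma^2\big) = \frac{1}{\sqrt{2\pi\sigma^2}} \exp\!\left(-\frac{(h[n]-\mu)^2}{2\sigma^2}\right) \tag{8}$$
with mean μ, −∞ < μ < ∞, and variance σ², σ > 0. Find the MLEs $\hat{\mu}$ and $\hat{\sigma}^2$ of μ and σ², respectively.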
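It is reasonable to assume that the individual samples h[0], h[1], …, h[N − 1] are independent of each other. In such a case, the likelihood function is given by
$$f_{\mathbf{h}}(\mathbf{h};\mu,\sigma^2) = \prod_{n=0}^{N-1} \frac{1}{\sqrt{2\pi\sigma^2}} \exp\!\left(-\frac{(h[n]-\mu)^2}{2\sigma^2}\right) = \left(2\pi\sigma^2\right)^{-N/2} \exp\!\left(-\frac{1}{2\sigma^2}\sum_{n=0}^{N-1}(h[n]-\mu)^2\right) \tag{9}$$
The logarithm of the likelihood function (the log-likelihood function, LLF) is given by
$$\ln f_{\mathbf{h}}(\mathbf{h};\mu,\sigma^2) = -\frac{N}{2}\ln(2\pi) - \frac{N}{2}\ln\sigma^2 - \frac{1}{2\sigma^2}\sum_{n=0}^{N-1}(h[n]-\mu)^2 \tag{10}$$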
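As a sanity check on the algebra, the sketch below evaluates the Gaussian LLF in two ways: with the closed form (constant term, log-variance term, and scaled sum of squared deviations), and by taking the logarithm of the product of the individual densities. It assumes i.i.d. Gaussian samples; the function names and the numbers are hypothetical.

```python
import math

def gaussian_llf(h, mu, sigma2):
    """Closed-form log-likelihood of i.i.d. Gaussian samples h[0..N-1]."""
    N = len(h)
    sq = sum((hn - mu) ** 2 for hn in h)
    return (-0.5 * N * math.log(2 * math.pi)
            - 0.5 * N * math.log(sigma2)
            - sq / (2 * sigma2))

def gaussian_llf_from_densities(h, mu, sigma2):
    """Same quantity, computed as ln of the product of the densities."""
    dens = [
        math.exp(-(hn - mu) ** 2 / (2 * sigma2)) / math.sqrt(2 * math.pi * sigma2)
        for hn in h
    ]
    return math.log(math.prod(dens))

h = [0.2, -0.1, 0.4, 0.3]    # hypothetical channel samples
a = gaussian_llf(h, mu=0.2, sigma2=0.04)
b = gaussian_llf_from_densities(h, mu=0.2, sigma2=0.04)
print(a, b)
```

The two values agree up to floating-point rounding, which is a quick way to catch a sign or factor error when deriving an LLF by hand.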
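At a maximum (or minimum) of the LLF, the first derivative of the LLF is 0. At a maximum, in addition, the second derivative of the LLF is negative. In other words, we need to solve for μ = $\hat{\mu}$ such that
$$\left.\frac{\partial}{\partial\mu}\ln f_{\mathbf{h}}(\mathbf{h};\mu,\sigma^2)\right|_{\mu=\hat{\mu}} = 0, \qquad \left.\frac{\partial^2}{\partial\mu^2}\ln f_{\mathbf{h}}(\mathbf{h};\mu,\sigma^2)\right|_{\mu=\hat{\mu}} < 0 \tag{11}$$
Similarly, for σ² = $\hat{\sigma}^2$,
$$\left.\frac{\partial}{\partial\phi}\ln f_{\mathbf{h}}(\mathbf{h};\mu,\phi)\right|_{\phi=\hat{\sigma}^2} = 0, \qquad \left.\frac{\partial^2}{\partial\phi^2}\ln f_{\mathbf{h}}(\mathbf{h};\mu,\phi)\right|_{\phi=\hat{\sigma}^2} < 0 \tag{12}$$
where ϕ = σ².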
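Before solving these conditions analytically, they can be checked numerically. The following is only a rough sketch, not the analytical solution: it assumes the i.i.d. Gaussian LLF, finds the best point on a coarse grid by brute force, and then estimates the first and second derivatives there with central differences. The grid ranges, step sizes and sample values are arbitrary choices.

```python
import math

def llf(h, mu, phi):
    """Gaussian log-likelihood of the samples h, with phi = sigma^2."""
    N = len(h)
    sq = sum((x - mu) ** 2 for x in h)
    return -0.5 * N * math.log(2 * math.pi * phi) - sq / (2 * phi)

def d1(f, x, eps=1e-5):
    """Central-difference estimate of the first derivative of f at x."""
    return (f(x + eps) - f(x - eps)) / (2 * eps)

def d2(f, x, eps=1e-4):
    """Central-difference estimate of the second derivative of f at x."""
    return (f(x + eps) - 2 * f(x) + f(x - eps)) / eps ** 2

h = [0.2, -0.1, 0.4, 0.3]                   # hypothetical samples
mus = [i / 100 for i in range(-50, 51)]     # candidate mu values
phis = [i / 2000 for i in range(1, 401)]    # candidate phi values (> 0)

# Brute-force search for the grid point with the largest LLF.
mu_hat, phi_hat = max(((m, p) for m in mus for p in phis),
                      key=lambda mp: llf(h, mp[0], mp[1]))

# First derivatives should be close to zero, second derivatives negative.
dmu = d1(lambda m: llf(h, m, phi_hat), mu_hat)
dphi = d1(lambda p: llf(h, mu_hat, p), phi_hat)
d2mu = d2(lambda m: llf(h, m, phi_hat), mu_hat)
d2phi = d2(lambda p: llf(h, mu_hat, p), phi_hat)
print(mu_hat, phi_hat, dmu, dphi, d2mu, d2phi)
```

A grid search like this scales poorly with the number of unknowns, which is one reason a closed-form solution of the two conditions is worth deriving.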
Wait until next week for the solution of the above equations! See you!
Copyright © 2010, Satish BD. Some rights reserved. All the content in this article is licensed under the Creative Commons BY-NC-SA 2.5 India License. The original location of this document can be found at my site: http://www.bdsatish.in. You can e-mail me at: mail@bdsatish.in