Reference:
- Elements of Information Theory, 2nd Edition
- Slides of EE4560, TUD
Preliminaries
We analyze a communication system as shown in Figure 7.8.
- $W$: message drawn from the index set $\{1,2,\cdots,M\}$
- $X^n=f(W)$: transmitted sequence, consisting of $n$ symbols from the channel input alphabet $\mathcal X$
- $Y^n\sim p(y^n|x^n)$: received sequence, consisting of $n$ symbols from the channel output alphabet $\mathcal Y$
- $\hat W=g(Y^n)$: estimated message from the index set $\{1,2,\cdots,M\}$; a decoding error occurs in case $\hat W \neq W$
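To make this chain concrete, here is a minimal Python sketch of a single channel use. None of it is from the text: it assumes a binary symmetric channel with crossover probability $p=0.1$, and uses a hypothetical 3-bit repetition encoder $f$ with a majority-vote decoder $g$, just so that $W \to X^n \to Y^n \to \hat W$ is runnable end to end.

```python
import random

M, n, p = 2, 3, 0.1           # assumed: M = 2 messages, block length n = 3, BSC(0.1)

def f(w):                     # hypothetical encoder: 3-bit repetition code
    return (0,) * n if w == 1 else (1,) * n

def channel(xn):              # BSC: flips each symbol independently with prob. p
    return tuple(x ^ (random.random() < p) for x in xn)

def g(yn):                    # hypothetical decoder: majority vote
    return 1 if sum(yn) < n / 2 else 2

w = random.randint(1, M)      # W drawn uniformly from the index set {1, ..., M}
xn = f(w)                     # X^n = f(W)
yn = channel(xn)              # Y^n ~ p(y^n | x^n)
w_hat = g(yn)                 # W_hat = g(Y^n)
print(w, xn, yn, w_hat, "decoding error" if w_hat != w else "ok")
```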
Definition 1 (Code):
An $(M,n)$ code for the channel $(\mathcal X,p(y|x),\mathcal Y)$ consists of
- An index set $\{1,2,\cdots,M\}$
- An encoding function $f:\{1,2,\cdots,M\}\to \mathcal X^n$, yielding codewords $x^n(1),\cdots, x^n(M)$. The set of codewords is called the codebook
- A decoding function $g:\mathcal Y^n\to \{1,2,\cdots,M\}$
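As an illustration of Definition 1 (again a sketch, not from the text), the snippet below packages an assumed toy $(M,n)=(4,3)$ code over $\mathcal X=\mathcal Y=\{0,1\}$ as exactly the three objects of the definition: an index set, an encoding function given by a codebook lookup, and a minimum-Hamming-distance decoder.

```python
# A toy (M, n) = (4, 3) code; the codebook and decoding rule are assumptions.
index_set = [1, 2, 3, 4]

codebook = {1: (0, 0, 0),     # x^n(1)
            2: (0, 1, 1),     # x^n(2)
            3: (1, 0, 1),     # x^n(3)
            4: (1, 1, 0)}     # x^n(4)

def f(w):
    """Encoding function f: {1, ..., M} -> X^n (codebook lookup)."""
    return codebook[w]

def g(yn):
    """Decoding function g: Y^n -> {1, ..., M} (nearest codeword in Hamming distance)."""
    return min(index_set,
               key=lambda w: sum(u != v for u, v in zip(codebook[w], yn)))
```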
Example [slides 5-7]
Definition 2 (Conditional Probability of Error):
The conditional probability of error given that index $i$ was sent:
$$\lambda_i=\Pr\big(g(Y^n)\ne i \mid X^n=x^n(i)\big)=\sum_{y^n}p(y^n|x^n(i))\,I(g(y^n)\ne i)\tag{1}$$
where $I(\cdot)$ is the indicator function ($x^n$ and $y^n$ denote the possible realizations of $X^n$ and $Y^n$).
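For a small code, equation (1) can be evaluated exactly by summing over all $|\mathcal Y|^n$ output sequences. A self-contained sketch, assuming a memoryless BSC with $p=0.1$ and the hypothetical 3-bit repetition code from the first sketch:

```python
from itertools import product

n, p = 3, 0.1
codebook = {1: (0, 0, 0), 2: (1, 1, 1)}           # assumed repetition code

def g(yn):                                        # majority-vote decoder
    return 1 if sum(yn) < n / 2 else 2

def p_cond(yn, xn):                               # p(y^n | x^n) for a memoryless BSC
    prob = 1.0
    for y, x in zip(yn, xn):
        prob *= p if y != x else 1 - p
    return prob

def error_prob(i):                                # lambda_i from equation (1)
    return sum(p_cond(yn, codebook[i])            # sum over all y^n ...
               for yn in product((0, 1), repeat=n)
               if g(yn) != i)                     # ... where I(g(y^n) != i) = 1

print(error_prob(1))                              # 0.028 for p = 0.1
```

For the repetition code this reproduces the closed form $3p^2(1-p)+p^3$, i.e. the probability of two or more flips.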
Maximal probability of error:
$$\lambda^{(n)}=\max_{i\in \{1,2,\cdots,M\}} \lambda_i \tag{2}$$
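Continuing the sketch above, the maximal probability of error in equation (2) is just the worst case over the message indices:

```python
# lambda^(n) = max_i lambda_i, reusing error_prob and codebook from the previous sketch
lambda_max = max(error_prob(i) for i in codebook)
print(lambda_max)   # 0.028 here: by symmetry of the repetition code, lambda_1 = lambda_2
```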
Average probability of error: