Basics of error-correcting codes
Generations of wireless communications

Generation | time | technology | max speed |
---|---|---|---|
1G | early 1980s | analog, FM | 12 kbps |
2G | 1991 | digital, TDMA | 50 kbps–1 Mbps |
3G (incl. UMTS, CDMA2000, 3GPP) | 1998 | CDMA | 20 Mbps |
4G (LTE) | 2008 | OFDM | 1 Gbps |
5G | 2020 | OFDM | up to 10 Gbps |
Communication system

A channel is characterized by its transition probabilities:

$$\Pr\{Y=y \mid X=x\} \quad \text{for any } x\in \mathcal X,\; y \in \mathcal Y$$

e.g., $\mathcal X=\{0,1\}$, $\mathcal Y=\mathbb R$.
In the case of continuous $y$, we use the conditional pdf instead.
Channel models

- BSC: binary symmetric channel.
  Capacity (bits per channel use): $1-h_2(p)$, where $h_2(p)=-p\log_2 p-(1-p)\log_2(1-p)$ is the binary entropy function.
- BEC: binary erasure channel.
  Capacity: $1-\varepsilon$.
- AWGN: additive white Gaussian noise channel.
  $x\in \{-1,+1\}$ (binary), or continuous $x\in\mathbb R$; $x$ is subject to the power constraint $E[X^2]=P$, where $P$ is the transmitter power.
  Capacity: $\frac 1 2 \log_2(1+\mathrm{SNR})$ with $\mathrm{SNR}=\frac P {\sigma^2}$; with bandwidth $W$, the capacity is $W \log_2(1+\mathrm{SNR})$ bits/s.
- Rayleigh fading channel.
  $x\in \mathcal X$, $y \in \mathbb R$, $y=ax+n$; $a$ is a random variable with a Rayleigh distribution with scale parameter $\tau^2$, and $n\sim \mathcal N(0,\sigma^2)$.
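To make these channel models concrete, here is a minimal Python sketch (NumPy assumed; the function names are mine, not from any standard library) that simulates one use of each channel:

```python
import numpy as np

rng = np.random.default_rng(0)

def bsc(x, p):
    """Binary symmetric channel: flip each bit independently with probability p."""
    flips = (rng.random(x.shape) < p).astype(int)
    return np.bitwise_xor(x, flips)

def bec(x, eps):
    """Binary erasure channel: erase each bit (marked -1 here) with probability eps."""
    y = x.astype(float)
    y[rng.random(x.shape) < eps] = -1.0  # -1 is our erasure marker
    return y

def awgn(x, sigma2):
    """AWGN channel with BPSK input x in {-1,+1} and noise variance sigma2."""
    return x + rng.normal(0.0, np.sqrt(sigma2), x.shape)

x = rng.integers(0, 2, 10)       # random bits
print(bsc(x, 0.1))
print(bec(x, 0.3))
print(awgn(2 * x - 1, 0.5))      # BPSK-modulated bits through AWGN
```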
code design
Code: a structured subset of an ambient set; the collection of all codewords.
Encoder: a mapping between the set of messages and the set of codewords.
Decoder: given an element $y \in A$ ($y$ is the received symbol, or a sequence of such $y$'s), find the "most likely" codeword/message.
m: message
c: codeword
C: code
Minimize the probability of error $\Pr\{\hat m \ne m\}$ through the structure of the code.
A natural structure with algebraic properties to exploit is a linear subspace of an ambient vector space.
A linear code $C$ of dimension $k$ is a $k$-dimensional subspace of the ambient space $F^n$; here $F$ is the binary field $\{0,1\}$.
Each element $c\in C$ is represented as a vector of length $n$: $c=(c_1,c_2,c_3,\ldots,c_n)$, $c_i\in F$; $c$ is a binary sequence and $n$ is the length of the code.
$(n,k)$ code: $n$ is the block length, $k$ is the dimension of the code, $k \le n$.
Example:
Let $C$ be an $(n,n-1)$ linear code defined as the single parity-check code.
Rate: bits per channel/symbol use.
The rate of a code $C$ of length $n$ over an alphabet of size $q$:
$$\mathrm{rate}(C)=\frac {\log_q|C|} n\,\Big|_{|C|=q^k}=\frac k n$$
where $q^k$ is the size of a code of dimension $k$, and $q$ is the alphabet size.
Hamming distance
$d_H(x,y)$ = the number of positions (bits) in which $x$ and $y$ differ; $x$: transmitted, $y$: received.
properties:
- $d_H(x,y)\ge0$
- $d_H(x,y)=0 \iff x=y$
- $d_H(x,y)=d_H(y,x)$
- triangle inequality: $d_H(x,z)\le d_H(x,y)+d_H(y,z)$
Hamming weight of a vector $\vec{x}$: the number of non-zero entries of $\vec{x}$,
$$w_H(x)=d_H(x,0)$$
where $0$ is the all-zero codeword.
Minimum distance of a code
$$d_{min}(C)=\min_{x,x'\in C,\ x\ne x'} d_H(x,x')$$
For a linear code $C$:
$$d_{min}(C)=\min_{c\in C,\ c\ne 0} w_H(c)$$
Thus, to find the minimum distance of a linear code, we just need to find the minimum distance from a non-zero codeword to the all-zero codeword, i.e., the minimum weight of a non-zero codeword.
Theorem (worst-case guarantee): let $d=d_{min}(C)$; then $C$ can correct up to $\lfloor\frac {d-1} 2\rfloor$ errors.
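A tiny Python check of this distance/weight equivalence on the $(3,2)$ single parity-check code (a sketch; the codeword list is written out by hand):

```python
import itertools

def hamming(x, y):
    """d_H(x, y): number of positions in which x and y differ."""
    return sum(a != b for a, b in zip(x, y))

# (3,2) single parity-check code: all length-3 words of even weight
C = [(0, 0, 0), (1, 1, 0), (1, 0, 1), (0, 1, 1)]

d_min = min(hamming(x, y) for x, y in itertools.combinations(C, 2))
w_min = min(sum(c) for c in C if any(c))   # minimum non-zero Hamming weight
print(d_min, w_min)                        # -> 2 2: they agree, since C is linear
```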
Approaches to code design
Construct codes with maximum distance, given a certain rate (or length and size).
Algebraic codes; Turbo codes (used in 3G/4G); LDPC and polar codes (used in 5G).
linear code approach
Consider a basis for an $(n,k)$ linear code $C$ over the field $F$, denoted $c_1, c_2,\ldots,c_k$:
$$C=\{ \lambda_1c_1+\lambda_2c_2+\ldots+\lambda_kc_k \mid \lambda_i\in F \}$$
Let
$$G=\begin{bmatrix} c_1 \\ c_2 \\ \vdots \\ c_k \end{bmatrix}_{k\times n}$$
be a generator matrix for the code $C$.
$$c=(\lambda_1, \lambda_2,\ldots, \lambda_k)\, G$$
$$C=\{vG \mid v\in F^k\},\quad v: \text{message (row) vector}$$
the generator matrix is not unique
Encoding mapping: $v\rightarrow vG$, where $v$ is a message of length $k$ ($k$ bits) and $vG$ is the encoded codeword.
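As a quick illustration (a Python/NumPy sketch; the matrices here are my own toy example), encoding is a vector-matrix product over GF(2):

```python
import numpy as np

def encode(v, G):
    """Encode the message row vector v (length k) into the codeword vG over GF(2)."""
    return np.mod(v @ G, 2)

# generator matrix of the (4,3) single parity-check code
G = np.array([[1, 0, 0, 1],
              [0, 1, 0, 1],
              [0, 0, 1, 1]])
print(encode(np.array([1, 0, 1]), G))   # -> [1 0 1 0]
```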
example:
single parity-check code:
$$(x_1,x_2,\ldots,x_{n-1})\rightarrow \Big(x_1,x_2,\ldots,x_{n-1},\sum_{i=1}^{n-1}x_i\Big)$$
(In the figure, the bits to the left of the dashed line, i.e., the message bits, can be arbitrary.)
systematic encoder: every encoded codeword contains the original message as follows:
message = $(u_1,u_2,\ldots,u_k)$, codeword = $(u_1,u_2,\ldots,u_k, x_{k+1},\ldots,x_n)$
so
$$G=\begin{bmatrix}I_{k\times k} \,\big|\, A_{k \times (n-k)} \end{bmatrix}_{k\times n}$$
no matter what the matrix $A$ is.
Theorem: every linear code has a systematic encoder, up to a permutation of the code bits; this lets us design the generator matrix in the form above.
For a code $C$ with generator matrix $G_{k\times n}$, let $H_{(n-k)\times n}$ denote the kernel of $G_{k\times n}$:
$$GH^T=0_{k\times (n-k)}$$
All rows of $G$ are orthogonal to all rows of $H$.
$H$: the parity-check matrix for $C$.
Note: over the binary field, non-zero vectors can be self-orthogonal; any binary vector of even Hamming weight is self-orthogonal.
Example:
For the single parity-check code $C$ with $G_{(n-1)\times n}$, we have $H=[1,1,\ldots,1]_{1\times n}$. In general, for a systematic
$$G=\begin{bmatrix} I_{k\times k}\,\big|\, A_{k\times (n-k)} \end{bmatrix}_{k\times n}$$
we have
$$H=\begin{bmatrix} -A^T \,\big|\, I_{(n-k)\times (n-k)}\end{bmatrix}_{(n-k)\times n}$$
Example:
Let C be a binary linear (6, 3) code with the generator matrix
$$G=\begin{bmatrix} 1&0&1&1&0&1\\ 0&1&0&1&1&0\\ 0&0&1&0&0&1 \end{bmatrix}$$
a. Find a systematic generator matrix for C.
systematic form:
$$G_{sys}=\begin{bmatrix} 1&0&0&1&0&0\\ 0&1&0&1&1&0\\ 0&0&1&0&0&1 \end{bmatrix}$$
b. Find a parity-check matrix for C.
$$G_{sys}=\begin{bmatrix} I_{k\times k}\,\big|\, A_{k\times (n-k)} \end{bmatrix}_{k\times n},\qquad H=\begin{bmatrix} -A^T \,\big|\, I_{(n-k)\times (n-k)}\end{bmatrix}_{(n-k)\times n}$$
$$H=\begin{bmatrix} 1&1&0&1&0&0\\ 0&1&0&0&1&0\\ 0&0&1&0&0&1 \end{bmatrix}$$
c. What is the minimum distance of C?
The minimum distance is at least two, since there is no zero column in $H$. And we do have a codeword of weight 2 (the third row of $G$), so $d_{min}(C)=2$.
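A short Python check (NumPy; a sketch, not part of the original notes) verifying $G_{sys}H^T=0$ and $d_{min}(C)=2$ for this example:

```python
import itertools
import numpy as np

G_sys = np.array([[1, 0, 0, 1, 0, 0],
                  [0, 1, 0, 1, 1, 0],
                  [0, 0, 1, 0, 0, 1]])
A = G_sys[:, 3:]                               # G_sys = [I | A]
H = np.hstack([A.T, np.eye(3, dtype=int)])     # H = [-A^T | I]; over GF(2), -A^T = A^T

print(np.mod(G_sys @ H.T, 2))                  # 3x3 all-zero matrix: G H^T = 0

weights = [int(np.mod(np.array(m) @ G_sys, 2).sum())
           for m in itertools.product([0, 1], repeat=3) if any(m)]
print(min(weights))                            # -> 2, i.e. d_min(C) = 2
```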
Lemma: let $C$ be a linear $(n,k)$ code with parity-check matrix $H$; then $c\in C \iff Hc^T=0$.
Graphical model representation of decoding, message passing algorithms
Linear code $C$ with parity-check matrix $H_{(n-k)\times n}$; $n$: block length, $k$: number of information bits.
$$c\in C \iff Hc^T=0$$
Each row of $H$ is a parity-check equation.
For any $y\in F^n$, the syndrome of $y$ with respect to the code $C$ with parity-check matrix $H$ is defined as $Hy^T$: a column vector of size $(n-k)\times 1$.
Number of possible syndromes: $2^{n-k}$.
$$H(a_i+c_j)^T=Ha^T_i+Hc^T_j,\quad \text{where } Hc^T_j=0$$
Let $S_1, S_2,\ldots,S_{2^{n-k}}$ denote all possible syndromes, and let $a_i$ be the minimum-weight vector with $Ha^T_i=S_i$.
Coset leader | standard array | syndromes |
---|---|---|
$a_1$ | $a_1+c_1\;\;\ldots\;\;a_1+c_{2^k}$ | $S_1$ |
$a_2$ | $a_2+c_1\;\;\ldots\;\;a_2+c_{2^k}$ | $S_2$ |
$\ldots$ | $\ldots$ | |
$a_{2^{n-k}}$ | $a_{2^{n-k}}+c_1\;\;\ldots\;\;a_{2^{n-k}}+c_{2^k}$ | $S_{2^{n-k}}$ |

$c_1,c_2,\ldots,c_{2^k}$ denote all the codewords, so the standard array contains every possible received vector $y$.
Syndrome decoding (bit-flip errors only)
$y$: received vector (binary)
- compute the syndrome of $y$: $Hy^T$
- locate $S_i=Hy^T$ in the standard array, with coset leader $a_i$
- output the codeword $c=y-a_i$; $a_i$ is the error pattern

The syndrome decoder is a minimum-distance decoder: it maps $y$ to the closest codeword. Let $d_{min}(C)=d$; then all binary vectors of weight up to $\lfloor\frac {d-1}2\rfloor$ will be among the coset leaders.
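A minimal Python sketch of syndrome decoding (NumPy; practical only for tiny codes, since the standard array has $2^{n-k}$ rows), using the $H$ from the example above:

```python
import itertools
import numpy as np

def syndrome_table(H):
    """Map each syndrome (as a tuple) to a minimum-weight coset leader."""
    m, n = H.shape
    table = {}
    for w in range(n + 1):                   # error patterns by increasing weight
        for idx in itertools.combinations(range(n), w):
            e = np.zeros(n, dtype=int)
            e[list(idx)] = 1
            table.setdefault(tuple(np.mod(H @ e, 2)), e)
    return table

def syndrome_decode(y, H, table):
    s = tuple(np.mod(H @ y, 2))              # syndrome of the received vector
    return np.mod(y - table[s], 2)           # subtract the coset-leader error pattern

H = np.array([[1, 1, 0, 1, 0, 0],
              [0, 1, 0, 0, 1, 0],
              [0, 0, 1, 0, 0, 1]])
table = syndrome_table(H)
y = np.array([1, 0, 1, 0, 0, 0])             # received vector
print(syndrome_decode(y, H, table))          # closest codeword to y
```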
Maximum likelihood decoder
Consider a BSC($p$), $p<0.5$.
$$\Pr\{\text{receiving } y \mid c \text{ is transmitted}\}$$
Let $w=w_H(y-c)$, the number of positions in which $y$ and $c$ differ. Then
$$\Pr\{y \mid c\}=p^w(1-p)^{n-w}=\Big(\frac p {1-p}\Big)^w(1-p)^n$$
ML decoder $\Leftrightarrow$ maximize the probability $\Pr\{y \mid c\}$ over $c$ $\Leftrightarrow$ minimize $w$ $\Leftrightarrow$ minimum-distance decoder $\Leftrightarrow$ syndrome decoder.
For the BSC, these decoders are all the same.
Note that ML decoding has exponential (in n) complexity
Also, syndrome decoding needs to search within an array of exponential size $\Rightarrow$ exponential complexity.
LDPC code: a low-density parity-check code is a binary, linear block code for which the parity-check matrix is sparse (both rows and columns are sparse in terms of the number of 1's).
A regular LDPC code has an equal number of 1's in each row ($w_r$) and an equal number of 1's in each column ($w_c$).
Note that $w_c\cdot n=w_r\cdot m$ for $H_{m\times n}$.
With $m \ge n-k$ for an $(n,k)$ code, this code is referred to as a $(w_c, w_r)$ regular LDPC code.
Example:
A (2,4) regular LDPC code: $n=10$, $m=5$, $k=6$, $w_c=2$, $w_r=4$.
$$H=\begin{bmatrix} 1&1&1&1&0&0&0&0&0&0 \\ 1&0&0&0&1&1&1&0&0&0 \\ 0&1&0&0&1&0&0&1&1&0 \\ 0&0&1&0&0&1&0&1&0&1\\ 0&0&0&1&0&0&1&0&1&1 \end{bmatrix}_{5\times 10}$$
$$\mathrm{rank}(H)=n-k=4,\qquad k=n-\mathrm{rank}(H)=6$$
Gallager’s early work, Gallager’s decoder
There exists a sequence of (regular) LDPC codes with increasing length, positive rate $k/n>0$, and positive relative distance $d_{min}/n>0$.
Gallager's decoder (hard-decision bit-flipping decoder):
- fix a threshold $S$ (to be optimized)
- compute the syndrome bits $S_j$'s:
$$Hy^T=\begin{bmatrix} S_1 \\ S_2 \\ \vdots \\ S_m \end{bmatrix}$$
where $y$ is the received vector
- if all $S_j$'s are 0, then stop
- otherwise, for each bit $i$, $i=1,2,\ldots,n$: let $g_i$ be the number of non-zero syndromes that involve the $i$-th bit, and set $A=\{i : g_i>S\}$
- flip bit $i$ for all $i\in A$ and go back to step 2 (see the sketch below)
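A compact Python sketch of this bit-flipping decoder (NumPy; the threshold handling and names are mine, for illustration), tried on the (2,4) regular code above with a single bit-flip:

```python
import numpy as np

def gallager_bit_flip(H, y, S=1, max_iter=50):
    """Hard-decision bit flipping: flip every bit involved in more than S
    unsatisfied parity checks, then recompute the syndromes."""
    y = y.copy()
    for _ in range(max_iter):
        syndromes = np.mod(H @ y, 2)
        if not syndromes.any():
            break                            # all parity checks satisfied
        g = H.T @ syndromes                  # g[i] = # unsatisfied checks on bit i
        flip = g > S
        if not flip.any():
            break                            # no bit exceeds the threshold
        y[flip] ^= 1
    return y

H = np.array([[1,1,1,1,0,0,0,0,0,0],
              [1,0,0,0,1,1,1,0,0,0],
              [0,1,0,0,1,0,0,1,1,0],
              [0,0,1,0,0,1,0,1,0,1],
              [0,0,0,1,0,0,1,0,1,1]])
y = np.zeros(10, dtype=int)
y[3] = 1                                     # all-zero codeword with one flip
print(gallager_bit_flip(H, y, S=1))          # -> all zeros again
```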
Belief propagation Algorithm
Belief propagation (BP) is a type of message-passing algorithm. It uses a Tanner graph representation of the code (a bipartite graph):
one part has a variable node for each coded bit; the other part has a check node for each parity check.
There is an edge connecting $f_j$ to $x_i$ if the $(j,i)$ entry of the matrix $H$ is one.
For instance, over the AWGN channel, $y_i=(2{x_i}-1)+n_i$, $n_i\sim\mathcal N(0,\sigma^2)$.
If $f_1$ connects to $x_1, x_2, x_3$, then $x_1+x_2+x_3=0$; in general, each check enforces
$$x_i+x_{i'}+x_{i''}+\ldots=0$$
BP algorithm is an iterative decoding algorithm
In each iteration
- Each variable node sends a message to each check node
- each check node sends a message to each variable node
- each variable node updates its 'belief' about $x_i$
Goal of decoding: compute
$$P(x_i=0 \mid y_1,y_2,\ldots,y_n \text{ and all parity bits being } 0)$$
Also called: bit-MAP decoder.
$$q_{ij}(x)=P(x_i=x \mid y_i,\ \text{all the extrinsic information passed to } x_i \text{ from the check nodes other than } f_j)$$
$$r_{ji}(x)=P(\text{parity bit } f_j \text{ is satisfied} \mid x_i=x,\ \text{other bits } x_{i'} \text{ connected to } f_j \text{ (other than } x_i\text{) are distributed according to } q_{i',j})$$
How to compute $q, r$:
initialization:
$$q_{i,j}(x)=P(X_i=x \mid Y_i=y_i),\quad x\in \{0,1\}$$
The ratio $\frac {P(X_i=0 \mid Y_i=y_i)}{P(X_i=1 \mid Y_i=y_i)}$ is the likelihood ratio used for making decisions. In practice, we work with the log-likelihood ratio (LLR): if the LLR is positive, the ratio is $>1$ and we decide $x_i=0$.
Notations:
- $P_i=P(X_i=1 \mid Y_i=y_i)$; $L(X_i)$ is the corresponding LLR
- $R_j$: indices of the 1's in row $j$ of $H$
- $C_i$: indices of the 1's in column $i$ of $H$
- $R_{j \backslash i}$: $R_j$ excluding $i$ (for example, if row 1 is [0 1 1 0 1], then $R_{1 \backslash 2}=\{3,5\}$)
Lemma: let $a_1,a_2,\ldots,a_L$ be independent binary random variables with $P(a_i=1)=P_i$. Then (with the sum taken mod 2):
$$P\Big(\sum_{i=1}^L a_i=0\Big)=\frac 1 2+\frac 1 2 \prod^L_{i=1}(1-2P_i),\qquad P\Big(\sum_{i=1}^L a_i=1\Big)=\frac 1 2-\frac 1 2 \prod^L_{i=1}(1-2P_i)$$
Message passing:
$$r_{j,i}(0)=\frac 1 2 +\frac 1 2 \prod_{i' \in R_{j \backslash i}}(1-2q_{i',j}(1)),\qquad r_{j,i}(1)=1-r_{j,i}(0)$$
$$\frac{q_{i,j}(0)}{q_{i,j}(1)}=\frac {1-P_i}{P_i} \prod_{j'\in C_{i\backslash j}}\frac {r_{j',i}(0)}{r_{j',i}(1)}$$
$$L(q_{i,j})=\log\frac {q_{i,j}(0)}{q_{i,j}(1)},\qquad L(r_{j,i})=\log\frac {r_{j,i}(0)}{r_{j,i}(1)}$$
$$\Rightarrow \begin{cases} L(q_{i,j})=L(X_i)+\sum_{j'\in C_{i\backslash j}} L(r_{j',i})\\ L(r_{j,i})=2\tanh^{-1}\Big(\prod _{i' \in R_{j \backslash i}} \tanh\big(\tfrac 1 2 L(q_{i',j})\big)\Big)\\ \text{update the belief of the } X_i\text{'s:}\;\; L_{new}(X_i)=L(X_i)+\sum_{j\in C_i} L(r_{j,i}) \end{cases}$$
These steps can also be written as
$$\alpha_{i,j}=\mathrm{sign}(L(q_{i,j})),\qquad \beta_{i,j}=|L(q_{i,j})|$$
$$\phi(x)=\log\frac {e^x+1}{e^x-1},\qquad \phi \text{ is self-inverse: } \phi^{-1}=\phi$$
$$L(r_{j,i})=\prod_{i' \in R_{j\backslash i}} \alpha_{i',j}\ \cdot\ \phi^{-1}\Big(\sum_{i'\in R_{j\backslash i}}\phi(\beta_{i',j})\Big)$$
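As a sanity check on these two equivalent forms, a small Python sketch (NumPy) of one check-node update $L(r_{j,i})$, computed both with the tanh rule and with the self-inverse $\phi$; the two agree up to floating point:

```python
import numpy as np

def phi(x):
    return np.log((np.exp(x) + 1.0) / (np.exp(x) - 1.0))

def check_update_tanh(Lq):
    """L(r_{j,i}) from the incoming L(q_{i',j}), i' in R_j\\i (tanh rule)."""
    return 2.0 * np.arctanh(np.prod(np.tanh(Lq / 2.0)))

def check_update_phi(Lq):
    """Same update in sign/magnitude form, using the self-inverse phi."""
    return np.prod(np.sign(Lq)) * phi(np.sum(phi(np.abs(Lq))))

Lq = np.array([1.3, -0.7, 2.1])     # example incoming LLRs
print(check_update_tanh(Lq))        # ~ -0.303
print(check_update_phi(Lq))         # same value
```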
Min-Sum approximation: approximate the magnitude $\phi^{-1}\big(\sum_{i'\in R_{j\backslash i}}\phi(\beta_{i',j})\big)$ by $\min_{i'\in R_{j\backslash i}} \beta_{i',j}$; the exact magnitude is never larger than this minimum, so Min-Sum overestimates it.
Offset Min-Sum approximation:
$$L(r_{j,i})=\prod _{i'\in R_{j\backslash i}} \alpha_{i',j}\ \cdot\ \Big(\min_{i'\in R_{j\backslash i}} \beta_{i',j}-\alpha\Big)$$
$\alpha$: a constant to be optimized per application.
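And a corresponding sketch of the Min-Sum and offset Min-Sum check-node updates (Python; the offset $\alpha$ is the per-application constant mentioned above):

```python
import numpy as np

def check_update_min_sum(Lq, offset=0.0):
    """Min-Sum check-node update; offset=0 is plain Min-Sum,
    offset>0 is offset Min-Sum (magnitude floored at 0)."""
    sign = np.prod(np.sign(Lq))
    return sign * max(np.min(np.abs(Lq)) - offset, 0.0)

Lq = np.array([1.3, -0.7, 2.1])
print(check_update_min_sum(Lq))              # -0.7 (exact value is ~ -0.303)
print(check_update_min_sum(Lq, offset=0.3))  # -0.4
```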
Example:
BP decoding with the Min-Sum approximation, for a (2,3) regular LDPC code:
$$H=\begin{bmatrix} 1 &1& 1& 0& 0 &0 \\ 1 &0&0&1&1&0\\ 0 &1&0&1&0&1\\ 0 &0&1&0&1&1 \end{bmatrix}_{4\times 6}$$
$$n=6,\; m=4,\; k=3,\; w_c=2,\; w_r=3,\qquad \mathrm{rank}(H)=3\implies k=6-3=3$$
Tanner graph representation
A '1' in the $H$ matrix means there is a connection between the corresponding check node and variable node.
Suppose we have
$$L(X_i)=-1,\,2,\,3,\,-4,\,4,\,1 \quad\text{for } i=1,2,3,4,5,6$$
What is the updated belief of each $X_i$ after one iteration of BP with the Min-Sum approximation?
Write down the $q_{i,j}$ matching the connections: initially $q_{i,j}=L(X_i)$ (in the figure, in black; ignoring the arrow direction, $q_{i,j}$ is the message from a 'circle' (variable node) to a 'square' (check node)).
Find the $r_{j,i}$ (in the figure, in red).
Compute the updated beliefs $L^{(new)}(X_i)$.
Complexity: BP can be used (in principle) to decode any linear code given $H$. For an LDPC code of 'constant' degree (with respect to $n$), the complexity of each iteration is $O(n)$; for a general code it is $O(n^2)$.
The exact LLR calculation (max-product) is rather complex and is often approximated, but the approximation works well when there are only 'a few' terms (in the sum of $\phi$'s).
If the length of the shortest cycle is $2l$, the LLR equations hold for up to $l$ iterations.
Another issue with BP for a general code is that the Tanner representation is dense $\Rightarrow$ it will have too many short cycles. Short cycles adversely affect the performance of BP, since the independence of the $r_{j,i}$'s (for a fixed $i$) or the $q_{i,j}$'s (for a fixed $j$) would be violated.
- When to stop?
  In practice, after a fixed number of iterations (usually in the range between 5 and 20).
- Early stopping:
  check $H$ to see if the parity-check equations are satisfied after making hard decisions on the $X_i$'s (according to the updated beliefs).
- Drawback: this check is expensive.
CRC: cyclic redundancy check
An $m\times n$ parity-check matrix;
$$\mathrm{LLR}=\log\frac{P(X_i=0\mid Y_j\text{'s})}{P(X_i=1\mid Y_j\text{'s})}$$
If $\mathrm{LLR}>0$: $\hat{X_i}=0$; if $\mathrm{LLR}<0$: $\hat{X_i}=1$.
This step is called making the hard decision.
Form the hard-decision vector $\hat x=(\hat{X_1}, \hat{X_2},\ldots,\hat{X_n})$. If $H\hat x^T$ equals 0, the decoder is done and outputs the $\hat{X_i}$'s; if it does not equal 0, the decoder still needs to continue.
This check is itself complex: comparable to the complexity of one iteration of BP decoding.
So we want an alternative (cheaper) stopping rule, called CRC: cyclic redundancy check.
A few extra bits of redundancy (8,12,16,24) using a cyclic code - an algebraic code.
If the CRC is 8 bits:
$$H'=\begin{bmatrix} 1\;0\;0\;1\,\ldots\\ 0\;1\;0\;0\;1\,\ldots\\ \ldots \end{bmatrix}_{8\times n}$$
Each row of the matrix $H'$ is like a cyclic shift; the CRC length is $l$ (here $l=8$).
The overall $H$ of the code + CRC is
$$\begin{bmatrix}H\\H'\end{bmatrix}_{(m+l)\times n}$$
For an $(n,k)$ linear code + $l$ bits of CRC, the number of information bits is $k-l$:
$$\begin{bmatrix} H_{(n-k)\times n}\\H'_{l\times n} \end{bmatrix}_{(n-k+l)\times n}$$
We can check the CRC equations at the end of each iteration: if the CRC passes, stop the decoder; if the CRC does not pass, run the next iteration.
At the end of decoding, we also check the CRC to see if we have reached a 'valid' codeword $c$.
valid codeword:
$$Hc^T=0\ (\text{valid code}),\qquad H'c^T=0\ (\text{valid CRC})$$
Probability of CRC failure (an undetected error): $\frac 1 {2^l}$.
Transport block: each code block has its own CRC, and the entire transport block has another CRC.
LDPC code over BEC
when decoding over BEC, LLRs do not matter as each coded bit is either known or erased.
$$q_{ij}= \begin{cases} X_i &\text{if } X_i \text{ is known} \\ e &\text{if } X_i \text{ is erased} \end{cases}$$
$$r_{ji}=\sum_{i'\in R_{j\backslash i}} X_{i'}= \begin{cases} \text{known} &\text{if all } X_{i'}\ (i'\in R_{j\backslash i}) \text{ are known} \\ e &\text{otherwise} \end{cases}$$
Update belief: if $X_i$ is erased, $X_i$ becomes known if at least one of the $r_{ji}$'s, $j\in C_i$, is known (set $X_i$ to that known $r_{j,i}$); otherwise it remains erased.
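A small Python sketch of this erasure message passing (equivalently, 'peeling': repeatedly solve any check with exactly one erased neighbor; the erasure marker $-1$ and the names are my choices):

```python
import numpy as np

def bec_decode(H, y):
    """y entries: 0, 1, or -1 for an erasure. Repeatedly solve any check
    whose neighbors are all known except one."""
    y = y.copy()
    progress = True
    while progress:
        progress = False
        for row in H:
            idx = np.flatnonzero(row)            # bits in this check
            erased = [i for i in idx if y[i] < 0]
            if len(erased) == 1:                 # exactly one unknown bit
                known = [i for i in idx if y[i] >= 0]
                y[erased[0]] = int(np.sum(y[known])) % 2
                progress = True
    return y                                     # remaining -1's form a stopping set

H = np.array([[1,1,1,0,0,0],
              [1,0,0,1,1,0],
              [0,1,0,1,0,1],
              [0,0,1,0,1,1]])
y = np.array([0, -1, 0, -1, 0, 0])               # two erasures
print(bec_decode(H, y))                          # both recovered -> all zeros
```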
Example:
A stopping set is a set of erased variables that cannot be corrected regardless of the other variables (even if all the others are known).
How can this happen?
Let $G$ denote the set of neighbors of the stopping set $V$; then every check node in $G$ is connected to at least two variable nodes in $V$.
The minimum stopping set $V_{min}$ is the stopping set containing the fewest variable nodes.
The code can then correct up to $|V_{min}|-1$ erasures.
$$\Rightarrow\;\;d_{min}\ge|V_{min}|$$
(a code of minimum distance $d_{min}$ can correct up to $d_{min}-1$ erasures)
Density evolution $\rightarrow$ over BEC($p$)
Consider a $(w_c, w_r)$ regular LDPC code. Let $\varepsilon_l$ denote the probability that a variable node remains erased after the $l$-th iteration (assuming independence, valid for $l\le L/2$, where $L$ is the length of the shortest cycle, also referred to as the 'girth' of the Tanner graph).
$$\varepsilon_0=p,\qquad \varepsilon_l=p\cdot\big(1-(1-\varepsilon_{l-1})^{w_r-1}\big)^{w_c-1}\quad\text{for } l\ge1$$
- $p$: the probability that bit $X_i$ is originally erased by the channel
- $(1-\varepsilon_{l-1})^{w_r-1}$: the probability that all $X_{i'}$, $i'\in R_{j\backslash i}$, are not erased
- $1-(1-\varepsilon_{l-1})^{w_r-1}$: the probability that $r_{ji}$ is erased
Given the degree distribution $(w_c, w_r)$, the threshold $\varepsilon^*$ is the maximum $p$ for which $\varepsilon_l\rightarrow0$ as $l\rightarrow \infty$.
(As $n$ grows large, the girth grows large, so the independence assumption holds for more iterations.)
Example:
For the (3,6) regular LDPC code, as $n\rightarrow \infty$, we have $\varepsilon^*=0.4294$. The corresponding capacity is $1-0.4294=0.5706$, while the rate is $R=\frac 1 2<0.5706$: the threshold is below the Shannon limit.
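The threshold can be found numerically from the recursion above; a minimal Python sketch (bisection over $p$) that reproduces $\varepsilon^*\approx0.4294$ for the (3,6) code:

```python
def converges(p, wc, wr, iters=5000, tol=1e-9):
    """Density-evolution recursion for BEC(p); True if eps_l -> 0."""
    eps = p
    for _ in range(iters):
        eps = p * (1.0 - (1.0 - eps) ** (wr - 1)) ** (wc - 1)
        if eps < tol:
            return True
    return False

def threshold(wc, wr):
    lo, hi = 0.0, 1.0
    for _ in range(40):              # bisection on the channel erasure probability
        mid = (lo + hi) / 2
        if converges(mid, wc, wr):
            lo = mid
        else:
            hi = mid
    return lo

print(round(threshold(3, 6), 4))     # -> approximately 0.4294
```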
For $(d,2d)$ regular LDPC codes, $\varepsilon^* \rightarrow 0.5$ as $d \rightarrow \infty$ (we can approach capacity).
Assuming a random ensemble of regular $(d,2d)$ LDPC codes, the girth grows large as $n$ grows large, with probability 1.
Note that:
- Density evolution only describes the asymptotic performance of the random ensemble of codes.
- Goals of a good $H$ design: $\begin{cases} \text{high girth} \\ \text{high (or almost full) rank}\\ \text{large minimum stopping set} \end{cases}$
Channel coding techniques in 5G systems
Structure of LDPC in 5G
Protograph LDPC codes: lifting operation using a base matrix
For a lifting operation of size $z$, each check node is replaced by $z$ check nodes (and each variable node by $z$ variable nodes). Then each edge in the Tanner graph is replaced by a shifted permutation matrix.
This preserves the degree distribution of the Tanner graph (regardless of $z$).
Lifting size: $z$.
Each entry in the base matrix is a number from $\{-1, 0, 1,\ldots,z-1\}$:
- $-1\Rightarrow$ no edges: the $z \times z$ all-zero matrix
- $0 \Rightarrow$ the identity matrix
- $i$, for $i=1,2,\ldots,z-1$ $\Rightarrow$ the permutation matrix obtained by cyclically shifting the identity by $i$
Example:
There are two types of base graphs/matrices in the 5G LDPC code: $B_1$ of size $46 \times 68$, and $B_2$ of size $42 \times 52$.
Lifting sizes go up to 384, so the maximum block length supported by 5G LDPC is $384 \times 68 = 26112$.
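A sketch (Python/NumPy) of the lifting operation on a small, made-up base matrix (not one of the 5G base graphs): each entry becomes a $z\times z$ all-zero block ($-1$) or a cyclically shifted identity:

```python
import numpy as np

def lift(base, z):
    """Expand a protograph base matrix into H: entry -1 -> z x z zero block,
    entry s >= 0 -> identity cyclically shifted by s."""
    I = np.eye(z, dtype=int)
    return np.block([[np.zeros((z, z), dtype=int) if s < 0 else np.roll(I, s, axis=1)
                      for s in row] for row in base])

base = [[0, 1, -1],
        [2, -1, 0]]          # a toy base matrix, for illustration only
H = lift(base, z=3)
print(H.shape)               # (6, 9)
print(H)
```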
Polar codes (channel-dependent)
Channel polarization theory: let $W$ denote the channel BEC($p$);
$W(y|x)$ denotes the probability of receiving $y$ given $x$:
$$W(0|0)=1-p,\quad W(e|0)=p,\quad W(1|0)=0$$
$$W(1|1)=1-p,\quad W(e|1)=p,\quad W(0|1)=0$$
now consider two channels, the channel that u 1 u_1 u1 observes and the channel that u 2 u_2 u2 observes assuming u 1 u_1 u1 is known
$w^-$ is also a BEC, with erasure probability $1-(1-p)^2=2p-p^2$, because $u_1$ is known/decoded if and only if both $y_1\,(=u_1+u_2)$ and $y_2\,(=u_2)$ are non-erasures:
$$u_1=y_1+y_2=u_1+u_2+u_2=u_1$$
$w^+$ is also a BEC, with erasure probability $p^2$, because $u_2$ is decoded if either $y_1$ or $y_2$ is a non-erasure (if both $y_1$ and $y_2$ are erased, $u_2$ is unknown).
Note:
- the sum-capacity is preserved (the capacity of BEC($p$) is $1-p$):
$$c(w^-)+c(w^+)=1-2p+p^2+1-p^2=2(1-p)=2c(w)$$
- also, $2p-p^2>p^2$ for $0<p<1$, so $w^+$ is better than $w^-$
Channel splitting operation
Then:
- $w^{++}$: input $u_4$, output $(y_1, y_2, y_3, y_4, u_1, u_2, u_3)$
- $w^{+-}$: input $u_3$, output $(y_1, y_2, y_3, y_4, u_1, u_2)$
- $w^{-+}$: input $u_2$, output $(y_1, y_2, y_3, y_4, u_1)$
- $w^{--}$: input $u_1$, output $(y_1, y_2, y_3, y_4)$
This can continue recursively: for $n=2^m$, this is called the polarization transform of length $n$, denoted $p^{(n)}$ (the recursion steps from $n$ to $2n$).
$w^{++\ldots+-\ldots}\leftrightarrow w^{(i)}$ by mapping $i-1$ into a binary representation of length $m$ ($m=\log_2 n$) and replacing '1' by '+' and '0' by '−'.
The sum-capacity is preserved:
$$\sum_{i=1}^{n} c(w^{(i)})=n\cdot c(w)$$
(for symmetric channels)
The proof is by the chain rule of mutual information, assuming the input bits $u_i$ are uniform i.i.d. (independent and identically distributed).
polarization tree
Channel polarization: as $n$ grows large, the bit-channels become either completely noiseless (capacity goes to one) or completely noisy (capacity goes to zero), except for a vanishing fraction of bit-channels.
Furthermore, the fraction of noiseless channels $\rightarrow c(w)$.
Example:
BEC(0.5), n=4
Let $z^{(i)}$ denote the erasure probability of $w^{(i)}$.
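A short Python sketch computing the $z^{(i)}$ for a BEC by applying $z\mapsto 2z-z^2$ (the '−' branch) and $z\mapsto z^2$ (the '+' branch) recursively:

```python
def bec_bit_channels(p, m):
    """Erasure probabilities z^(i) of the n = 2^m bit-channels of BEC(p),
    in the order w^(1), ..., w^(n)."""
    z = [p]
    for _ in range(m):
        z = [f(x) for x in z for f in (lambda x: 2*x - x*x,   # '-' branch
                                       lambda x: x*x)]        # '+' branch
    return z

print(bec_bit_channels(0.5, 2))   # n=4 -> [0.9375, 0.5625, 0.4375, 0.0625]
```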
A proof of polarization for the BEC
There are $n=2^m$ bit-channels; recall that the sum-capacity is preserved, $\sum_{i=1}^n c(w^{(i)})=n\cdot c(w)$. Define
$$T_n=\frac 1 n\sum_{i=1}^n z^{(i)}\big(1-z^{(i)}\big)$$
where $z^{(i)}$ is the erasure probability of $w^{(i)}$, the $i$-th bit-channel. It is sufficient to prove that
$$\lim_{n\rightarrow\infty} T_n=0$$
since $z(1-z)$ is small only when $z$ is near 0 or 1, i.e., when the bit-channel is nearly perfect or nearly useless.
For one polarization step:
$$z^2(1-z^2)+(2z-z^2)(1-2z+z^2)=2z(1-z)\big(1-z(1-z)\big)$$
Define $\alpha_i=z^{(i)}\big(1-z^{(i)}\big)$.
$$T_{2n}=\frac 1 {2n}\sum_i 2\alpha_i(1-\alpha_i)=\frac 1 n \sum_i \big(\alpha_i-\alpha^2_i\big)= \frac 1 n\sum_i \alpha_i-\frac 1 n\sum_i \alpha_i^2 \,\le\, T_n-T_n^2$$
using the lemma (Cauchy–Schwarz):
$$\frac {\sum_i \alpha_i^2} n\ge\Big(\frac {\sum_i \alpha_i} n\Big)^2$$
Note that the sequence $\{T_n\}_{n\ge1}$ is positive and strictly decreasing $\rightarrow \lim_{n\rightarrow\infty} T_n$ exists.
Let $T_\infty=\lim_{n\rightarrow\infty} T_n$; taking limits in $T_{2n}\le T_n-T_n^2$ gives $T_\infty\le T_\infty-T^2_\infty\Rightarrow T_\infty=0$.
Now let
$$\beta_n=\frac {|\{i:\varepsilon\le z^{(i)}\le1-\varepsilon\}|}n$$
for some $\varepsilon>0$ (the fraction of un-polarized bit-channels). Note that
$$T_n\ge\beta_n\cdot\varepsilon(1-\varepsilon)$$
so for any fixed $\varepsilon$, $\beta_n\rightarrow0$ since $T_n\rightarrow0$.
Polarization transform
$$x_1=u_1+u_2,\quad x_2=u_2 \qquad\rightarrow\qquad [x_1\;\;x_2]=[u_1\;\;u_2]\begin{bmatrix}1&0\\1&1 \end{bmatrix}=[u_1\;\;u_2]\,G_2$$
$$G_{2n}=\begin{bmatrix} G_n&0_{n\times n}\\G_n&G_n \end{bmatrix}=G_2\otimes G_n,\qquad G_n=\underbrace{G_2\otimes G_2\otimes \ldots\otimes G_2 }_{\text{$m$ times, } m=\log_2 n} = G_2^{\otimes m} \;\;\text{(Kronecker power)}$$
Kronecker product of $A_{m\times n}$ and $B_{p \times q}$:
$$A\otimes B=\begin{bmatrix} a_{11}B&\ldots&a_{1n}B\\ \vdots& &\vdots\\a_{m1}B&\ldots&a_{mn}B \end{bmatrix}_{mp\times nq}$$
$$G_4=\begin{bmatrix} 1&0&0&0\\ 1&1&0&0\\1&0&1&0\\1&1&1&1\end{bmatrix}_{4\times 4}$$
(Replacing '0' with '$-1$' yields a Hadamard matrix.)
$$\Rightarrow G_n^{-1}=G_n,\;\text{ i.e., } G_n\times G_n=I_{n\times n}\ \text{over GF(2)}:\ G_n\ \text{is self-inverse}$$
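A quick NumPy sketch constructing $G_n$ as a Kronecker power and checking that it is self-inverse over GF(2):

```python
import numpy as np
from functools import reduce

def polar_G(m):
    """G_n = G_2 Kronecker-powered m times, n = 2^m."""
    G2 = np.array([[1, 0],
                   [1, 1]])
    return reduce(np.kron, [G2] * m)

G8 = polar_G(3)
print(np.array_equal(np.mod(G8 @ G8, 2), np.eye(8, dtype=int)))  # True: self-inverse
```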
Encoding complexity
$u_{1\times n}\times G_n$ can be done with $O(n\log n)$ complexity:
```
function x = G_multiplier(u)
% computes x = u * G_n over GF(2), where n = length(u) is a power of 2
n = length(u);
if n == 1
    x = u;
    return
end
x1 = G_multiplier(u(1:n/2));      % left half
x2 = G_multiplier(u(n/2+1:n));    % right half
x = [mod(x1 + x2, 2), x2];        % combine: [x1+x2, x2]
end
```
(The computations of x1 and x2 can be done in parallel; x1 + x2 is entry-wise addition.)
Output of the function: $x_{1\times n}=u_{1\times n}G_n$.
$f(n)$ = number of operations to compute $u_{1\times n}G_n$:
$$\begin{cases} f(n)=2f\big(\frac n 2\big)+\frac n 2\Rightarrow f(n)=\frac {n\log_2 n} 2 \\ f(1)=0 \end{cases}$$
Latency (time needed, assuming parallelization): let $g(n)$ denote the latency of computing $uG$ with the function G_multiplier:
$$g(n)=g\big(\frac n 2\big)+1\Rightarrow g(n)=\log_2 n\quad\text{(fast enough)}$$
polar code construction
Given length $n$, dimension $k$, and channel $w$:
pick the indices of the $k$ 'best' bit-channels $w^{(i)}$ in the polarization transform of length $n$.
The generator matrix for the $(n,k)$ polar code associated with $w$: from the matrix $G_{n\times n}$, select the rows indexed by the 'good' bit-channels.
Polar encoder
example:
$n=8$, $k=4$, for BEC(0.5).
$k=4$: pick the 4 best bit-channel indices: 4, 6, 7, 8.
$$u_{1\times 8}=[0\;\;0\;\;0\;\;m_1\;\;0\;\;m_2\;\;m_3\;\;m_4]$$
The message bits are $m_1, m_2, m_3, m_4$.
$\Rightarrow$ compute $u_{1\times 8}G_8$ to get the encoded codeword.
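Putting the pieces together, a Python sketch of this encoder (frozen positions set to 0, message bits in positions 4, 6, 7, 8, then multiply by $G_8$):

```python
import numpy as np
from functools import reduce

G8 = reduce(np.kron, [np.array([[1, 0], [1, 1]])] * 3)   # G_8 = Kronecker cube of G_2
good = [3, 5, 6, 7]                 # bit-channel indices 4, 6, 7, 8 (0-based)

def polar_encode(msg):
    u = np.zeros(8, dtype=int)      # frozen positions stay 0
    u[good] = msg
    return np.mod(u @ G8, 2)        # codeword x = u G_8

print(polar_encode(np.array([1, 0, 1, 1])))
```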
Decoding polar codes
Successive cancellation decoder
Let $A$ denote the set of indices of 'good' bit-channels selected for the code construction, $A\subseteq\{1,2,\ldots,n\}$. For $i=1,2,\ldots,n$, let $\hat u_i$ be the decoded version of $u_i$:
$$\hat u_i=\begin{cases} 0 &\text{if } i\notin A \\ \text{ML decision of } u_i \text{ given } y_1,y_2,\ldots,y_n \text{ and } \hat u_1,\hat u_2,\ldots,\hat u_{i-1} &\text{if } i\in A \end{cases}$$
Let $Pe(u_i)=Pe(w^{(i)})$ denote the probability of error in decoding $u_i$, assuming $\hat u_1^{i-1}=u_1^{i-1}$ (where $u_1^{i-1}$ denotes $u_1,u_2,\ldots,u_{i-1}$).
Lemma: Pe(the polar code associated with $A$ and decoded with SC) $\le \sum_{i\in A}Pe(u_i)$, where $Pe(u_i)$ is the probability of error of the individual bit-channel.
Proof: by the union bound on the error events $\hat u_i\ne u_i$ for the first (smallest) such $i$.
Going back to the construction of polar codes, there are two criteria:
- For a fixed rate: sort the bit-channels and pick the best $k=nR$, where $R$ is the given rate. That is, given block length $n=2^m$ and rate $R$, the dimension is $k=nR$; take the polarization transform of length $n$, split it into $n$ bit-channels $w^{(1)}, w^{(2)},\ldots,w^{(n)}$, sort them (according to capacity or probability of erasure), pick the best $k$ of them, and let $A=$ the set of indices of the selected/good ones.
- For a given bound on $Pe$: Pe(polar code associated with $A$ under SC) $\le \sum_{i\in A}Pe(u_i)$, where $Pe(u_i)$ is for bit-channel $w^{(i)}$. Sort the bit-channels from best to worst, $u_{\pi(1)}, u_{\pi(2)},\ldots,u_{\pi(n)}$ (the sorting permutation is according to $Pe(u_i)$; $u_n$ is always the best, $\pi(1)=n$, and $u_1$ is always the worst, $\pi(n)=1$). Then accumulate as many $u_{\pi(i)}$'s as possible, starting from $\pi(1)$, until the sum $\sum_{i=1}^k Pe(u_{\pi(i)})$ reaches the bound on $Pe$ (see the sketch below).
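A sketch of this selection rule in Python (using the BEC bit-channel erasure probabilities as the $Pe(u_i)$'s, as these notes do; function names are mine):

```python
def bec_bit_channels(p, m):
    """Erasure probabilities z^(i) of the n = 2^m bit-channels of BEC(p)."""
    z = [p]
    for _ in range(m):
        z = [f(x) for x in z for f in (lambda x: 2*x - x*x, lambda x: x*x)]
    return z

def select_good_channels(z, pe_bound):
    """Add bit-channels from best (smallest Pe) to worst while the union
    bound on Pe stays below pe_bound."""
    order = sorted(range(len(z)), key=lambda i: z[i])
    A, total = [], 0.0
    for i in order:
        if total + z[i] >= pe_bound:
            break
        A.append(i + 1)             # 1-based bit-channel index
        total += z[i]
    return sorted(A), total

A, pe = select_good_channels(bec_bit_channels(0.5, 3), 1/3)
print(A, pe)                        # -> [6, 7, 8] 0.31640625  (= 81/256 < 1/3)
```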
Example:
For $n=8$, $k=4$, BEC($\frac 1 2$), with the requirement $Pe<\frac 1 3$:
$$\sum Pe=\frac 1 {256}+\frac {31} {256}+\frac {49} {256}=\frac {81} {256}<\frac {1} {3}\quad\text{(good)}$$
but adding $Pe(u_{\pi(4)})$ would make $\sum Pe$ greater than $\frac 1 3$; so $k=3$, and the set of good bit-channels is $A=\{8,7,6\}$.