Reference:
Elements of Information Theory, 2nd Edition
Slides of EE4560, TUD
Content
For a stationary discrete source, the minimum number of bits needed to represent the source signal with arbitrarily small probability of error is given by the entropy rate $H_\infty(X)$.
In many situations, however, it is not necessary to perfectly represent the source signal.
For instance, the description of an arbitrary real number requires an infinite number of bits, so a finite representation of a continuous random variable can never be perfect.
How well can we do? $\to$ Define the “goodness” of a representation of a source $\to$ Define a distortion measure.
Given a source distribution and a distortion measure,
- What is the minimum expected distortion achievable at a particular bit rate? $D(R)$
- What is the minimum rate description required to achieve a particular distortion? $R(D)$
Quantization
Let $\hat X(X)$ denote the representation of the random variable $X$. Using $R$ bits to represent $X$, the function $\hat X$ can take on $2^R$ values.
Problem: find the optimal set of values for $\hat X$ and the regions associated with each value of $\hat X$.
An $L$-level quantizer is characterized by a set of $L+1$ decision levels or decision thresholds $x_0<x_1<\cdots<x_L$ and a set $\hat{\mathcal X}=\{\hat x_k,\ k=1,\cdots,L\}$ such that $\hat x=\hat x_k$ if and only if $x_{k-1}\le x<x_k$, where $x_0=-\infty$ and $x_L=\infty$.
The numbers $\hat x_k$ are called the reconstruction values or reproduction levels, and the intervals $\mathcal C_k=[x_{k-1},x_k)$ are usually referred to as the decision intervals or quantization cells.
The map $\hat X:\mathcal X\mapsto\hat{\mathcal X}$, given by
$$\hat X(x)=\hat x_k\quad\text{for }x\in\mathcal C_k,\ k=1,\cdots,L,$$
is a staircase function by definition.
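As a sketch, the staircase map can be written directly in code. The thresholds and reproduction levels below are hypothetical values for a 4-level quantizer, chosen only for illustration:

```python
import numpy as np

# Hypothetical 4-level quantizer: interior thresholds x_1 < x_2 < x_3
# (x_0 = -inf and x_L = +inf are implicit) and levels xhat_1 .. xhat_4.
thresholds = np.array([-1.0, 0.0, 1.0])
levels = np.array([-1.5, -0.5, 0.5, 1.5])

def quantize(x):
    """Map each x to the reproduction level of the cell C_k containing it."""
    # side="right" implements the half-open convention x_{k-1} <= x < x_k
    k = np.searchsorted(thresholds, x, side="right")
    return levels[k]

print(quantize(np.array([-2.3, -0.2, 0.7, 5.0])))  # -> [-1.5 -0.5  0.5  1.5]
```

Plotting `quantize` over a range of inputs would reproduce the staircase shape described above.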
In order to find an optimal quantizer, that is, to find optimal decision and reproduction levels, we need a rule for quantitatively assigning a distortion value to every possible approximation of the source samples.
Definition 1 (distortion measure):
A distortion function or distortion measure is a mapping
$$d:\mathcal X\times\hat{\mathcal X}\mapsto\mathbb R^{+}$$
from the set of source alphabet–reproduction alphabet pairs into the set of nonnegative numbers. The distortion $d(x,\hat x)$ is a measure of the cost of representing the symbol $x$ by the symbol $\hat x$.
Examples:
- Hamming distortion (probability-of-error distortion measure)
$$d(x,\hat x)=\begin{cases}0&\text{if }x=\hat x\\ 1&\text{if }x\neq\hat x\end{cases}$$
$$E\,d(X,\hat X)=\Pr(X=\hat X)\cdot 0+\Pr(X\neq\hat X)\cdot 1=\Pr(X\neq\hat X)$$
- Squared-error distortion
$$d(x,\hat x)=(x-\hat x)^2$$
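Both measures are one-liners in code; the small sketch below uses made-up symbol sequences to show that the empirical mean of the Hamming distortion is exactly the empirical error probability:

```python
import numpy as np

def hamming_distortion(x, xhat):
    """0 where symbols agree, 1 where they differ; E d = Pr(X != Xhat)."""
    return np.where(np.asarray(x) == np.asarray(xhat), 0, 1)

def squared_error(x, xhat):
    return (np.asarray(x) - np.asarray(xhat)) ** 2

# Illustrative binary sequences: mismatches at positions 2 and 4
x    = np.array([0, 1, 1, 0, 1])
xhat = np.array([0, 1, 0, 0, 0])
print(hamming_distortion(x, xhat).mean())  # empirical Pr(X != Xhat) = 0.4
print(squared_error(x, xhat).mean())       # coincides here, since errors are +/-1
```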
Assume a squared-error distortion measure. What are the optimal reproduction levels and optimal quantization cells?
That is, we wish to find the function $\hat X(X)$ such that $\hat X$ takes on at most $L=2^R$ values and minimizes $E(X-\hat X)^2$:
$$E(X-\hat X)^2=\sum_{k=1}^{L}\int_{\mathcal C_k}(x-\hat x_k)^2\,p(x)\,dx \tag{1}$$
- If the quantization cells $\mathcal C_k$ are known:
The optimal reproduction levels are found by
$$\left.\frac{\partial E(X-\hat X)^2}{\partial\hat x_k}\right|_{\hat x_k=\hat x_k^*}=-2\int_{x\in\mathcal C_k}(x-\hat x_k^*)\,p(x)\,dx=0,$$
so that
$$\hat x_k^*=\frac{\int_{x\in\mathcal C_k}x\,p(x)\,dx}{\int_{x\in\mathcal C_k}p(x)\,dx}.$$
Since
$$\int_{x\in\mathcal C_k}p(x)\,dx=\Pr(x\in\mathcal C_k),$$
we have, using Bayes’ rule, that
$$\frac{p(x)}{\Pr(x\in\mathcal C_k)}=\frac{p(x\mid x\in\mathcal C_k)}{\Pr(x\in\mathcal C_k\mid x)},$$
so that
$$\hat x_k^*=\int_{x\in\mathcal C_k}x\,\frac{p(x)}{\Pr(x\in\mathcal C_k)}\,dx=\int_{x\in\mathcal C_k}x\,\frac{p(x\mid x\in\mathcal C_k)}{1}\,dx=E(X\mid x\in\mathcal C_k). \tag{2}$$
This is the conditional mean or centroid of the quantization cell $\mathcal C_k$.
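Equation (2) can be checked numerically. As a worked example (assuming a standard Gaussian source, which the text does not fix), take the cell $\mathcal C_k=[0,\infty)$: the centroid should equal $E(X\mid X\ge 0)=\sqrt{2/\pi}\approx 0.798$.

```python
import math

def phi(x):
    """Standard normal density p(x)."""
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

# Midpoint rule on [0, 10], which approximates the cell [0, oo)
a, b, n = 0.0, 10.0, 200_000
h = (b - a) / n
xs = [a + (i + 0.5) * h for i in range(n)]
num = sum(x * phi(x) for x in xs) * h   # integral of x p(x) over C_k
den = sum(phi(x) for x in xs) * h       # Pr(x in C_k), here 1/2
print(num / den, math.sqrt(2 / math.pi))
```

The two printed values agree to several decimal places, confirming the centroid formula for this cell.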
- If the reproduction levels $\hat x_k$ are known:
Given a set $\{\hat x_i\}$ of reconstruction points, the distortion is minimized by mapping a source random variable to the representation $\hat x_i$ that is closest to it. The partition of $\mathcal X$ into regions defined by this mapping is called a Voronoi partition.
- The Voronoi regions are determined by the optimal reproduction points, whereas the optimal reproduction points are obtained given the Voronoi regions. How can this circular problem be solved?
Iterative descent algorithm (Lloyd, 1957):
- start with an initial collection of reproduction points
- optimize the partitions for these levels by using a minimum distortion mapping (nearest neighbour quantization)
- optimize the set of reproduction levels for the given partition (replace the old values by the centroids of the partition cells)
The alternation is continued until convergence to a local, if not global, optimum.
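The three steps above can be sketched in a few lines. The sketch below assumes a standard Gaussian source, approximated by training samples, with $L=4$ levels (i.e. $R=2$ bits); the sample size and initial levels are arbitrary choices, not part of the algorithm:

```python
import numpy as np

rng = np.random.default_rng(0)
samples = rng.normal(size=50_000)      # empirical stand-in for p(x)
L = 4
levels = np.linspace(-2.0, 2.0, L)     # initial reproduction points

for _ in range(100):
    # Step 1: nearest-neighbour (Voronoi) partition for the current levels
    cells = np.argmin(np.abs(samples[:, None] - levels[None, :]), axis=1)
    # Step 2: replace each level by the centroid of its partition cell
    new = np.array([samples[cells == k].mean() if np.any(cells == k)
                    else levels[k] for k in range(L)])
    if np.allclose(new, levels, atol=1e-6):   # converged (local optimum)
        break
    levels = new

cells = np.argmin(np.abs(samples[:, None] - levels[None, :]), axis=1)
mse = np.mean((samples - levels[cells]) ** 2)
print(np.sort(levels), mse)
```

For a Gaussian source the levels should approach the known Lloyd–Max optimum for $L=4$ (approximately $\pm 0.45$ and $\pm 1.51$, distortion about $0.118$), up to sampling noise and the possibility of a merely local optimum.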
Instead of quantizing a single random variable, let us assume that we are given a set of $n$ i.i.d. random variables $X_1,\ldots,X_n$ drawn from a Gaussian distribution, which we want to represent by $nR$ bits:
- we will represent the entire sequence by a single index taking $2^{nR}$ values;
- this treatment of entire sequences at once achieves a lower distortion for the same rate than independent quantization of the individual samples.
Apparently, rectangular grid points (arising from independent descriptions) do not fill up the space efficiently:
Definition 2 (dimensionless normalized second moment of inertia):
Let $\nu$ denote the volume of a quantization cell. The dimensionless normalized second moment of inertia $G(\mathcal C_k)$ of a quantization cell is defined by
$$G(\mathcal C_k)=\frac{1}{n\,\nu^{1+2/n}}\int_{\mathcal C_k}\left\|x-\hat x_k\right\|^2\,dx.$$
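As a quick sanity check of this definition, consider one dimension ($n=1$): a cell is an interval of length $\nu$, and with the reproduction point at its midpoint the normalization removes all dependence on $\nu$, leaving the familiar factor $1/12$:

```latex
G(\mathcal C_k)
  = \frac{1}{1\cdot\nu^{1+2}} \int_{-\nu/2}^{\nu/2} x^{2}\,dx
  = \frac{1}{\nu^{3}} \cdot \frac{\nu^{3}}{12}
  = \frac{1}{12}.
```

Cells whose shape packs space more efficiently than a cube have $G$ below that of the cube, which is what makes vector quantization in higher dimensions attractive.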