6. Image Compression

snowleafzf

Category: Image Processing

Article link: https://blog.csdn.net/snowleafzf/article/details/85962830

1. Lossless Coding

•Represent an image signal with the smallest possible number of bits without losing any information.

•Speed up transmission and minimize storage requirements.

•Transformation

-Apply a reversible (one-to-one) transformation.

-Reduce data correlation, alter the data distribution, and pack a large amount of information into a few data samples or subband regions.

-Examples include differential or predictive mapping, unitary transforms, and subband decompositions.

•Data-to-Symbol Mapping

-Convert the transformed image into symbols.

-Partitioning

-Run-length coding (RLC): https://baike.baidu.com/item/%E8%A1%8C%E7%A8%8B%E7%BC%96%E7%A0%81/3429991?fr=aladdin
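As a rough illustration of run-length coding, the sketch below maps a row of pixel values to (value, run length) symbols and back. The function names `rle_encode` and `rle_decode` are made up for this example; real coders differ in how they represent the runs.

```python
from itertools import groupby

def rle_encode(pixels):
    """Collapse each run of equal values into a (value, run_length) pair."""
    return [(value, len(list(run))) for value, run in groupby(pixels)]

def rle_decode(pairs):
    """Expand (value, run_length) pairs back into the original sequence."""
    out = []
    for value, count in pairs:
        out.extend([value] * count)
    return out

row = [0, 0, 0, 0, 255, 255, 0, 0, 0]
encoded = rle_encode(row)
print(encoded)                      # [(0, 4), (255, 2), (0, 3)]
assert rle_decode(encoded) == row   # the mapping is reversible
```

The mapping pays off only when the data contain long runs, which is why it usually follows a transformation step that produces many repeated values.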

•Lossless Symbol Coding

-Assign binary codewords to the input symbols.

-Variable-length coding (VLC), i.e., entropy coding, such as Huffman and arithmetic coding

-Fixed-length coding, such as dictionary (Lempel-Ziv) coding

Arithmetic coding: https://blog.csdn.net/l294265421/article/details/54434643

Huffman coding: https://www.cnblogs.com/kubixuesheng/p/4397798.html
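To make variable-length coding concrete, here is a minimal Huffman coding sketch. It is only an illustration: the function name `huffman_code` and the sample string are assumptions, and practical coders also have to transmit or agree on the code table.

```python
import heapq
from collections import Counter

def huffman_code(symbols):
    """Return {symbol: bitstring} for a Huffman code built from symbol counts."""
    freq = Counter(symbols)
    if len(freq) == 1:                      # degenerate source with one symbol
        return {next(iter(freq)): "0"}
    # Heap entries: (weight, tie_breaker, {symbol: partial codeword}).
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        w0, _, c0 = heapq.heappop(heap)     # the two least frequent subtrees
        w1, _, c1 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c0.items()}
        merged.update({s: "1" + c for s, c in c1.items()})
        heapq.heappush(heap, (w0 + w1, tie, merged))
        tie += 1
    return heap[0][2]

data = "aaaabbbccd"
code = huffman_code(data)
bits = "".join(code[s] for s in data)
print(code)
print(len(bits))   # 19 bits, versus 20 with a fixed 2-bit code per symbol
```

Frequent symbols receive short codewords and rare symbols long ones, so the average codeword length approaches the entropy of the source.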

•Dictionary-based schemes (Lempel-Ziv)

-Do not require a priori knowledge of the source symbol probability distribution.

-Dynamically construct the encoding and decoding tables.

-Use fixed-length binary codewords.
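The sketch below uses LZW, one well-known member of the Lempel-Ziv family, to show how the encoder and decoder can build identical tables on the fly without knowing the symbol probabilities. It is a simplified illustration (unbounded table, integer codes rather than packed bits), and the function names are chosen for this example only.

```python
def lzw_encode(data: bytes):
    """Return a list of integer codes; the table is built while encoding."""
    table = {bytes([i]): i for i in range(256)}   # start with all single bytes
    current, codes = b"", []
    for value in data:
        candidate = current + bytes([value])
        if candidate in table:
            current = candidate                   # keep extending the match
        else:
            codes.append(table[current])
            table[candidate] = len(table)         # learn the new string
            current = bytes([value])
    if current:
        codes.append(table[current])
    return codes

def lzw_decode(codes):
    """Rebuild the same table from the code stream alone."""
    if not codes:
        return b""
    table = {i: bytes([i]) for i in range(256)}
    previous = table[codes[0]]
    out = [previous]
    for code in codes[1:]:
        # A code not yet in the table can only be previous + previous[0].
        entry = table[code] if code in table else previous + previous[:1]
        out.append(entry)
        table[len(table)] = previous + entry[:1]
        previous = entry
    return b"".join(out)

message = b"ABABABABABAB"
codes = lzw_encode(message)
print(len(message), "bytes ->", len(codes), "codes")
assert lzw_decode(codes) == message
```

Each emitted code can be written with the same number of bits (for example 12), which is where the fixed-length codewords come from: the compression comes from each codeword standing for a variable-length string.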

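2. Block Truncation Coding

Take a region of the image, compute its mean, and compare each pixel with it. All pixels above the mean are replaced by the mean of those above-mean pixels, and all pixels below the mean are replaced by the mean of the below-mean pixels.

https://wenku.baidu.com/view/b1c67238580216fc700afd4f.html See this paper on block coding for details (possibly an undergraduate thesis).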

•Both statistical and structural methods have been developed for image compression.

•Statistical methods exploit the algebraic structure of the pixels in an image.

•Structural methods exploit the geometric structure of the image.

•BTC is a lossy, fixed-length compression method that uses a Q-level quantizer to quantize a local region of the image.

•In its simplest form, it preserves the sample mean and sample standard deviation of a gray-scale image.

•Additional constraints can be added to preserve higher-order moments.

•BTC is a block-adaptive, moment-preserving quantizer.

•Divide the image into nonoverlapping rectangular regions.

•Let the sample mean of the block be the threshold; a "1" then indicates that the original pixel value is above this threshold, and a "0" that it is below.
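A minimal sketch of the two-level, moment-preserving form of BTC for a single block follows. It assumes the block is given as a flat list of gray values and that ties with the mean are coded as 0; the names `btc_encode` and `btc_decode` are illustrative. The two output levels are chosen so that the decoded block keeps the sample mean and standard deviation.

```python
import math

def btc_encode(block):
    """Encode a flat list of pixels as (bitmap, low, high)."""
    m = len(block)
    mean = sum(block) / m
    std = math.sqrt(sum((p - mean) ** 2 for p in block) / m)
    bitmap = [1 if p > mean else 0 for p in block]   # threshold at the mean
    q = sum(bitmap)                                  # pixels above the mean
    if q == 0:                                       # flat block: one level
        return bitmap, mean, mean
    low = mean - std * math.sqrt(q / (m - q))
    high = mean + std * math.sqrt((m - q) / q)
    return bitmap, low, high

def btc_decode(bitmap, low, high):
    """Rebuild the block: 1 -> high level, 0 -> low level."""
    return [high if bit else low for bit in bitmap]

block = [121, 114, 56, 47, 37, 200, 247, 255, 16, 0, 12, 169, 43, 5, 7, 251]
bitmap, low, high = btc_encode(block)
decoded = btc_decode(bitmap, low, high)
print(bitmap)
print(round(low, 2), round(high, 2))
```

Only the bitmap and the two levels need to be stored per block, so the bit rate is fixed regardless of the image content.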

•Applications

-HDTV

-Sun’s CellB video format

-XMovie

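3. Vector Quantization

Several scalar values are grouped into a vector, and the vector is quantized as a whole in the vector space. This compresses the data while losing little information.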

https://blog.csdn.net/lishuiwang/article/details/78483547

•Quantization is a mapping of a large set of values to a smaller set of values.

•LBG Design Algorithm

-Initialization (random selection)

-Encoding of the training vectors

-Computation of the centroids

https://blog.csdn.net/iron__c/article/details/74939534

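The key to designing an optimal vector quantizer with the LBG algorithm is finding the optimal partition and the optimal codebook. First, given a codebook, find the partition of the source space that minimizes the average distortion: the codebook and the nearest-neighbor rule (NNR) determine this partition, so each vector in the source space is assigned to the codeword with which its distortion is smaller than with any other codeword. Second, given a partition, find the codebook that minimizes the average distortion. The idea is as follows:

  a. Pick n image blocks at random as the initial codewords.
  b. Use these n codewords to partition all image blocks into n sets, so that every block in a set is closer to that set's codeword than to any other codeword.
  c. Take the centroids of the n sets as the n new codewords.
  d. If the new codewords differ little from the previous ones (convergence), codebook training is finished; otherwise repeat steps b and c.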
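Steps a to d can be sketched as follows. This is a simplified illustration, assuming squared Euclidean distance as the distortion measure and training vectors given as equal-length tuples; the function name `lbg` and its parameters (`init`, `tol`, `max_iter`, `seed`) are assumptions made for this example.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def lbg(vectors, n, init=None, tol=1e-9, max_iter=100, seed=0):
    """Train an n-codeword codebook from a list of equal-length tuples."""
    # Step a: random initial codewords unless the caller supplies them.
    codebook = list(init) if init else random.Random(seed).sample(vectors, n)
    for _ in range(max_iter):
        # Step b: nearest-neighbor partition of the training vectors.
        cells = [[] for _ in codebook]
        for v in vectors:
            nearest = min(range(len(codebook)),
                          key=lambda i: sq_dist(v, codebook[i]))
            cells[nearest].append(v)
        # Step c: each codeword becomes the centroid of its cell
        # (an empty cell keeps its old codeword).
        updated = [
            tuple(sum(col) / len(cell) for col in zip(*cell)) if cell else old
            for cell, old in zip(cells, codebook)
        ]
        # Step d: stop once the codewords have essentially stopped moving.
        moved = max(sq_dist(a, b) for a, b in zip(codebook, updated))
        codebook = updated
        if moved <= tol:
            break
    return codebook

train = [(0, 0), (0, 1), (1, 0), (1, 1),
         (10, 10), (10, 11), (11, 10), (11, 11)]
print(lbg(train, 2, init=[(0, 0), (1, 1)]))   # [(0.5, 0.5), (10.5, 10.5)]
```

The result depends on the initial codewords: the iteration only guarantees a local minimum of the average distortion, so a poor random start can leave the codebook stuck in a worse partition.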

•Tree-Structured VQ

-A hierarchical arrangement of codevectors

-Allows the codebook to be searched efficiently

•Mean-Removed VQ

-A codebook may contain many similar vectors that differ only in their mean.

-Extract the component responsible for this variation (the mean) and code it separately as a scalar; the remaining mean-removed vector is then vector quantized.
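The small example below illustrates why removing the mean helps: two blocks that differ only in brightness reduce to the same zero-mean residual, so one codeword can serve both. The helper name `remove_mean` and the sample vectors are made up for illustration.

```python
def remove_mean(vector):
    """Split a vector into its scalar mean and the zero-mean residual."""
    mean = sum(vector) / len(vector)
    return mean, tuple(x - mean for x in vector)

dark = (10, 12, 10, 12)
bright = (110, 112, 110, 112)
mean_d, shape_d = remove_mean(dark)
mean_b, shape_b = remove_mean(bright)
print(mean_d, mean_b)       # 11.0 111.0
print(shape_d == shape_b)   # True
```

The scalar mean is coded on its own, and only the residual goes through the vector quantizer, which keeps the codebook smaller.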

4. JPEG Lossy Image Compression Standard

•Part of the multipart set of ISO standards ISO/IEC 10918-1, -2, and -3 (ITU-T Recommendations T.81, T.83, and T.84)

•Entails an irreversible mapping of the image to a compressed bit stream, with mechanisms for a controlled loss of information

•Produces a bit stream that is usually much smaller than the one produced by lossless compression

•Key features of the lossy JPEG standard:

-Both sequential and progressive modes of encoding are permitted.

-Low-complexity implementations in both hardware and software are feasible.

-All types of images are permitted.

-A graceful tradeoff between bit rate and quality is offered.

DCT

•Lossy JPEG compression is based on transform coding that uses the DCT (Discrete Cosine Transform).

•In DCT coding, each component of the image is subdivided into blocks of 8 x 8 pixels.

•A two-dimensional DCT is applied to each block of data to obtain an 8 x 8 array of coefficients.
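For reference, a direct (unoptimized) sketch of the two-dimensional DCT of one 8 x 8 block is shown below, using the orthonormal DCT-II definition. It assumes the block is a list of 8 rows of 8 numbers and omits the level shift that JPEG applies to the samples first; real codecs use fast factorizations rather than this four-fold loop.

```python
import math

N = 8

def dct2(block):
    """Orthonormal 2-D DCT-II of an N x N block (list of lists)."""
    def c(k):
        return math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N)
    out = [[0.0] * N for _ in range(N)]
    for u in range(N):
        for v in range(N):
            total = 0.0
            for x in range(N):
                for y in range(N):
                    total += (block[x][y]
                              * math.cos((2 * x + 1) * u * math.pi / (2 * N))
                              * math.cos((2 * y + 1) * v * math.pi / (2 * N)))
            out[u][v] = c(u) * c(v) * total
    return out

flat = [[100] * N for _ in range(N)]    # a perfectly uniform block
coeffs = dct2(flat)
print(round(coeffs[0][0], 6))           # 800.0: all energy is in the DC term
```

For a uniform block every AC coefficient is zero (up to rounding error), which is the extreme case of the energy compaction that makes the DCT useful for compression.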

Quantization

Coefficient-to-Symbol Mapping and Coding

•JPEG treats the DC coefficient and the set of AC coefficients differently.

•JPEG uses Huffman coding or arithmetic coding to represent the symbols.
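A simplified sketch of this mapping for baseline JPEG: the DC coefficient is coded as the difference from the previous block's DC value, while the AC coefficients are scanned in zigzag order and turned into (zero run, value) pairs. The function names are illustrative, and details such as the size categories and the special symbol for runs longer than 15 zeros are left out.

```python
def zigzag_order(n=8):
    """Indices (row, col) of an n x n block in zigzag scan order."""
    return sorted(
        ((r, c) for r in range(n) for c in range(n)),
        key=lambda p: (p[0] + p[1], p[0] if (p[0] + p[1]) % 2 else -p[0]),
    )

def block_to_symbols(quantized, previous_dc):
    """Map one quantized block to (DC difference, [(zero_run, value), ...])."""
    scan = [quantized[r][c] for r, c in zigzag_order(len(quantized))]
    dc_diff = scan[0] - previous_dc        # DC: code only the difference
    ac_pairs, run = [], 0
    for value in scan[1:]:                 # AC: zero runs plus nonzero values
        if value == 0:
            run += 1
        else:
            ac_pairs.append((run, value))
            run = 0
    if run:
        ac_pairs.append("EOB")             # end of block: only zeros remain
    return dc_diff, ac_pairs

block = [[0] * 8 for _ in range(8)]
block[0][0], block[0][1], block[1][0], block[1][1] = 52, -3, 2, 1
print(block_to_symbols(block, previous_dc=50))
# (2, [(0, -3), (0, 2), (1, 1), 'EOB'])
```

The resulting symbols are what the Huffman or arithmetic coder then represents with binary codewords.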

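Compressing image data with the DCT relies on the statistical properties of image signals in the frequency domain. In the spatial domain, image content varies enormously. In the frequency domain, however, statistical analysis of large numbers of images shows that after the DCT the significant coefficients are concentrated in a fairly small range, mainly in the low-frequency part. Once the DCT has exposed this regularity, the parts of the spectrum that carry little energy can be discarded while the main frequency components are kept, which achieves the goal of image data compression.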
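Discarding the low-energy part of the spectrum is done in practice by quantizing the coefficients. The sketch below uses a hypothetical 4 x 4 coefficient array and an illustrative step-size rule that grows with frequency; neither the values nor the step table come from the JPEG standard.

```python
def quantize(coeffs, steps):
    """Divide each coefficient by its step size and round to an integer."""
    return [[round(c / q) for c, q in zip(crow, qrow)]
            for crow, qrow in zip(coeffs, steps)]

def dequantize(levels, steps):
    """Approximate reconstruction: multiply each level by its step size."""
    return [[lev * q for lev, q in zip(lrow, qrow)]
            for lrow, qrow in zip(levels, steps)]

# Hypothetical coefficients: large at low frequencies, small at high ones.
coeffs = [[624.0, -36.0, 9.0, 2.0],
          [-41.0, 14.0, -4.0, 1.0],
          [11.0, -5.0, 2.0, -1.0],
          [3.0, 1.0, -1.0, 0.5]]
# Step size grows with the frequency index u + v (illustrative only).
steps = [[8 + 6 * (u + v) for v in range(4)] for u in range(4)]

levels = quantize(coeffs, steps)
for row in levels:
    print(row)
# [78, -3, 0, 0]
# [-3, 1, 0, 0]
# [1, 0, 0, 0]
# [0, 0, 0, 0]
```

Only 5 of the 16 levels are nonzero, and the information lost in rounding cannot be recovered, which is what makes the scheme lossy.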

DCT explanation: https://www.cnblogs.com/stnlcd/p/7261842.html

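5. Additional Notes

Compression efficiency can be expressed as the compression ratio (original size divided by compressed size).

Compression efficiency can also be expressed as the average bit rate in bits per pixel.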
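Both measures are simple to compute. The numbers below describe a hypothetical 512 x 512, 8-bit grayscale image and are for illustration only.

```python
def compression_ratio(original_bytes, compressed_bytes):
    """Original size divided by compressed size."""
    return original_bytes / compressed_bytes

def bits_per_pixel(compressed_bytes, width, height):
    """Average number of coded bits spent on each pixel."""
    return 8 * compressed_bytes / (width * height)

original = 512 * 512        # one byte per pixel
compressed = 32_768         # assumed size of the compressed file in bytes
print(compression_ratio(original, compressed))   # 8.0
print(bits_per_pixel(compressed, 512, 512))      # 1.0
```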

•Factors of Lossless Symbol Coding

-Coding delay: the minimum time required to both encode and decode an input data sample

-Implementation complexity: the required number of arithmetic operations per second and the memory requirement

-Robustness: how well the coding method tolerates transmission errors

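Low-frequency components (low-frequency signals) represent regions where the brightness or gray level changes slowly, that is, the large flat areas of the image. They describe the main part of the image and mainly provide an overall measure of its intensity.

High-frequency components (high-frequency signals) correspond to the parts of the image that change sharply, that is, edges (contours), noise, and fine detail. They mainly measure the edges and contours of the image. The human eye is generally less sensitive to errors in high-frequency components than in low-frequency ones, which is why lossy coders can quantize them more coarsely.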
