Huffman coding.
The basic idea of Huffman coding is to encode symbols that appear with higher probability using fewer output bits. Since symbols may be represented by different numbers of bits, it is important that no symbol's code is a prefix of another symbol's code. For example, with codes a=0, b=10, c=11, the bit string 01011 splits unambiguously as 0|10|11 = abc.
Static Huffman Coding.
Create a list of nodes, one per symbol, sorted by probability. Each node's weight is set equal to its symbol's probability.
Start of loop:
1. Find and remove the two nodes with the smallest probabilities. Call these nodes A and B.
2. Create a new node with weight[node] = weight[A] + weight[B].
3. Assign the left and right children of the new node to A and B.
4. Insert the new node back into the sorted list.
5. Repeat the loop until the list consists of only the last node.
On each loop iteration in the process of creating the Huffman tree, it is possible to decide which child will be the left one and which the right one. In addition, if some symbols or sums of symbols have equal weights, any of them may be selected when the minimal-weight node is searched for. Therefore it is possible to create many different Huffman trees. Each tree is a valid Huffman tree and can therefore be used for compression.
Encoding Static Huffman Code.
Build Huffman Tree and calculate codes.
Start the encoding loop here.
o Read the next input symbol s.
o Find or calculate the code for symbol s - code[s].
o Output code[s].
o Continue the encoding loop.
Decoding Static Huffman Code.
Decoding is symmetric to encoding; nevertheless, here are the steps:
o Build the Huffman tree and/or calculate the codes.
Start the decoding loop here.
o Find a code corresponding to the given bit stream, starting from the current position.
Suppose the found code corresponds to symbol s. The length of code[s] is l bits.
o Output symbol s.
o Advance the current position in the input stream by l bits.
o Continue the decoding loop.
Create CodeBook with codes listed.
It is possible to maintain an array, hash map, list, tree, heap, or any other structure holding the code ready for use for each symbol. The key is the symbol.
We want this process to be fast, so we build a hash map that stores the code of each symbol.
General notes:
- The list of probabilities does not have to contain symbols that are not in use.
#include <iostream>
#include <string>
#include <fstream>
#include <cstring>   // for strlen
using namespace std;

ofstream outfile;
ifstream infile;

// One node of the Huffman tree; leaves carry a symbol in 'data'.
struct node
{
    int iweight;         // node weight: symbol frequency, or sum for internal nodes
    string code;         // bit string assigned to this node
    char data;           // the symbol itself (leaves only)
    node *left, *right;
};
// Two bubble passes over A[m..n]: afterwards the two smallest-weight
// nodes of the range sit at positions m and m+1.
void bubble(node A[], int m, int n)
{
    int count = 0;
    while (count < 2)
    {
        for (int i = n; i > m; i--)
        {
            if (A[i].iweight < A[i-1].iweight)
                swap(A[i], A[i-1]);
        }
        count++;
    }
}
int weight[256] = {0}, j = 0;   // symbol frequencies; j = number of distinct symbols
node A[512];                    // all tree nodes: leaves first, merged nodes appended
string Hash[256];               // codebook: Hash[symbol] = its bit string
// Consume nodes in pairs (j, j+1), appending each merged parent at A[count].
// With n leaves, the root ends up at A[2*n - 2].
void Create_Tree(node A[], int n)
{
    int j, count = n - 1;
    for (j = 0; j < count; j += 2)
    {
        if (count - j > 1)
            bubble(A, j, count);   // bring the two lightest nodes to j, j+1
        count++;
        A[count].iweight = A[j].iweight + A[j+1].iweight;
        A[count].left = &A[j];
        A[count].right = &A[j+1];
    }
}
// Assign '0' to each left edge and '1' to each right edge; a leaf's code
// is the concatenation of edge labels on the path from the root.
void Encoding(node* &T)
{
    if (T != NULL)
    {
        if (T->left != NULL)   // internal nodes always have both children
        {
            T->left->code = T->code + '0';
            T->right->code = T->code + '1';
        }
        else                   // leaf: record its code in the codebook
            Hash[(unsigned char)T->data] = T->code;
        Encoding(T->left);
        Encoding(T->right);
    }
}
// Walk the tree bit by bit; each time a leaf is reached, emit its symbol
// and restart from the root.
void Decoding(char code[])
{
    node *Root = &A[2*j - 2];
    node *p;
    int len = strlen(code);
    for (int i = 0; i < len; )
    {
        p = Root;
        while (p->right != NULL && i < len)
        {
            if (code[i] == '0')
                p = p->left;
            else
                p = p->right;
            i++;
        }
        outfile << p->data;
    }
}
int main()
{
    infile.open("in.txt");
    char ch[2000], code[10000];
    infile.read(ch, 1999);          // read the whole file (up to 1999 chars)
    ch[infile.gcount()] = '\0';
    infile.close();

    // Count symbol frequencies; the cast avoids negative indexes for chars >= 128.
    for (int i = 0; i < (int)strlen(ch); i++)
        weight[(unsigned char)ch[i]]++;

    // Create one leaf node per symbol actually present.
    for (int i = 0; i < 256; i++)
    {
        if (weight[i] != 0)
        {
            A[j].iweight = weight[i];
            A[j].data = i;
            j++;
        }
    }

    Create_Tree(A, j);
    node *p = &A[2*j - 2];          // root of the finished tree
    Encoding(p);

    // Write the encoded bit string (as '0'/'1' characters).
    outfile.open("out.txt");
    for (int i = 0; i < (int)strlen(ch); i++)
        outfile << Hash[(unsigned char)ch[i]];
    outfile.close();

    // Read the bits back and decode them.
    infile.open("out.txt");
    outfile.open("Decoding.txt");
    infile.get(code, 10000, EOF);   // marked
    Decoding(code);
    infile.close();
    outfile.close();
    return 0;
}