多媒体期末总结

最新推荐文章于 2024-06-11 23:22:22 发布

「已注销」

最新推荐文章于 2024-06-11 23:22:22 发布

阅读量2.3k

点赞数 11

分类专栏：实时处理和多媒体期末总结文章标签：多媒体 media 郑州大学伍伦贡

本文链接：https://blog.csdn.net/weixin_44275663/article/details/86006623

版权

实时处理和多媒体期末总结专栏收录该内容

4 篇文章 2 订阅

订阅专栏

$\ \color{blue}有修改的话，会用蓝色标$

多媒体总结

一.简答题汇总

Briefly explain the advantages of logarithmic quantisation when applied to speech signals?

Answer: Logarithmic quantisation provides lower quantisation errors for low amplitude values.(对低幅值有利）

what is the advantage of backward adaptive quantisation compared with forward adaptive quantisation?

Answer: Backward adaptive quantisation does not require transmission of quantisation steps and has lower delay as does not require lookahead.(后向自适应量化不需要传输量化步骤，有着更低的延时)

Describe the difference between lossless and lossy compression,Given two examples of signals where lossy compression is appropriate?

Answer:
Lossless compression prefectly reconstructs the orign signal while lossly compression produces an accurate representation of the orignal signal,Lossly compression is appreciate for speech signal and audio signal.
（无损压缩可以重建原始信号，有损压缩智能精确描述原始信号）

Brifely explain the difference a gray scale digital image and a color digital image?

Answer:
(1)Gray scale image is black and white.Only one colour component per pixel with a range of values from 0 to 255.（灰度图图像每个像素仅有一个颜色成分）
(2) Colour image have multiple component per pixel,representing Red,Green and Blue.（彩色图像每个像素有红绿蓝三种颜色成分）

What is the purpose of the transforming in image compression?

Answer: Transforming aims to compact the energy of the image into a few significant coefficients,This allow for efficient coding.

what criteria should be used when choosing the resolution for a digital image?

Answer:Should be chosen so that Subjective distoration is minimised.（主观失真最小化）

Brifely compare the RGB color space with YC_rC_b color space?

Answer: Y is made up of R,G and B color components,represents the luminance or black and white color components,C_r and C_b is the combination of the luminance and one of the color components .They represent colour

What is the zig-zag scanning pattern designed to achieve in JPEG image compression?

Answer: This is designed to order the coefficients in magnitude.This then allows run/length or entropy encoding as it will lead to long sequences of zeros.

What are the two components that need to be compressed in video compression?

Answer:The image frame and the temporal information.

Explain the difference between interframe coding and intraframe coding of video frames.what advantages does interframe coding have over intraframe coding of video?

Answer:
(1) Interframe coding:coding the information between frames
(2) Intraframe coding :code the information within the frame,
interframe coding leads to lower bit rates than intraframe coding.

Brifely explain motion estimate and motion compensation and why it is used in video coding ?

Answer:
(1)Motion estimate determines how block of pixels have moved from one frame to the next.
(2)Motion vector derived from motion estimate predicts the current frame based on previous frame.
Motion estimate and motion compensation are used to code the temporal information of video using interframe coding and reduce bit rates compared to intraframe coding only.

Brifely explain the 3-step algorithm used in motion estimate and what advantage it has compared with the Exhaustive Block Matching Algorithm?

Answer:
(1)step 1: predict the motion using 9 points in search regin,finding the closest match.
(2)step 2:center the search block of 0.5 the size as step 1 at the vector resulting in the minimum error from step 1,perform 8 matches around this point.
(3)Step 3:Repeat step 2 for a search regin 0.5 the size of the step 2,choose the vector giving the closest match as the motion vector,This has the advantage of a much shorter search time.

Brifely explain how MPEG-4 treats video and compare with how MPEG-1 and MPEG-2 treats video,Brifely explain one advantage that the MPEG-4 approach to video compression has over MPEG-1 and MPEG-2 approach?

Answer:
MPEG-4:object-based,coded video object separately.(基于object)
MPEG-1/MPEG-2:frame based,Doesn’t divided the frame into objects.(基于frame)
An advantage of MPEG-4 is that is has more scability,more interactivity and lower bit rates.

why is it important to study HVS in order to develop still image and video related application?

Answer:
Sensitive to moderately changing patterns.
Not sensitive to rapidly changing patterns.
Accurate measurement requires stabilizing intrinic rapid eye-movement.

Describe the important steps in compressing an image using JPEG?

Answer: Source Image data ---->DCT Transform ---->Quantisation—>Entropy Encoding

What are the major improvements in JPEG 2000?

Answer:
(1)Excellent low bit performance.
(2)Superior quaility in text compression
(3)No blockiness (没有块效应）
块效应：块效应是所有基于DCT技术压缩可能出现的现象。造成的原因主要是传输误码。

Why was the YUV system capable of catering for both black & white and colour television?

Answer:Y is the luminance,it is the grayscale image ,The inital transmissions were in Y only,Later when Y,U,V were telecast,black and white tvs could display black and white using Y only and colour could use all the components to display colour.

Explain the difference between I ,P and B frames used in MPEG-1 video compression,What is the advantage of increasing number of B frames compared to the number of I and P frames?

Answer:
$\color{blue} I frames---Intraframe \ coded \ frames （帧内编码帧）$
P frames-- Backward predicted frames
B frames-- Bi-directionally prediction frames
For a given distortion rate,using more B frames leads to a lower bit rate.
$\color{blue}B \ frame \ has \ higher \ compression \ ratio \ than \ P \ frame.$

which transform is capable of producing the maximum energy compaction?

Answer: DCT

What are the drawbacks in JPEG?

Answer: Blockiness , poor quality in text compression. The poblem lies in 8*8 blok size

How to improve this scheme?

Answer: Using a large block size 8* 8 in JPEG 2000,this is tackled,Again JPEG tackles poor text quality

what are the key stages involved in the JPEG encoder?

Answer:
(1) Perform the DCT on a block of 8-by-8 pixels.
(2) Quantise the DCT coefficients using uniform quantzation and the quantization table
(3) Perform a zig-zag scan of the DCT coefficients
(4) Perform entropy coding of the resulting quantized coefficients.

Why is the DCT used in image compression?

Answer:
The DCT of an image result in many transform coefficients that are close to 0,this means that they do not need to be used in the reconstruction transform.Hence,only a few coefficients,corresponding the those with the highest magnitude,need to be used to get a good quality reconstruction.

二.计算题

RGB–>YUV
(1) (0,255,0) (2) (0,255,255)
$\left[ \begin{matrix} Y \\ U\\ V \end{matrix} \right] = \left[ \begin{matrix} 0.299 & 0.587 & 0.114\\ -0.147 & -0.289 & 0.436\\ 0.615 & -0.515 & -0.100 \end{matrix} \right] \left[ \begin{matrix} R \\ G\\ B \end{matrix} \right]$
Answer

$\left[ \begin{matrix} Y1 \\ U1\\ V1 \end{matrix} \right] = \left[ \begin{matrix} 0.299 & 0.587 & 0.114\\ -0.147 & -0.289 & 0.436\\ 0.615 & -0.515 & -0.100 \end{matrix} \right] \left[ \begin{matrix} 0 \\ 255\\ 0 \end{matrix} \right]=\left[ \begin{matrix} 149.685 \\ -73.695\\ -131.325 \end{matrix} \right]$
$\left[ \begin{matrix} Y2 \\ U2\\ V2 \end{matrix} \right] = \left[ \begin{matrix} 0.299 & 0.587 & 0.114\\ -0.147 & -0.289 & 0.436\\ 0.615 & -0.515 & -0.100 \end{matrix} \right] \left[ \begin{matrix} 0 \\ 255\\ 255 \end{matrix} \right]=\left[ \begin{matrix} 178.755 \\ 37.485\\ -156.825 \end{matrix} \right]$

Use Dithering to map the following 4*4 image into an image with a range with a range 0-16
$\left[ \begin{matrix} 35& 37&89&127 \\ 198& 245&258&21\\ 78&124&128&56\\ 198&187&39&128 \end{matrix} \right]$
Dithering matrix
$\left[ \begin{matrix} 0& 8&2&10 \\ 12& 4&14&6\\ 3&11&1&9\\ 15&7&13&5 \end{matrix} \right]$

Answer

1. $\left[ \begin{matrix} 35& 35&35&35 \\ 35& 35&35&35\\ 35&35&35&35\\ 35&35&35&35 \end{matrix} \right] \times\frac{1}{17}= \left[ \begin{matrix} 2& 2&2&2 \\ 2& 2&2&2\\ 2&2&2&2\\ 2&2&2&2 \end{matrix} \right]$
compared with dithering matrix:
$\left[ \begin{matrix} 2& 2&2&2 \\ 2& 2&2&2\\ 2&2&2&2\\ 2&2&2&2 \end{matrix} \right]-\left[ \begin{matrix} 0& 8&2&10 \\ 12& 4&14&6\\ 3&11&1&9\\ 15&7&13&5 \end{matrix} \right]= \left[ \begin{matrix} 2& -6&0&-8\\ -10& -2&-12&-4\\ -1&-9&1&-7\\ -13&-5&-11&-3 \end{matrix} \right]$
if elements is greater than 0,the value is 1,else 0
$\left[ \begin{matrix} 1& 0&0&0\\ 0& 0&0&0\\ 0&0&1&0\\ 0&0&0&0 \end{matrix} \right]$
others are same as the above.

Use the threshold of 128 to map the following image to a binary image.(you may use 0 and 1 binary values).Repeat the process for threshold value of 78?

234	123	47	89
25	167	67	49
88	157	89	189
38	178	34	89

Answer:

int a[] []=new int[][]{(234,123,47,89},{25,167,47,89},{88,157,89,189},  {38,178,34,89}};
int b[][]= new int[4][4];
int T=128;
for(int i=0;i<4;i++){
  for(int j=0;j<4;j++){
     if(a[i][j]<T)
            b[i][j]=0;
    else
         b[i][j]=1;
      }
  }

1	0	0	0
0	1	0	0
0	1	0	1
0	1	0	0

when T=87,the operation is same as above.

Map the above image into a negative image,logarithmic image using c=1.3,Also map it into a power-law mapping c=1.3 and gamma equal to 3?
Answer:
注释：考试不写程序，此处写只是为了描述方便

4.1 negative transform

int a[] []=new int[][]{(234,123,47,89},{25,167,47,89},{88,157,89,189},  {38,178,34,89}};    
int b[][]= new int[4][4];
for(int i=0;i<4;i++){
  for(int j=0;j<4;j++){
    b[i][j]=255-a[i][j];
      }
  }

4.2 logarithmic transform

  int a[] []=new int[][]{(234,123,47,89},{25,167,47,89},{88,157,89,189},  {38,178,34,89}};
  int b[][]= new int[4][4];
  int c=1.3;
for(int i=0;i<4;i++){
  for(int j=0;j<4;j++){
    b[i][j]=c*Math.log(1+a[i][j])/Math.log(2);
      }
  }

4.3 Power-law transform

  int a[] []=new int[][]{(234,123,47,89},{25,167,47,89},{88,157,89,189},  {38,178,34,89}};
  int b[][]= new int[4][4];
  int c=1.3;
  int r=3;
for(int i=0;i<4;i++){
  for(int j=0;j<4;j++){
    b[i][j]=c*Math.power(a[i][j]/255,r);
      }
  }

Use 3-pt medain filter on the sequence below?
16 14 15 12 2 13 15 52 51 50 49

$\color{blue}Answer:$
step1:在队首和队尾填充数据，填充的数据可以比队列里面最小的值还要小的数，或者比队列里面最大的值还要大的数（保证填充的数据不会是中值）
0 16 14 15 12 2 13 15 52 51 50 49 0
step 2:连续三个为一组，对组内三个数据进行排序，求出中值，直至所有组结束
$\begin{matrix} \underbrace{ 0,16,14 ,} \\ \color{Blue}14 \end{matrix} \begin{matrix} \underbrace{ 16,14 ,15,} \\ \color{Blue}15 \end{matrix} \begin{matrix} \underbrace{ 14,15 ,12,} \\ \color{Blue}14 \end{matrix} \begin{matrix} \underbrace{ 15,12 ,2,} \\ \color{Blue}12 \end{matrix} \begin{matrix} \underbrace{ 12,2,13,} \\ \color{Blue}12 \end{matrix} \begin{matrix} \underbrace{ 2,13 ,15,} \\ \color{Blue}13 \end{matrix} \begin{matrix} \underbrace{ 13,15 ,52,} \\ \color{Blue}15 \end{matrix} \begin{matrix} \underbrace{ 15,52 ,51,} \\ \color{Blue}51 \end{matrix} \begin{matrix} \underbrace{ 52,51 ,50,} \\ \color{Blue}51 \end{matrix} \begin{matrix} \underbrace{ 51,50 ,49,} \\ \color{Blue}50 \end{matrix} \begin{matrix} \underbrace{ 50,49 ,0,} \\ \color{Blue}49 \end{matrix}$

Imagine an image of size 1280*760,if the image is down-sampled to 2:1 in both directions,what is the size of the image?If the image is compressed 16:1,what is the size of the image in kilobytes?(assume a gray-scale image of 8-bit)

$\color{blue}Answer:$
down-sampled: $size=\frac{1280\times760\times 8}{2\times2\times8\times1024}=237.5kB$
compressed:
$size=\frac{1280\times760\times 8}{16\times1024\times8}=59.375kB$

7.Table 1A lists the probabilities of occurence of the symbols A to E,find Huffman codes to represent each of these symbols?

symbol	probability
A	0.16
B	0.29
C	0.05
D	0.36
E	0.14

Answer:

$\color{red}备注：哈夫曼编码的编码结果不唯一，但平均码字长度是一样的$
Average codeword length=3*0.16+2*0.29+4*0.05+1*0.36+4*0.14=0.48+0.58+0.2+0.36+0.56=2.18bits/symbol
Compression rate=2.18/3=0.727
$\color{blue}此处除的3，是等长编码所需要的最小位数，五个状态（A,B,C,D,E),因为2^3 >5>2^2，所以取3$

「已注销」

关注

11
点赞
踩
12

收藏

觉得还不错? 一键收藏
8
评论
多媒体期末总结

多媒体总结一.简答题汇总Briefly explain the advantages of logarithmic quantisation when applied to speech signals?Answer: Logarithmic quantisation provides lower quantisation errors for low amplitude value...
复制链接

扫一扫