图像解码之二——使用libpng解码png图片

weixin_55025383

已于 2023-09-23 16:25:52 修改

阅读量509

点赞数

文章标签： mfc c++

于 2023-09-23 16:25:00 首次发布

上文《图像解码之一——使用libjpeg解码jpeg图片》介绍了使用libjpeg解码jpeg图片。png图片应用也非常广泛，本文将会简单介绍怎样使用开源libpng库解码png图片。

libpng的数据结构

png_structp变量是在libpng初始化的时候创建，由libpng库内部使用，代表libpng的是调用上下文，库的使用者不应该对这个变量进行访问。调用libpng的API的时候，需要把这个参数作为第一个参数传入。

png_infop变量，初始化完成libpng之后，可以从libpng中获得该类型变量指针。这个变量保存了png图片数据的信息，库的使用者可以修改和查阅该变量，比如：查阅图片信息，修改图片解码参数。在早期的版本中直接访问该变量的成员，最新的版本建议是通过API来访问这些成员。

libpng的使用

0、判断是否为libpng数据

这步是可选的，在利用libpng继续数据处理之前，可以调用png_sig_cmp函数来检查是否为png数据，请参阅libpng手册了解详细内容。

1、初始化libpng


   
   
     
     
      
      
     
     
     
     
      
        
       
       1: 
       
       /
       
       * Create 
       
       and 
       
       initialize the png_struct 
       
       with the desired 
       
       error handler
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2:  
       
       * functions.  
       
       If you want 
       
       to 
       
       use the 
       
       default stderr 
       
       and longjump 
       
       method,
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       3:  
       
       * you can supply 
       
       NULL 
       
       for the 
       
       last three parameters.  We 
       
       also supply the
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       4:  
       
       * the compiler header 
       
       file version, so that we know 
       
       if the application
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       5:  
       
       * was compiled 
       
       with a compatible version 
       
       of the library.  REQUIRED
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       6:  
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       7: png_ptr 
       
       = png_create_
       
       read_struct(PNG_LIBPNG_VER_
       
       STRING,
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       8:    png_voidp user_
       
       error_ptr, user_
       
       error_fn, user_warning_fn);

初始化libpng的时候，用户可以指定自定义错误处理函数，如果不需要指定自定义错误处理函数，则传NULL即可。 png_create_read_struct函数返回一个png_structp变量，前面已经提到该变量不应该被用户访问，应该在以后调用 libpng的函数时传递给libpng库。

如果你需要提供自定义内存管理模块则需要调用png_create_read_struct_2来完成对libpng的初始化：


   
   
     
     
      
      
     
     
     
     
      
       
       
       1: png_structp png_ptr 
       
       = png_create_
       
       read_struct_
       
       2
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2:    (PNG_LIBPNG_VER_
       
       STRING, (png_voidp)user_
       
       error_ptr,
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       3:     user_
       
       error_fn, user_warning_fn, (png_voidp)
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       4:     user_mem_ptr, user_malloc_fn, user_
       
       free_fn)

2、创建图像信息——png_infop变量


   
   
     
     
      
      
     
     
     
     
      
      
       
       1: 
       
       /
       
       * 
       
       Allocate
       
       /
       
       initialize the memory 
       
       for image information.  REQUIRED. 
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2: info_ptr 
       
       = png_create_info_struct(png_ptr);
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       3: 
       
       if (info_ptr 
       
       =
       
       = 
       
       NULL)
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       4: {
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       5:    png_destroy_
       
       read_struct(
       
       &png_ptr, png_infopp_
       
       NULL, png_infopp_
       
       NULL);
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       6:    
       
       return (
       
       ERROR);
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       7: }

如前面所说，用户将会通过png_infop变量来获得图片的信息，设置图片解码参数等。

3、设置错误返回点

上文libjpeg解码jpeg图片中提到用setjmp/longjmp函数来处理异常。libpng库默认集成这种机制来完成异常处理，如下代码初始化错误返回点：


   
   
     
     
      
      
     
     
     
     
      
       
       
       1: 
       
       /
       
       * 
       
       Set 
       
       error handling 
       
       if you 
       
       are 
       
       using the setjmp
       
       /longjmp 
       
       method (this 
       
       is
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2:  
       
       * the normal 
       
       method 
       
       of doing things 
       
       with libpng).  REQUIRED unless you
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       3:  
       
       * 
       
       set 
       
       up your own 
       
       error handlers 
       
       in the png_create_
       
       read_struct() earlier.
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       4:  
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       5: 
       
       if (setjmp(png_jmpbuf(png_ptr)))
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       6: {
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       7:    
       
       /
       
       * 
       
       Free 
       
       all 
       
       of the memory associated 
       
       with the png_ptr 
       
       and info_ptr 
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       8:    png_destroy_
       
       read_struct(
       
       &png_ptr, 
       
       &info_ptr, png_infopp_
       
       NULL);
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       9:    
       
       /
       
       * 
       
       If we 
       
       get here, we had a problem reading the 
       
       file 
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
        
       
       10:    
       
       return (
       
       ERROR);
      
      
     
     

     
     
      
      
     
     
     
     
      
        
       
       11: }

正如上面注释中提到情况，只有在初始化libpng的时候未指定用户自定义的错误处理函数情况下，才需要设置错误返回点。如果设置了用户自定义的错误处理函数，libpng将会调用用户自定义错误处理函数，而不会返回到这个调用点。

当libpng库出现错误的时候，libpng将会自动调用longjmp函数返回到这个点。在这个点我们可以进行必要的清理工作。

4、设置libpng的数据源

我在上文《图像解码之一——使用libjpeg解码jpeg图片》中提到，一个好的代码库应该能够运行用户输入各式各样的数据，而不能把输入数据定死。libpng在这方面做得非常的好，它提供了默认的文件输入流的支持，并且提供了用户自定义回调函数来完成png数据的输入。

对于文件流数据数据设置代码如下：


   
   
     
     
      
      
     
     
     
     
      
       
       
       1: 
       
       /
       
       * One 
       
       of the following I
       
       /O initialization methods 
       
       is REQUIRED 
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2: def streams 
       
       /
       
       * PNG 
       
       file I
       
       /O 
       
       method 
       
       1 
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       3: 
       
       /
       
       * 
       
       Set 
       
       up the 
       
       input 
       
       control 
       
       if you 
       
       are 
       
       using 
       
       standard C streams 
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       4: png_init_io(png_ptr, fp);

用户自定义回调函数设置libpng数据源的代码如下：


   
   
     
     
      
      
     
     
     
     
      
       
       
       1: 
       
       /
       
       * 
       
       If you 
       
       are 
       
       using replacement 
       
       read functions, instead 
       
       of calling
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2:  
       
       * png_init_io() here you would 
       
       call:
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       3:  
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       4: png_
       
       set_
       
       read_fn(png_ptr, (void 
       
       *)user_io_ptr, user_
       
       read_fn);
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       5: 
       
       /
       
       * where user_io_ptr 
       
       is a structure you want available 
       
       to the callbacks 
       
       *
       
       /

如果你已经使用png_sig_cmp函数来检查了png数据，需要调用png_set_sig_bytes函数来告诉libpng库，这样库处理数据的时候将会跳过相应的数据，具体请参考libpng手册。

5、png图像处理

这步有两种设置方案一种称为高层处理，一种称为底层处理。

高层处理

当用户的内存足够大，可以一次性读入所有的png数据，并且输出数据格式为如下libpng预定义数据类型时，可以用高层函数，下libpng预定义数据类型为：


   
   
     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_IDENTITY          
       
       No transformation
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_STRIP_
       
       16          Strip 
       
       16-
       
       bit samples 
       
       to
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
                                                    
       
       8 bits
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_STRIP_ALPHA     Discard the alpha channel
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_PACKING           Expand 
       
       1, 
       
       2 
       
       and 
       
       4-
       
       bit
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
                                                     samples 
       
       to bytes
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_PACKSWAP        Change 
       
       order 
       
       of packed
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
                                                     pixels 
       
       to LSB 
       
       first
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_EXPAND            
       
       Perform 
       
       set_expand()
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_INVERT_MONO   Invert monochrome images
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_SHIFT              Normalize pixels 
       
       to the
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
                                                     sBIT depth
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_BGR                 Flip RGB 
       
       to BGR, RGBA
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
                                                    
       
       to BGRA
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_SWAP_ALPHA     Flip RGBA 
       
       to ARGB 
       
       or GA
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
                                                    
       
       to AG
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_INVERT_ALPHA  Change alpha 
       
       from opacity
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
                                                    
       
       to transparency
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_SWAP_ENDIAN   Byte-swap 
       
       16-
       
       bit samples
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
       PNG_TRANSFORM_GRAY_
       
       TO_RGB   Expand grayscale samples
      
      
     
     

     
     
      
      
     
     
     
     
      
       
      
      
     
     

     
     
      
      
     
     
     
     
      
                                                    
       
       to RGB (
       
       or GA 
       
       to RGBA)

高层读取函数如下：


   
   
     
     
      
      
     
     
     
     
      
       
       
       1: 
       
       /
       
       *
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2:  
       
       * 
       
       If you have enough memory 
       
       to 
       
       read 
       
       in the entire image 
       
       at once,
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       3:  
       
       * 
       
       and you need 
       
       to specify only transforms that can be controlled
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       4:  
       
       * 
       
       with one 
       
       of the PNG_TRANSFORM_
       
       * bits (this presently excludes
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       5:  
       
       * dithering, filling, setting background, 
       
       and doing gamma
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       6:  
       
       * adjustment), 
       
       then you can 
       
       read the entire image (including
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       7:  
       
       * pixels) 
       
       into the info structure 
       
       with this 
       
       call:
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       8:  
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       9: png_
       
       read_png(png_ptr, info_ptr, png_transforms, png_voidp_
       
       NULL);

该函数将会把所有的图片数据解码到info_ptr数据结构中。png_transforms为整型参数，为上面libpng预定义的数据类型进行or操作得到。调用了该函数，就不可以再调用png_set_transform函数来设置输出数据。

该函数相当于调用底层函数（下文将会介绍）如下调用顺序：

a)调用png_read_info函数获得图片信息。

b)根据png_transforms所指示的，调用png_set_transform设置输出格式转换的函数。

c)调用png_read_image来解码整个图片的数据到内存。

d)调用png_read_end结束图片解码。

当你调用png_read_png之后，则可以调用如下函数得到png数据：

  1: row_pointers = png_get_rows(png_ptr, info_ptr);

底层处理

a)读取输入png数据的图片信息：


   
   
     
     
      
      
     
     
     
     
      
       
       
       1: 
       
       /
       
       * The 
       
       call 
       
       to png_
       
       read_info() gives us 
       
       all 
       
       of the information 
       
       from the
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2:  
       
       * PNG 
       
       file 
       
       before the 
       
       first IDAT (image 
       
       data chunk).  REQUIRED
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       3:  
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       4: png_
       
       read_info(png_ptr, info_ptr);

该函数将会把输入png数据的信息读入到info_ptr数据结构中。

b)查询图像信息

前面提到png_read_info将会把输入png数据的信息读入到info_ptr数据结构中，接下来需要调用API查询该信息。


   
   
     
     
      
      
     
     
     
     
      
        
       
       1: png_
       
       get_IHDR(png_ptr, info_ptr, 
       
       &width, 
       
       &height, 
       
       &
       
       bit_depth, 
       
       &color_
       
       type,
      
      
     
     

     
     
      
      
     
     
     
     
      
         
       
       2:     
       
       &interlace_
       
       type, int_p_
       
       NULL, int_p_
       
       NULL);

c)设置png输出参数（转换参数）

这步非常重要，用户可以指定输出数据的格式，比如RGB888，ARGB8888等等输出数据格式。通过png_set_xxxxx函数来实现，例如如下代码：


   
   
     
     
      
      
     
     
     
     
      
        
       
       /
       
       / expand images 
       
       of 
       
       all color-type 
       
       and bit-depth 
       
       to 
       
       3x
       
       8 
       
       bit RGB images
      
      
     
     

     
     
      
      
     
     
     
     
      
        
       
       /
       
       / let the library process things like alpha, transparency, background
      
      
     
     

     
     
      
      
     
     
     
     
      
        
       
       if (
       
       bit_depth 
       
       =
       
       = 
       
       16)
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
             png_
       
       set_strip_
       
       16(png_ptr);
      
      
     
     

     
     
      
      
     
     
     
     
      
        
       
       if (color_
       
       type 
       
       =
       
       = PNG_COLOR_
       
       TYPE_PALETTE)
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
             png_
       
       set_expand(png_ptr);
      
      
     
     

     
     
      
      
     
     
     
     
      
        
       
       if (
       
       bit_depth
       
       <
       
       8)
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
             png_
       
       set_expand(png_ptr);
      
      
     
     

     
     
      
      
     
     
     
     
      
        
       
       if (png_
       
       get_
       
       valid(png_ptr, info_ptr, PNG_INFO_tRNS))
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
             png_
       
       set_expand(png_ptr);
      
      
     
     

     
     
      
      
     
     
     
     
      
        
       
       if (color_
       
       type 
       
       =
       
       = PNG_COLOR_
       
       TYPE_GRAY ||
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
             color_
       
       type 
       
       =
       
       = PNG_COLOR_
       
       TYPE_GRAY_ALPHA)
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
             png_
       
       set_gray_
       
       to_rgb(png_ptr);

如上代码将会把图像转换成RGB888的数据格式。这种转换函数还很多，请参阅libpng手册了解他们的作用。

虽然有很多设置输出参数的函数可以调用，但是用户的需求是无限的，很多输出格式libpng并不是原生支持的，比如YUV565，RGB565，YUYV 等等。幸好libpng提供了自定义转换函数的功能，可以让用户注册转换回调函数给libpng库，在libpng对输出数据进行转换的时候，先对 png_set_xxxxx函数设置的参数进行转换，最后将会调用用户自定义的转换函数进行转换。

png_set_read_user_transform_fn(png_ptr,  read_transform_fn);

read_transform_fn为用户自定义的数据转换函数。具体实现可以参考pngtest.c中的实现。

另外你可以通过png_set_user_transform_info告诉libpng你的转换函数的用户自定义数据结构和输出数据的详细信息，比如颜色深度，颜色通道（channel）等等。你可能会问为什么要告诉libpng呢？libpng将会根据这些信息来更新png图片详细信息，后面会介绍。定义如下：

png_set_user_transform_info(png_ptr, user_ptr,user_depth, user_channels);

usr_ptr是用户自定义的数据结构，在用户自定义转换函数read_transform_fn中可以通过png_get_user_transform_ptr函数得到该数据结构，例如：

 voidp read_user_transform_ptr =png_get_user_transform_ptr(png_ptr);

d)更新png数据的详细信息

经过前面的设置png数据的图片信息肯定会有一些变化，则需要调用png_read_update_info函数更新图片的详细信息：

  png_read_update_info(png_ptr, info_ptr);

  该函数将会更新保存于info_ptr变量中的图片数据信息，然后可以再调用png_get_IHDR重新查询图片信息。

e)读取png数据

可以到用png_read_image函数，一次性把所有的数据读入内存，例如：

 png_read_image(png_ptr, row_pointers);

也可以调用png_read_rows一次读入1行或多行到内存中，比如：


   
   
     
     
      
      
     
     
     
     
      
      
       
       for (y 
       
       = 
       
       0; y 
       
       < height; y
       
       +
       
       +)
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
          {
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
              png_
       
       read_rows(png_ptr, 
       
       &row_pointers[y], png_bytepp_
       
       NULL, 
       
       1);
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
           }

f)结束读取数据

通过png_read_end结束读取png数据，代码如下：


   
   
     
     
      
      
     
     
     
     
      
       
       
       /
       
       * 
       
       Read rest 
       
       of 
       
       file, 
       
       and 
       
       get additional chunks 
       
       in info_ptr
       
        - REQUIRED 
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
           png_
       
       read_
       
       end(png_ptr, info_ptr);

6、释放libpng的内存

调用png_destroy_read_struct来释放libpng的内存，代码如下：


   
   
     
     
      
      
     
     
     
     
      
         
       
       /
       
       * Clean 
       
       up 
       
       after the 
       
       read, 
       
       and 
       
       free 
       
       any memory allocated
       
        - REQUIRED 
       
       *
       
       /
      
      
     
     

     
     
      
      
     
     
     
     
      
      
       
           png_destroy_
       
       read_struct(
       
       &png_ptr, 
       
       &info_ptr, png_infopp_
       
       NULL);