＜DataWhale＞- 语义分割 - RLE编码

最新推荐文章于 2024-02-26 19:15:02 发布

lpliner

最新推荐文章于 2024-02-26 19:15:02 发布

阅读量2.2k

点赞数 6

分类专栏：学习文章标签： cv RLE 语义分割

本文链接：https://blog.csdn.net/wuda19920215/article/details/113865418

版权

这篇博客探讨了RLE编码在语义分割中的应用，包括其运作原理、在mask.csv文件中的含义以及源码解析。作者通过举例说明RLE如何对连续像素进行编码，并分析了在图像数据中使用RLE编码的优势。此外，还讨论了源码中处理图像数据的特定技巧，如防止误删有效数据的方法。

摘要由CSDN通过智能技术生成

Task 01 语义分割-RLE编码

def rle_encode(img: np.ndarray) -> str:
    """
    将传入的图片编码成RLE格式
    :img: 需要编码的图片数据，type -> np.ndarray
          其中像素数据表示为：1 -> mask, 0 -> background
    :return: 
    """
    
    # 将图片数据进行扁平化，转换成一维数组                                                           e.g. array = np.array([[1,2],[3,4]])
    # 其中order = {'C', 'F', 'A', 'K'}                                                                 res = array.flatten(order=<order>)
    # C：means to flatten in row-major (C-style) order.                                                              >>> res = [1,2,3,4]
    # F：means to flatten in column-major (Fortran- style) order.                                                    >>> res = [1,3,2,4]
    # A: means to flatten in column-major order if a is Fortran contiguous in memory, row-major order otherwise. >>> res = [1,2,3,4]
    # K: means to flatten a in the order the elements occur in memory.                                             >>> res = [1,2,3,4]
    # 默认值为：C
    pixels = img.flatten(order='F')
    
    # 数组拼接  numpy.concatenate((a1,a2,...), axis=0)
    # >>> pixels = [0, *pixels, 0]
    # TODO: Q1:这里在数组的前后都追加0，的目的是什么？为什么这么做？@lpliner@2021年2月19日11:26:36
    # 因为下方会通过切片取值然后按位比较数据的差异，以此来记录数据更替的交界点，如果不使用追加数据0，那么在切片过程中可能会导致数据的丢失
    pixels = np.concatenate([[0], pixels, [0]])
    
    # np.where(condition)
    # >>> pixels[1:] = [*pixels, 0]
    # >>> pixels[:-1] = [0, *pixels]
    # 当pixels[1:] != pixels[:-1] 依次按照坐标对比，当符合条件的则记下对应的下标，并返回tuple（array（...），）
    # TODO: Q2:这里对比像素的目的是什么？为什么之后又要+1？@lpliner@2021年2月19日11:27:37
    runs = np.where(pixels[1:] != pixels