文档图像二值化论文 document images Binarization paper 2

yddcs

已于 2024-03-25 16:07:54 修改

阅读量1.1k

点赞数 1

分类专栏： # 文档图像二值化文章标签：机器学习人工智能深度学习计算机视觉 ieee论文

于 2021-04-21 10:55:43 首次发布

本文链接：https://blog.csdn.net/qq_35200351/article/details/115917806

版权

文档图像二值化专栏收录该内容

4 篇文章 3 订阅

订阅专栏

文档图像二值化DIB paper

在文档分析方面，降级文档图像的二值化是一个具有挑战性的问题。二值化是一种分类过程，其中将图像像素分配给以下两个类中的任一个：前景文本和背景。大多数算法以无监督的方式构建在低级特征上，因此随后的输入域知识的充分利用的禁止显着限制了从前景区分背景噪声。

2018 Binarization of degraded document images based on hierarchical deep supervised network
(PR code 放在Google云盘了)

【文档图像二值化数据集 databases】
【文档图像二值化 paper 系列 -1-】

找paper搭配 Sci-Hub 食用更佳 (๑•̀ㅂ•́)و✧
Sci-Hub 实时更新 : https://tool.yovisun.com/scihub/
公益科研通文献求助：https://www.ablesci.com/

Highlights：
•We propose a supervised binarization method based on the deep supervised networks.

•The multi-scale deep supervised network for binarization has not been reported yet.

•A hierarchical architecture is designed to distinguish text from background noises.

•Different feature levels are dealt by the multi-scale architecture.

•The performance results are considerably better than state-of-the-art methods.

提出一种基于多尺度深度监督网络的监督二值化方法。模型直接从图像区域训练，使用像素值作为输入，以二进制地面真值作为标签。通过提取高层特征，网络可以区分文本像素和背景噪声，从而可以处理文档图像中出现的严重退化。与传统算法相比，该方法生成的二值图像具有更清晰的背景和更好的笔划保留，在广泛使用的DIBCO数据集上取得了最新的结果。

DSN架构的设计基于鲁棒二值化方法的两个主要标准。第一个标准是从前景中区分嘈杂背景的能力，旨在产生干净的背景区域。第二个标准是保留前景的高视觉质量和细节的能力。

由于输入图像的详细信息（例如边缘和边界）通常在更高的特征级别丢失，因此较低级别的特征可用于保留前景的高细节。因此，不同特征级别的集成可能导致更好的性能。文档图像二值化开发的DSN模型包括用于从文档图像中学习不同级别的文本特征的层级结构，从而文本和背景被分类。
在这里插入图片描述
在训练期间，从训练数据集中的图像修补程序和地面实际二进制映射被输入到三个DSN中。在测试期间，获取来自文档图像的本地图像修补作为输入。然后通过选择每个像素位置处的最小前景地图值，将从三个DSN生成的每个图像修补程序的预测前景图集成到一个位置。最后，通过完整前景地图的阈值处理获得二进制文档图像。

2019 DeepOtsu: Document enhancement and binarization using iterative deep learning （PR）

文章思路和这篇深度学习去噪论文比较像。

Highlights
•We propose a novel iteration deep learning which can improve the input image iteratively.

•We apply the proposed iterative deep learning for document enhancement and binarization in two possible ways: recurrent refinement and stacked refinement.

•Our proposed method provides a new, clean version of the degraded image, one that is suitable for visualization and which shows promising results for binarization using Otsu’s global threshold.
在这里插入图片描述
提出的降解模型的示意描述。假设降级图像中的劣化图案X是理想（均匀）模式Xu和降级e的总和

红色虚线表示神经网络的输出可以用作具有不同迭代的迭代微调的输入。

所提出的模型直接从原始图像中学习统一的图像Xu。但是，学习的X可能并不完美;如果网络不提供良好的结果，它也可以被视为劣化的图像X。因此，如果为神经网络设置X = Xu，则可以递归地改进或增强学习Xu.
在这里插入图片描述
2018 Robust regression for image binarization under heavy noise and nonuniform background (PR)

属📕传统方法 Highlights
•This paper advances the background subtraction approach for image binarization.

•Our approach formulates a robust regression to estimate an image background.

•The proposed approach does not require any prior identification of edge pixels.

•The propose threshold selector binarizes noisy images better after background subtraction.

2020 Historical Document Image Binarization: A Review doi:10.1007/s42979-020-00176-1

文章总结的二值化方法有阈值法、文字边缘估计、图像形变、混合模型、CRF、博弈论、ML、DL、tuning等。

另一个强大的趋势是在先前成功的算法上执行自动参数调整。许多算法具有足够丰富的参数空间，使用正确设置的参数（全局或local）可以为几乎所有图像产生良好的二值化。
在这里插入图片描述

2018 博弈论 GiB: a Game theory Inspired Binarization technique for degraded document images TIP

深度学习方法系列讲解可见 paper1 及 database

2015 Document Image Binarization using LSTM: A Sequence Learning Approach

2017 Using Convolutional Encoder-Decoder for Document Image Binarization
在这里插入图片描述
2018 PDNet: Semantic Segmentation integrated with a Primal-Dual Network for Document binarization code lua

2018 PR A selectional auto-encoder approach for document image binarization code: https://github.com/ajgallego/document-image-binarization

2019 Generating Realistic Binarization Data with Generative Adversarial Networks code pix2pix
在这里插入图片描述
2019 Cascading Modular U-Nets for Document Image Binarization

可以看PR的2021 Complex image processing with less data—Document image binarization by integrating multiple pre-trained U-Net (PR)类似。好吧是同一波人￣□￣｜｜

2019 Document Binarization via Multi-Resolutional Attention Model with DRD Loss
在这里插入图片描述
2019 Cluster-based Sample Selection for Document Image Binarization

code：https://github.com/masyagin1998/robin

The implementation, as well as the raw data are available at https://github.com/krntz/Cluster-based-Sample-Selection

2019 ICDAR Learning 2D Morphological Network for Old Document Image Binarization code

在这里插入图片描述

2020 UDBNET: Unsupervised Document Binarization Network via Adversarial Game code

在这里插入图片描述
2019 Improving Document Binarization via Adversarial Noise-texture Augmentation code

和上一篇不是一样的吗？
在这里插入图片描述

在这里插入图片描述

yddcs

关注

1
点赞
踩
5

收藏

觉得还不错? 一键收藏
打赏
2
评论
文档图像二值化论文 document images Binarization paper 2

与传统算法相比，该方法生成的二值图像具有更清晰的背景和更好的笔划保留，在广泛使用的DIBCO数据集上取得了最新的结果。由于输入图像的详细信息（例如边缘和边界）通常在更高的特征级别丢失，因此较低级别的特征可用于保留前景的高细节。文档图像二值化开发的DSN模型包括用于从文档图像中学习不同级别的文本特征的层级结构，从而文本和背景被分类。在文档分析方面，降级文档图像的二值化是一个具有挑战性的问题。大多数算法以无监督的方式构建在低级特征上，因此随后的输入域知识的充分利用的禁止显着限制了从前景区分背景噪声。二值化论文
复制链接

扫一扫