Table Transformer Model Tutorial

Latest recommended article published on 2024-08-09 08:07:56

赖蓉旖Marlon

Article link: https://blog.csdn.net/gitblog_01030/article/details/141045915

table-transformer: Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric. Project URL: https://gitcode.com/gh_mirrors/ta/table-transformer

1. Project Directory Structure

The table-transformer open-source project is organized into the following main directories and files:

table-transformer
├── data                 # Datasets
│   └── pubtables_1m      # PubTables-1M dataset
├── models                # Model definitions
│   ├── table-transformer       # Table Transformer model code
│   └── table-transformer-detection    # Table detection model
├── scripts               # Scripts for training, evaluation, and inference
│   ├── train.py          # Training script
│   ├── eval.py           # Evaluation script
│   └── inference.py      # Inference script
└── config.py             # Configuration file
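
As a quick sanity check, the tree above can be compared against a local checkout. The sketch below is only an illustration: the path list is copied from this tutorial's tree, while the helper name `find_missing_paths` and the checkout location `table-transformer` are assumptions made for the example. A checkout of the repository may be laid out differently, in which case the function simply reports which entries it could not find.

```python
from pathlib import Path

# Paths copied from the directory tree above; they describe the tutorial's
# layout and are not guaranteed to match every checkout of the repository.
EXPECTED_PATHS = [
    "data/pubtables_1m",
    "models/table-transformer",
    "models/table-transformer-detection",
    "scripts/train.py",
    "scripts/eval.py",
    "scripts/inference.py",
    "config.py",
]


def find_missing_paths(repo_root, expected=EXPECTED_PATHS):
    """Return the expected relative paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [rel for rel in expected if not (root / rel).exists()]


if __name__ == "__main__":
    # "table-transformer" is an assumed location of the local checkout.
    missing = find_missing_paths("table-transformer")
    if missing:
        print("Missing paths:", ", ".join(missing))
    else:
        print("Layout matches the tree above.")
```

Returning the missing entries as a list, rather than raising on the first one, makes it easy to see at a glance how far a given checkout departs from the layout described here.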

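data: Contains the datasets required for training and testing.
models: Contains the model implementations; table-transformer is the base model, and table-transformer-detection is the variant dedicated to table detection.
scripts: Provides utility scripts for training the model, evaluating its performance, and running inference.
config.py: The global configuration file, used to set model parameters, data paths, and similar options.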
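
To make the role of a global configuration file more concrete, here is a minimal sketch of what such a module might hold. Everything in it is hypothetical: the class name `TutorialConfig`, the field names, and the default values are invented for illustration and are not taken from the project's own config.py. Only the dataset path mirrors the tree above, and the "structure" task value is likewise an assumption.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class TutorialConfig:
    """Hypothetical settings; none of these names come from the repository."""

    data_root: Path = Path("data/pubtables_1m")  # dataset path from the tree above
    task: str = "detection"       # assumed values: "detection" or "structure"
    batch_size: int = 2           # illustrative value only
    learning_rate: float = 5e-5   # illustrative value only
    epochs: int = 20              # illustrative value only

    def validate(self):
        """Reject obviously invalid settings and return self for chaining."""
        if self.task not in ("detection", "structure"):
            raise ValueError(f"unknown task: {self.task!r}")
        if self.batch_size <= 0 or self.epochs <= 0:
            raise ValueError("batch_size and epochs must be positive")
        if self.learning_rate <= 0:
            raise ValueError("learning_rate must be positive")
        return self


# Override one field and validate before handing the config to a script.
config = TutorialConfig(task="structure").validate()
```

Keeping model parameters and data paths in one validated object means the training, evaluation, and inference scripts can share the same settings instead of each hard-coding their own.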