A Brief Overview of Ten NLP Baseline Papers (10) - SGM

HHVic

Category: NLP Paper. Tags: natural language processing, artificial intelligence, deep learning

Article link: https://blog.csdn.net/landian0531/article/details/121098479

Table of contents

Preface
Contents
1. Paper
2. Background
3. Abstract
4. Research significance
5. Summary

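Preface:

If you are not familiar with the basic concepts, you can refer here. I have collected most of the concepts that the papers rely on, to make the papers easier to follow.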

1. Paper：

SGM: Sequence Generation Model for Multi-Label Classification
Multi-label text classification with a sequence generation model

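2. Background

Multi-label text classification is an important task in natural language processing, with applications in text categorization, recommendation, and information retrieval.
Existing multi-label text classification models, however, have two problems: they ignore the correlations between labels, and they ignore that different parts of a text matter differently for predicting different labels.
To address both problems, the paper uses a Seq2Seq model to learn the correlations between labels and an attention mechanism to learn which parts of the text matter for each label.
In the experiments, the model outperforms the baseline models by a large margin on two multi-label classification datasets, and analysis of the results indicates that it learns both the correlations between labels and the label-specific importance of the text.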
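
To make the "classification as sequence generation" idea concrete, here is a minimal sketch of how an unordered label set could be turned into a target sequence for a Seq2Seq decoder and recovered again afterwards. The helper names, the `<bos>`/`<eos>` markers, and the choice to order labels by training-set frequency are assumptions of this sketch; the text above does not specify them.

```python
from collections import Counter

BOS, EOS = "<bos>", "<eos>"  # hypothetical start/end marker tokens


def build_label_order(train_label_sets):
    """Rank labels by training-set frequency, most frequent first."""
    counts = Counter(label for labels in train_label_sets for label in labels)
    # Ties are broken alphabetically so the ordering is deterministic.
    ranked = sorted(counts, key=lambda label: (-counts[label], label))
    return {label: rank for rank, label in enumerate(ranked)}


def labels_to_sequence(labels, order):
    """Turn an unordered label set into a decoder target sequence."""
    return [BOS] + sorted(labels, key=order.__getitem__) + [EOS]


def sequence_to_labels(sequence):
    """Recover the label set from a generated sequence, stopping at EOS."""
    labels = set()
    for token in sequence:
        if token == EOS:
            break
        if token != BOS:
            labels.add(token)
    return labels


# Toy training data (made up for illustration).
train = [{"sports", "health"}, {"sports"}, {"politics", "sports"}, {"health"}]
order = build_label_order(train)
target = labels_to_sequence({"health", "sports"}, order)
print(target)
print(sequence_to_labels(target))
```

Because the decoder emits one label at a time and conditions on the labels already emitted, a fixed ordering like this gives it a consistent target to learn from, which is where the label correlations can be picked up.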

3. Abstract

Multi-label classification is an important yet challenging task in natural language processing.

It is more complex than single-label classification in that the labels tend to be correlated.

Existing methods tend to ignore the correlations between labels.

Besides, different parts of the text can contribute differently to predicting different labels, which is not considered by existing models.

In this paper, we propose to view the multi-label classification task as a sequence generation problem, and apply a sequence generation model with a novel decoder structure to solve it.

Extensive experimental results show that our proposed methods outperform previous work by a substantial margin.

Further analysis of experimental results demonstrates that the proposed methods not only capture the correlations between labels, but also select the most informative words automatically when predicting different labels.
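
One way to read the sequence generation view is as the usual autoregressive factorization, in which each label is predicted conditioned on the text and on the labels generated so far. The symbols here are notation chosen for this note: x is the input text and y_1, ..., y_n is the label sequence.

```latex
p(y_1, \dots, y_n \mid x) = \prod_{i=1}^{n} p\left(y_i \mid y_1, \dots, y_{i-1}, x\right)
```

The conditioning on y_1, ..., y_{i-1} is what lets such a model represent correlations between labels, in contrast to predicting each label independently.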
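
The claim about selecting the most informative words can be illustrated with a small attention sketch: at each decoding step the decoder state scores every encoder state, and the normalized scores weight the words of the text. This is a simplified illustration that uses dot-product scoring with made-up toy vectors; the scoring function in the paper may differ, and the names `softmax` and `attend` are hypothetical helpers.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(decoder_state, encoder_states):
    """Return (weights, context) for one decoding step.

    Assumption: dot-product scoring, chosen here for brevity.
    """
    scores = [
        sum(d * h for d, h in zip(decoder_state, state))
        for state in encoder_states
    ]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    # The context vector is the attention-weighted sum of encoder states.
    context = [
        sum(w * state[k] for w, state in zip(weights, encoder_states))
        for k in range(dim)
    ]
    return weights, context


# Three toy "word" representations and two different decoder states,
# standing in for the steps that predict two different labels.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
weights_a, context_a = attend([2.0, 0.0], encoder_states)
weights_b, context_b = attend([0.0, 2.0], encoder_states)
print(weights_a)
print(weights_b)
```

The two decoder states put their largest weight on different words, which is the behavior the abstract describes: different labels attend to different parts of the text.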