ACL2023论文-系列1

YJII

已于 2023-07-19 11:02:09 修改

阅读量970

点赞数 2

分类专栏：论文记录文章标签：人工智能

于 2023-07-19 09:52:35 首次发布

本文链接：https://blog.csdn.net/Hekena/article/details/131801020

版权

论文记录专栏收录该内容

147 篇文章

订阅专栏

文章探讨了如何将生成的常识知识融入到prompt中以辅助推理，以及通过对比学习和角度空间中的三元组视角优化句子表示。研究了知识质量、数量和融合策略对方法效果的影响，并提出了AdditiveAngularMargin对比损失函数以增强文本表示的区分性。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

文章目录

Prompt——1.Generated Knowledge Prompting for Commonsense Reasoning
Contrastive learning——A Contrastive Framework for Learning Sentence Representations from Pairwise and Triple-wise Perspective in Angular Space
- - 核心

Prompt——1.Generated Knowledge Prompting for Commonsense Reasoning

核心

是把常识知识融入到prompt，用于推理。
生成知识提示，包括从语言模型中生成知识，然后在回答问题时提供知识作为额外输入。

在这里插入图片描述

生成知识提示包括：
(i) 使用少量演示，从语言模型中生成与问题相关的知识陈述；
(ii) 使用第二个语言模型对每个知识陈述进行预测，然后选择置信度最高的预测。

论文贡献

1.调研了外部知识是否对于常识推理有帮助
2.从LLM中产生有用的知识，然后将这些知识融入到带问题的prompt中。

方法效果的影响因素

1.知识的质量
2.知识的数量
3.融入知识的策略（strategy for integrating knowledge during inference）——（1.no knowledge 2. random sentence 3. context sentences 4.template-generated knowledge 5. retrieval-based knowledge ）

方法实现

1.知识生成：利用语言模型基于question生成knowledge statements。
2. 知识融合：生成的知识融入，用于决策推断。
3.在推理时，使用每个generated knowledge statement 做预测，然后，选择highest-confidence 作为最终的prediction。
q表示question，k表示knowledge
在这里插入图片描述
选择置信度最大的作为最终的结果输出。

Contrastive learning——A Contrastive Framework for Learning Sentence Representations from Pairwise and Triple-wise Perspective in Angular Space

pairwise （成对）
triple-wise （三元组）

核心

用角度代替了infoloss中的distance，要克服PLM学习到的semantic represenation是各项异性的缺点.
训练目标： Additive Angular Margin Contrastive Loss (ArcCon Loss)。它通过最大化角度空间中的判定余量来增强成对判别能力。
positive pairs: 同一个sentence做的不同rate的dropout.
negetive pairs: the representations of different sentences within the same batch.