论文阅读笔记 | 《Constraint-Based Question Answering with Knowledge Graph》

最新推荐文章于 2022-03-20 17:16:55 发布

ttliu_kiwi

最新推荐文章于 2022-03-20 17:16:55 发布

阅读量2.4k

点赞数 1

分类专栏：问答系统学习总结

本文链接：https://blog.csdn.net/ting0922/article/details/84782471

版权

学习总结同时被 2 个专栏收录

40 篇文章 0 订阅

订阅专栏

问答系统

0 篇文章 0 订阅

订阅专栏

1. 主要内容

在KBQA（基于知识库的问答）的基础上，由于其他问答系统都是回答简单问题，本文提出一种方法可以回答复杂问题（多个限制下的问题），并且提供一个数据集，名为ComplexQuestion，以此评测回答复杂问题的KBQA系统性能。

2个被用作benchmark的数据集：WebQuestions，SimpleQuestions.

多限制问题可以分为六种：多实体限制；答案类型限制；明确的时间限制；不明确的时间限制；序数限制；聚集限制

主要技术：基础查询图（basic query graph）+限制序列（constraint sequence）

2. 重要引用和影响

改进了2015年Yih等人的stage graph，使得实体限制可以加在多个KB fact上，并且考虑的约束要比之前多（stage graph对于非实体约束的处理不恰当）

Pengcheng Yin, Nan Duan, Ben Kao, Junwei Bao, and Ming Zhou. 2015. Answering questions with complex semantic constraints on open knowledge bases. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pages 1301–1310. ACM.

2.1 生成基础查询图

通过实体链接方法检测问题中提到的实体，并且以这个实体为顶点，在KB中找到一跳或两跳的路径作为基础查询图（即查询子图）。

Yi Yang and Ming-Wei Chang. 2015. S-mart: Novel tree-based structured learning algorithms applied to tweet entity linking.

同时，基于CNN模型计算question和基础查询图中的路径（the path of the basic query graph）之间的相似性，以此来评估构建的基础查询图的质量。

Jianfeng Gao, Li Deng, Michael Gamon, Xiaodong He, and Patrick Pantel. 2015. Modeling interestingness with
deep neural networks, December 17. US Patent 20,150,363,688.

2.2 约束检测与绑定

包括两个阶段：约束检测（Constraint Detection）和约束绑定（Constraint Binding）。

接下来讲了怎样将六种限制加到基础的查询图上。

1) 识别问题的类型（type），用到Yao和Durme提出的简单高效的方法

Xuchen Yao and Benjamin Van Durme. 2014. Information extraction over structured data: Question answering
with freebase. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics,
ACL 2014, June 22-27, 2014, Baltimore, MD, USA, Volume 1: Long Papers, pages 956–966.

2) 使用Stanford NER检测时间短语（只能检测明确的时间短语）

Jenny Rose Finkel, Trond Grenager, and Christopher Manning. 2005. Incorporating non-local information into
information extraction systems by gibbs sampling. In Proceedings of the 43rd Annual Meeting on Association
for Computational Linguistics, pages 363–370. Association for Computational Linguistics.

对于表述不明确的时间短语（如：after the Civil War stared），只能利用预定义的关键字进行匹配（when, before, after, during, etc）

3) 使用WordNet提供的手工收集的序号列表和最高级词汇来检测序号和函数操作。