【Part one: Introduction】 Relation Extraction with Distant Supervision (DS)

yywang_hit

Tags: Distant Supervision; NLP; Relati

Article link: https://blog.csdn.net/yywang_hit/article/details/79646730

1. Relation extraction:

Extracting relational facts from sentences.

Sentence	Relation
1. Steve Jobs and Wozniak co-founded Apple in 1976.	Founder
2. Washington D.C. is the capital of the United States.	CapitalOf

2. Previous method:

Training a relation extractor on a manually labeled, supervised dataset.

3. Problem:

1) Human annotation is costly.

2) Manually labeled datasets are limited in both the number of relations they cover and their size.

4. Distant supervision in RE:

Mintz et al. were the first to apply the DS method to the RE task. The DS method extracts training instances by aligning a knowledge base (KB) with text, in two steps:

1) Find the target relation and its associated entity pair in the KB.

2) Extract the sentences in the text that contain this entity pair.
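
The two steps can be sketched in a few lines of Python. This is only a minimal illustration, not the method of Mintz et al.: the toy `KB`, the `CORPUS` sentences, and the `align` function are hypothetical names introduced here, and entity mentions are found by plain substring matching, whereas a real pipeline would use entity recognition and linking.

```python
# Toy knowledge base (illustrative): (head entity, tail entity) -> relation
KB = {
    ("Steve Jobs", "Apple"): "Founder",
    ("Washington D.C.", "United States"): "CapitalOf",
}

# Toy text corpus (illustrative)
CORPUS = [
    "Steve Jobs and Wozniak co-founded Apple in 1976.",
    "Washington D.C. is the capital of the United States.",
    "Steve Jobs returned to Apple in 1997.",
]


def align(kb, corpus):
    """Collect (sentence, head, tail, relation) training instances."""
    instances = []
    # Step 1: take each relation and its entity pair from the KB.
    for (head, tail), relation in kb.items():
        # Step 2: keep every sentence that mentions both entities.
        for sentence in corpus:
            if head in sentence and tail in sentence:  # naive substring match
                instances.append((sentence, head, tail, relation))
    return instances


instances = align(KB, CORPUS)
for instance in instances:
    print(instance)
```

Note that every sentence mentioning both entities receives the KB relation, whether or not the sentence actually expresses it.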

5. Challenge for DS:

1) Finding a suitable KB for open-domain relation extraction.

2) The wrong label problem (described in the next section).

3) Error propagation caused by feature engineering that relies on NLP tools.

6. Wrong label:

Sentences are extracted from the text under the following assumption: if two entities have a relation in a known knowledge base, then all sentences that mention these two entities express that relation in some way. The training data are therefore labeled automatically as follows: for a triple r(e₁, e₂) in the KB, all sentences that mention both entities e₁ and e₂ are regarded as training instances of relation r.

However, a sentence that mentions two entities may not express the relation that links them in the KB. The two entities may appear in the same sentence simply because they are related to the same topic.

When the entity pair does not have any relation, the instance is labeled NA.
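
The labeling rule and its failure mode can be made concrete with a small sketch. Everything here is an assumption for illustration: the one-entry `KB`, the helper `distant_label`, and the example mentions (the "Alice"/"Bob" sentence is invented). The entity pairs are given explicitly, so no entity recognition is involved.

```python
# Toy knowledge base (illustrative): (head entity, tail entity) -> relation
KB = {("Steve Jobs", "Apple"): "Founder"}


def distant_label(kb, head, tail):
    """Label an entity pair with its KB relation, or "NA" if the KB has none."""
    return kb.get((head, tail), "NA")


# (sentence, head entity, tail entity); the last sentence is made up.
mentions = [
    ("Steve Jobs and Wozniak co-founded Apple in 1976.", "Steve Jobs", "Apple"),
    ("Steve Jobs returned to Apple in 1997.", "Steve Jobs", "Apple"),
    ("Alice met Bob in Paris.", "Alice", "Bob"),
]

labeled = [(sentence, distant_label(KB, head, tail))
           for sentence, head, tail in mentions]

for sentence, label in labeled:
    print(f"{label:8s} {sentence}")
```

The second sentence is labeled Founder even though it says nothing about founding the company: this is a wrongly labeled instance. The third pair is not in the KB, so it is labeled NA.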