Introduction for Structured Text Summarization

    Given the dramatic growth of digital content, new solutions are needed for us to be able to get a quick overview of pertinent information without being inundated by irrelevant details. While there has been ample research on automatic summarization methods, summaries may still be somewhat convoluted and hard to absorb. In our system, we propose the novel task of structured text summarization, which we address by combining ranking techniques with open information extraction. This method yields an uncluttered, more easily digestible overview of key insights from a text.

     In light of the staggering growth of digital content now vying for our attention at any given point in time, reading every long article that we come across in full detail is no longer practical. For instance, having assumed a leadership role at a multinational company, a person will likely lack the time for an in-depth reading of a long, detailed article recounting, say, the political deliberations surrounding the possible introduction of new labor laws in Thailand. Instead, such a person may just wish to receive a very concise overview of the proposals being discussed. While executives may be able to rely on consultants or staff to provide brief executive summaries, it would be helpful to have innovative new technological solutions to this problem. These could enable us to more quickly get an overview of the key information in a long article without being inundated by irrelevant details.

    Past research along these lines has focused on the task of automatic summarization, which compresses a given text to distill a shorter version. While this may go a long way, the resulting summaries may still be poorly organized and convoluted.
    In our system , we propose the novel task of structured text summarization,which seeks to produce structured lists of textual items that are less cluttered and constitute more easily digestible overviews of key insights from a text. We address this task by combining salience-based ranking techniques to identify important content with methods based on open domain information extraction (Open IE) to convert sentences to a structured form, from which the main thoughts are more easily discernible.


Consider the following input sentences:
      Harvard University is a private Ivy League research university in Cambridge, Massachusetts. Harvard is the United States’ oldest institution of higher learning. 


Our system combines and converts these two sentences into a structured form as follows:


评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值