Two minutes NLP — Quick intro to Text Style Transfer

Parallel and Non-parallel data, Disentanglement, and Prototype Editing

Text Style Transfer (TST) is an important task in natural language generation that aims to control certain attributes of the generated text, such as politeness, emotion, and humor, while preserving the content. It has a long history in the field of Natural Language Processing and has recently regained significant attention thanks to the rise of deep neural models.

I suggest reading the paper “Deep Learning for Text Style Transfer: A Survey” for a more in-depth explanation.

Applications

  • Intelligent bots for which users prefer a distinct and consistent persona (e.g., empathetic and informal) over an emotionless or inconsistent one.
  • Intelligent writing assistants that help polish text to better fit its purpose, e.g., making it more professional, polite, objective, or humorous, or meeting other advanced writing requirements.
  • Automatic text simplification, i.e. from complex to simple text.
  • Debiasing online text, i.e. from biased to objective text.
  • Fighting offensive language, i.e. from offensive to non-offensive text.

Parallel corpora vs non-parallel corpora

Text Style Transfer algorithms can be developed in a supervised way with parallel corpora, i.e. pairs of texts that share the same content but differ in style, or in an unsupervised way with non-parallel corpora, i.e. separate mono-style collections with no sentence-level alignment between them.
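To make the distinction concrete, here is a minimal sketch (with invented example sentences) of what the two data regimes look like in practice:

```python
# Parallel corpus: each informal sentence is paired with a formal rewrite,
# so a model can be trained directly on (source, target) pairs.
parallel_corpus = [
    ("gotta see this movie!", "You should definitely watch this movie."),
    ("that talk was kinda boring", "The presentation was not very engaging."),
]

# Non-parallel corpora: two independent collections, one per style,
# with no alignment between individual sentences.
informal_corpus = ["gotta see this movie!", "lol that was great"]
formal_corpus = ["The committee approved the proposal.", "We appreciate your feedback."]
```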

Here is a list of common TST subtasks with corresponding datasets.

[Image: List of common subtasks of TST with corresponding datasets, from the paper “Deep Learning for Text Style Transfer: A Survey”.]

Dataset sizes are given as the number of sentences they contain. The last column indicates whether the dataset is parallel or non-parallel.

Methods on parallel data

Most supervised methods adopt the standard neural sequence-to-sequence (seq2seq) encoder-decoder architecture, the same one commonly used for neural machine translation and for text generation tasks such as summarization. The encoder-decoder seq2seq model can be implemented with either LSTMs or Transformers.
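As a rough sketch of the supervised setting, the snippet below fine-tunes a pretrained Transformer seq2seq model on (informal, formal) pairs with the Hugging Face transformers library. The model choice (t5-small), the toy data, and the hyperparameters are illustrative, not taken from any specific paper:

```python
# Minimal sketch: fine-tuning a Transformer seq2seq model on parallel
# (source-style, target-style) pairs. Model name and data are illustrative.
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("t5-small")
model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

# Hypothetical parallel data: informal -> formal.
pairs = [("gotta see this movie!", "You should definitely watch this movie.")]

model.train()
for source, target in pairs:
    inputs = tokenizer(source, return_tensors="pt")
    labels = tokenizer(target, return_tensors="pt").input_ids
    # The model computes the cross-entropy loss against the target tokens.
    loss = model(**inputs, labels=labels).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

# Inference: generate the style-transferred sentence.
model.eval()
out = model.generate(**tokenizer("gotta see this movie!", return_tensors="pt"),
                     max_new_tokens=30)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```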

Methods on non-parallel data

There are three main types of unsupervised approaches for non-parallel data:

  • Disentanglement: disentangle text into its content and attribute in the latent embedding space, and then apply generative modeling.
  • Prototype Editing: delete the parts of a sentence that carry the source attribute (e.g., informal markers when the target is formal) and replace them with words carrying the target attribute, while making sure the resulting text is still fluent (see the toy sketch after this list).
  • Pseudo-parallel Corpus Construction: build pseudo-parallel data so the model can be trained as in the supervised setting. One way to construct pseudo-parallel data is through retrieval, namely extracting aligned sentence pairs from two mono-style corpora.
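As a toy illustration of prototype editing, the sketch below swaps source-attribute words for target-attribute ones using a hand-written lexicon. The lexicon and sentences are invented for illustration; real systems learn attribute markers from the corpora and rescore candidates with a language model to keep the output fluent:

```python
# Toy prototype-editing sketch: delete words that mark the source attribute
# (negative sentiment) and replace them with target-attribute (positive)
# words. The lexicon below is invented for illustration only.
negative_to_positive = {
    "terrible": "wonderful",
    "boring": "engaging",
    "rude": "polite",
}

def transfer(sentence: str) -> str:
    tokens = sentence.split()
    # Keep content words untouched; substitute only attribute markers.
    edited = [negative_to_positive.get(tok.lower(), tok) for tok in tokens]
    return " ".join(edited)

print(transfer("the service was terrible and the staff rude"))
# -> "the service was wonderful and the staff polite"
```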

How to evaluate

A successful style-transferred output must not only demonstrate the correct target style; because neural networks are hard to control, we also need to verify that it preserves the original semantics and maintains natural language fluency.

Therefore, the commonly used practice of evaluation considers the following three criteria:

  • Transferred style strength.
  • Semantic preservation, i.e. the output maintains the meaning of the input.
  • Fluency.

As these criteria are not always easy to compute automatically, both automatic evaluation and human evaluation are commonly employed.
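For a concrete idea of the automatic side, the sketch below uses common proxies: a sentiment classifier as a stand-in for a style classifier, BLEU against the source for semantic preservation, and GPT-2 perplexity for fluency. The specific models and the toy sentences are assumptions for illustration:

```python
# Sketch of the three common automatic metrics; the models are stand-ins.
import math
import torch
from transformers import pipeline, GPT2LMHeadModel, GPT2TokenizerFast
from nltk.translate.bleu_score import sentence_bleu

source = "the service was terrible"
output = "the service was wonderful"

# 1) Transferred style strength: does a style classifier assign the target
# label? Here a generic sentiment classifier stands in for a style classifier.
classifier = pipeline("sentiment-analysis")
print(classifier(output))  # expect a POSITIVE label for a negative->positive transfer

# 2) Semantic preservation: BLEU between output and source (or a reference).
# Bigram BLEU is used here because the toy sentences are very short.
print(sentence_bleu([source.split()], output.split(), weights=(0.5, 0.5)))

# 3) Fluency: perplexity under a pretrained language model (lower is better).
lm_tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
lm = GPT2LMHeadModel.from_pretrained("gpt2")
ids = lm_tokenizer(output, return_tensors="pt").input_ids
with torch.no_grad():
    loss = lm(ids, labels=ids).loss
print(math.exp(loss.item()))
```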

Thank you for reading! If you are interested in learning more about NLP, remember to follow NLPlanet on Medium, LinkedIn, and Twitter!

 
