Emotion, Event detection
Emotion Detection
What triggers emotions?
stimulus event
- external
- natural phenomena
- other people’s behavior
- internal
- Neuroendocrine or Physiological Changes
- memories
Universal emotion categories:
anger, disgust, fear, happiness, sadness, and surprise
These are also called basic (primary) emotions.
They can be reduced to 4 categories:
- happiness, sadness, fear/surprise, and anger/disgust
Second order (derived) emotions:
Emotional states that are not so basic, like chagrin, irritation
Sentiment vs emotion:
- Sentiments can be formed and retained for a longer time
- Emotions last for a shorter time
- Sentiments are target-centred, hence directed
- Emotions are not target-centred
- A text can have multiple emotions
Emotion detection
- e.g. '7 dead in apartment building fire'
- Anger 10%, disgust 1%, fear 30%, joy 0%, sadness 50%, surprise 5%
Problems in data collection
- Ground-truth labels suffer from uncertainty, incompleteness, and even mistakes, caused by the annotators' level of expertise or the difficulty of the task
Use hashtag-based data collection
- Hashtags give direct access to the user's intent
- List emotion hashtags for 28 affect words, or extend the list with WordNet synsets
- Collect tweets that contain one or more hashtags from the defined list of emotion hashtags
- Consider only tweets with hashtags
- Score each tweet based on its hashtags
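The hashtag-based collection steps can be sketched roughly as follows. This is a minimal illustration, not the method from any specific paper: the lexicon `EMOTION_HASHTAGS` and the helpers `label_by_hashtags` and `collect` are hypothetical names, the five hashtags stand in for the 28 affect words (which the notes do not list), and counting matching hashtags is just one simple way to score a tweet.

```python
import re

# Hypothetical, tiny hashtag lexicon. The notes mention 28 affect words
# (optionally extended with WordNet synsets), which are not listed here.
EMOTION_HASHTAGS = {
    "happy": "happiness",
    "joy": "happiness",
    "sad": "sadness",
    "angry": "anger",
    "scared": "fear",
}

HASHTAG_RE = re.compile(r"#(\w+)")


def label_by_hashtags(tweet):
    """Return {emotion: count} for the emotion hashtags found in a tweet."""
    scores = {}
    for tag in HASHTAG_RE.findall(tweet.lower()):
        emotion = EMOTION_HASHTAGS.get(tag)
        if emotion is not None:
            scores[emotion] = scores.get(emotion, 0) + 1
    return scores


def collect(tweets):
    """Keep only tweets that carry at least one emotion hashtag."""
    labeled = []
    for tweet in tweets:
        scores = label_by_hashtags(tweet)
        if scores:  # tweets without emotion hashtags are skipped
            labeled.append((tweet, scores))
    return labeled
```

For example, `collect(["so tired #sad", "hello world"])` keeps only the first tweet, paired with its score dictionary.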
Emoticons
Symbols that express mood, for example:
https://emojipedia.org/twitter/
- List of emoticons and their associations to eight emotions
- Annotate a tweet into that category if the emoticons appear at the end
Sometimes the signals conflict (happy + sad). In that case, compare the scores from the emotion word lexicon, the hashtag lexicon, and the emoticon lexicon, and assign the tweet to the category with the highest score.
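A rough sketch of emoticon-based annotation and conflict resolution is shown below. The lexicon `EMOTICONS` and the functions `emoticon_label` and `resolve` are hypothetical, and the scores passed to `resolve` are assumed to be comparable across the three lexicons. Picking the single highest score is one reading of the rule in these notes; other combinations (for example, summing per emotion) are equally plausible.

```python
# Hypothetical emoticon lexicon. The notes refer to a list covering
# eight emotions, which is not reproduced here.
EMOTICONS = {
    ":)": "happiness",
    ":D": "happiness",
    ":(": "sadness",
    ":'(": "sadness",
}


def emoticon_label(tweet):
    """Return the emotion of an emoticon at the end of the tweet, or None."""
    text = tweet.rstrip()
    # Try longer emoticons first so a short one never shadows a longer match.
    for emo in sorted(EMOTICONS, key=len, reverse=True):
        if text.endswith(emo):
            return EMOTICONS[emo]
    return None


def resolve(word_scores, hashtag_scores, emoticon_scores):
    """Pick the emotion with the highest score across the three lexicons."""
    best_emotion, best_score = None, 0.0
    for scores in (word_scores, hashtag_scores, emoticon_scores):
        for emotion, score in scores.items():
            if score > best_score:
                best_emotion, best_score = emotion, score
    return best_emotion
```

With a happy emotion word but a stronger sad hashtag, `resolve({"happiness": 0.4}, {"sadness": 0.7}, {})` assigns the tweet to sadness.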
Overall pipeline
Event Detection
Supplementary
- Tokenization, lemmatization, stopword removal
- Named Entity Recognition
i) Detect a named entity
ii) Categorize the entity
- Person
- Organization
- Time
- Location
- Part-of-Speech Tagging
- Word sense disambiguation
- Textual Entailment
Extract a directional relation between text fragments
If you help the needy, God will reward you →
- Giving money to a poor man has good consequences (entailed)
- Giving money to a poor man has no consequences (contradicted)
- Giving money to a poor man will make you a better person (neither entailed nor contradicted)
- Automatic summarization
Extract a readable summary from text (news, articles, documents…)
- Sentiment Analysis
Extract subjective polarity from documents: positive, negative, or neutral
- Vector-Space Representation of Documents
Term-document matrix (TDM) or document-term matrix (DTM)
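As an illustration of the vector-space representation, here is a minimal sketch that builds a document-term matrix from raw counts after tokenization and stopword removal. The stopword set and the function names `tokenize` and `document_term_matrix` are assumptions made for this example. Lemmatization is omitted, and real pipelines often use TF-IDF weights rather than raw counts.

```python
import re
from collections import Counter

# Illustrative stopword set, not a standard list.
STOPWORDS = {"the", "a", "is", "in", "it", "of"}


def tokenize(text):
    """Lowercase, keep alphabetic tokens, and drop stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def document_term_matrix(docs):
    """Return (vocab, matrix) where matrix[i][j] counts term j in document i."""
    counts = [Counter(tokenize(d)) for d in docs]
    vocab = sorted(set().union(*counts))
    matrix = [[c[term] for term in vocab] for c in counts]
    return vocab, matrix


docs = ["The fire is in the building", "Fire, fire in the city"]
vocab, dtm = document_term_matrix(docs)
# Transposing the DTM gives the term-document matrix (one row per term).
tdm = [list(row) for row in zip(*dtm)]
```

Each row of `dtm` is one document's vector over the shared vocabulary, while each row of `tdm` describes one term across all documents.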
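Wikipedia (inverted index):
https://zh.wikipedia.org/wiki/%E5%80%92%E6%8E%92%E7%B4%A2%E5%BC%95
"With the same texts, we get the following full inverted index, where the pairs are document numbers and local word numbers. Like the document numbers, local word numbers also begin with zero. So "banana": {(2, 3)} means that the word "banana" is in the third document (T2), and it is the fourth word in that document (position 3)."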
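The full inverted index described in the quote can be sketched in a few lines. The three example texts are an assumption (chosen so that "banana" lands at position 3 of document 2, as in the quote), and `full_inverted_index` is a hypothetical helper that splits on whitespace only.

```python
from collections import defaultdict


def full_inverted_index(docs):
    """Map each word to the set of (document number, word position) pairs."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):  # document numbers start at zero
        for pos, word in enumerate(text.split()):  # positions start at zero
            index[word].add((doc_id, pos))
    return dict(index)


# Assumed example texts T0, T1, T2.
docs = ["it is what it is", "what is it", "it is a banana"]
index = full_inverted_index(docs)
```

Looking up `index["banana"]` returns the single pair for document 2, position 3, and a word that occurs several times, such as "is", maps to one pair per occurrence.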