Web Science笔记 Emotion, Event detection

Emotion Detection

What triggers emotions?
stimulus event

  • external
    • natural phenomena
    • other people’s behavior
  • internal
    • Neuroendocrine or Physiological Changes
    • memories

Universal emotion categories:
anger, disgust, fear, happiness, sadness and surprise,
also are basic (primary) emotions
can be reduced to 4 categories:

  • happiness, sadness, fear/surprise, and anger/disgust

Second order (derived) emotions:

Emotional states that are not so basic, like chagrin, irritation

Sentiment vs emotion:

  • Sentiments can be formed and retained for longer time

  • Emotion lasts for shorter time

  • Sentiments are target centred, hence directed

  • Emotions are not target centred

  • A text can have multiple emotions

Emotion detection

  • eg: '7 dead in apartment building fire’
  • Anger 10%, disgust 1%, fear 30%, joy 0, sadness 50% surprise 5%

Problems in data collection

  • Uncertainty, incompleteness and even mistakes among the ground truth label due to annotators expertise or task’s difficulty

Use hashtag based on data collection

在这里插入图片描述

  • Direct access to user’s intent
  • List the emotion hashtags of 28 affected words or extend the list by WordNet synsets
  • Collect tweets that contain one or more hashtags that fall in the defined list of emotions hashtags
  • Consider tweets only with hashtags
  • Add score based on this
  • 直接访问用户的意图
  • 列出 28 个受影响词的情感标签或通过 WordNet 同义词集 扩展列表
  • 收集包含一个或多个主题标签的推文,这些主题标签属于已定义的情绪标签列表
  • 只考虑带有主题标签的推文
  • 根据标签算分

在这里插入图片描述

emoticons

符号表示心情例如
在这里插入图片描述
https://emojipedia.org/twitter/

  • List of emoticons and their associations to eight emotions
  • Annotate a tweet into that category if the emoticons appear at the end
    Sometimes we meet conflicts, (happy+sad)
  • 表情符号列表及其与八种情绪的关联
  • 如果表情符号出现在末尾,则将推文注释到该类别中
    有时我们会遇到冲突,(快乐+悲伤),这时候需要比较 emotion word lexicon,hashtags lexicon 和 Emoticon lexicon 的score,哪个高就分给哪个分类

整体流程

在这里插入图片描述

Event Detection

Supplementary

  • Tokenization, Lemmazation, Stopword
    在这里插入图片描述

  • Named Entity Recognition
    i)Detect a named entity
    ii) Categorize the entity

    • Person
    • Organization
    • Time
    • Location
  • Part-of-Speech Tagging
    在这里插入图片描述

  • Word sense disambiguation
    在这里插入图片描述

  • Textual Entailment
    Extract a directional relation between text fragments
    提取文本片段之间的方向关系
    If you help the needy, God will reward you → \rightarrow

    • Giving money to a poor man has good consequences
    • Giving money to a poor man has no consequences
    • Giving money to a poor man will make you better person
  • Automatic summarization
    Extract a readable summary from text (news, articles, documents…)
    从文本中提取可读的摘要(新闻、文章、文件……)

  • Sentiment Analysis
    Exact subjective polarity from documents: positive, negative, or neutral
    来自文档的确切主观极性:正面、负面或中性

  • Vector-Space Representation of Documents
    TDM or DTM matrix在这里插入图片描述

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值