人工学习模式学习机器学习
I got my attention on multimodal learning from Facebook recent Hateful Meme Challenge 2020 on Driven Data. The challenge is about how to make an effective tool for detecting hate speech, and how it must be able to understand content the way people do. Seems pretty cool challenge as it makes use of both text and image for analysing content which is similar to what humans do. Let's dive deep into Multimodal Machine Learning to get what it is actually.
我从Facebook最近关于驱动数据的仇恨Meme挑战2020中获得了对多模式学习的关注。 挑战在于如何制作有效的工具来检测仇恨言论,以及如何必须能够以人们的方式理解内容。 看起来很酷的挑战,因为它同时使用文本和图像来分析类似于人类所做内容的内容。 让我们深入了解多模式机器学习以了解它的实际含义。
多模式学习 (Multimodal Learning)
As per definition Multimodal means that we have two and or more than two modes of communication through combinations of two or more modes. Modes include written language, spoken language, and patterns of meaning that are visual, audio, gestural, tactile and spatial.
按照定义,多模式意味着我们通过两种或多种模式的组合来拥有两种或两种以上的通信模式。 模式包括书面语言,口头语言以及视觉,听觉,手势,触觉和空间的