(转)Understanding Memory in Deep Learning Systems: The Neuroscience, Psychology and Technology Perspe...

Understanding Memory in Deep Learning Systems: The Neuroscience, Psychology and Technology Perspectives

2018-08-05 18:50:06

 

This blog is copied fromhttps://towardsdatascience.com/understanding-memory-in-deep-learning-systems-the-neuroscience-psychology-and-technology-78922ae6a1dd 

 

 
 

 

Today I decided to assemble several articles form last year in a longer essay that explains the foundations of memory in deep learning systems.

Memory modeling is an active area of research in the deep learning space. In recent years, techniques such as Neural Turing Machines(NTM) have made significant progress setting up the foundation for building human-like memory structures in deep learning systems. In the past, I’ve written extensibly about the role of memory in artificial intelligence(AI) so I am not planning to bore you restating the same points. Instead, I would like to approach the subject from a different angle and attempt to answer three fundamental questions that we should have in mind when thinking about memory in deep learning models:

a) What makes memory such a complex subject in deep learning systems?

b) Where can we draw inspiration about memory architectures?

c)What are the main techniques used to represent memories in deep learning models?

In order to effectively answer the first two questions, we should look at both the biological and psychological theories of memory. That should take us to the two schools of thought that have influenced our knowledge about memory the most: neuroscience and cognitive psychology. Following that same trend of thought we are going to structure this essay in three main parts. The first part will explain the neuroscience theory of memory. The second part will approach memory from the perspective of cognitive psychology while the final segment will focus on how deep learning is drawing inspiration from those disciplines to incorporate memory into neural networks. So let’s start in the place where memories are created: the human brain.

The Neuroscience Theory of Memory

Understanding how memories are created and, sometimes, destroyed as well as the differences between long and short terms memory have been an important area of neuroscience research in the last decade. One of the iconic subjects that inspired that level of research was been known as patient HM.

Henry Gustav Molaison(HM) suffered an accident at the age of nine that caused him to experience convulsions regularly for the following years. In 1952, at the age of twenty-five, HM underwent a surgery to relieve his symptoms. The procedure was considered initially successful until the doctors discovered that they have accidentally cut part of HM’s hippocampus. as a result, HM was unable to retain new memories.

The idea of living without new memories is the analog of always living in the present. Trust me, I am not talking about the mindfulness way but in the way that you can’t relate to a recent event in the past or envision an event in the future. Patient HM went about his day only retaining information for a few minutes, greeting the same people and asking the same questions over and over again. The HM case was pivotal to help neuroscientists understand how memories are created, stored and recalled.

The modern neuroscience theory of memory involves three fundamental areas of the brain: the thalamus, the prefrontal; cortex and the hippocampus. The thalamus can b considered a router that processes sensory information(vision, touch, speech) and relays is to the sensory lobes of the brain for evaluation. The evaluated information eventually reaches the prefrontal cortex where it enters our consciousness forming short term memories. The information is also sent to the hippocampus which distributes different fragments to various cortices forming long term memories. One of the biggest challenges neuroscience today is to understand how those scattered fragments of memories can be reassembled into cohesive memory experiences. This is known in neuroscience as the “binding problem”.

The Binding Problem

Considered one of the most puzzling aspects of the neuroscience theory of memory, the binding problem challenges the concept of recreating memories from other sensory information. Take the experience of going to a concert with your loved one. Memories about the event will be broken down and stored across different regions of the brain. However, it will only take one experience such as listening to a melody of the same band or seeing your wife dancing to recall the entire memory of the concept. How is this possible?

One theory that solves the binding problem states that memory fragments are linked by electromagnetic vibrations that are constantly flowing through the brain. There vibrations create a temporal(not spatial) link between memory fragments allowing them to activate together as a cohesive memory.

The neuroscience theory of memory give us the foundation to understand some of the main components of an intelligent memory architecture. However, human memory is not only a by-product of the components of the brain but it is also deeply influenced by contextual circumstances. That will be the subject of the next post.

The Cognitive Psychology Theory of Memory

The “binding problem” of the neuroscience theory of memory explains how scattered memory fragments can be recalled into cohesive memories. It turns out that, in order to explain the binding problem, we need to expand beyond the architecture of our brain and evaluate all sorts of psychological contextual elements that deeply influence how memories are recalled. One of the main theories in cognitive psychology that tries to explain the associative nature of memory is known as the Priming Effect.

Associative Memory and the Priming Effect

Like all good theories in cognitive psychology, let’s try to explain the Priming Effect in the context of experiments. Think about the first thing that comes to mind when you hear the word DINNER. Was it wine?( for me it was), dessert?, maybe a Saturday night date? As you can see, something as simple as a word can evoke a mixed set of emotions and even other related words. We are effectively recalling associated memories.

One of the most remarkable results of the prior experiments is to notice how fast you were able to retrieve those related words or memories. That happens because associated memories are part of what Economics Nobel Prize winner Daniel Kanehman calls System 1; they happen quickly and they produce a series of related emotional and physical responses. In psychology, that type of phenomenon is known as Associatively Coherent.

Going back to our word game; the fact that the word DINNER evokes the idea of WINE or DESSERT is known as a priming effect in the sense that “dinner primes dessert”. Priming has an important role explaining how memory works. The priming effect does not only applies to words but also to emotions, physical reactions, instincts and other cognitive phenomenon’s. In the context of memory, the priming effect tells us that memories are not only recalled by associated ideas but by “primed ideas”.

The Availability Heuristic

Another important element of the cognitive psychology theory of memory covers how we recall the frequency of events. For instance, if I ask you “how many concerts have you attended in the last decade? “ you are likely to overestimate the number if the answer feels fluent or you have recently attended a concern. Otherwise, if you don’t enjoy your last concert experience, the number might be too low. This cognitive process is known as the Availability Heuristic and explains how our memories are deeply influenced by the rapid availability of an answer.

By now we have an idea of how we can think about memory in the context of the brain(neuroscience) and our social settings(cognitive psychology). How are those theories imitated in deep learning algorithms?

Memory and Deep Learning

From the neuroscience and cognitive psychology theories of memories we know that any artificial memory system should have a specific set of characteristics to resemble human memory.

a)Partition a memory into segments that describe different areas of knowledge

b)Reassemble disparate segments into cohesive information structures

c)Retrieve data based on contextual and not directly related information as well as external data references

No discipline in computer science can benefit more from a human-like memory system than deep learning. Since its early days, there have been efforts in the deep learning space to model systems that simulate some of the key characteristics of human memory.

Deep Learning and Explicit Memory

In order to understand the relevance of memory in deep learning models, we should differentiate between the concepts of implicit and explicit knowledge. Implicit knowledge is typically subconscious and, consequently, hard to explain. We can find examples of implicit knowledge in areas such as speech and vision analysis such as recognizing a monkey in a picture or the tone and mood in a spoken sentence. Contrasting with that model, explicit knowledge is easily modeled declaratively. For instance, understanding that a monkey is a kind of animal or that certain adjectives are offensive are classic examples of explicit knowledge. We know that deep learning algorithms have made incredible progress representing implicit knowledge byyt they still struggle modeling and “memorizing” explicit knowledge.

What makes explicit knowledge so difficult in the context of deep learning algorithms? If you think about the traditional architecture of neural networks with millions of interconnected nodes, we will realize that they lack the equivalent of a working memory system that can store fragments of inferred pieces of knowledge and their relationships so that it can be easily acceded from different layers in the network. Recently, new deep learning techniques have been created to address this limitation.

Neural Turing Machines

The rapid evolution of deep learning algorithms has triggered the need for memory systems that can resemble the characteristics of human memory when processing explicit knowledge. One of the most popular techniques in the memory modeling space is known as Neural Turing Machines(NTM) and was introduced by DeepMind in 2014.

 

NTM works by expanding a deep neural network with memory cells that can store complete vectors. One of the greatest innovations of NTM is that it uses heuristics to read and write information. For instance, NTM implements a mechanism known as content-based addressing that can retrieve vectors based on input patterns. This is similar to the way humans recall memories based on ctextual experiences. Additionally, NTM includes mechanics for increasing the prominence of memory cells based on how often they are recalled.

NTM is not the only techniques that enables memory capabilities in deep learning systems but is certainly one of the most popular. Imitating the biological and psychological functions of human memory is not an easy endeavor and has become one of the most important areas of research in the deep learning space.

 

 

 

 

转载于:https://www.cnblogs.com/wangxiaocvpr/p/9426761.html

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
### 回答1: 社会推理的认知神经科学是研究人类在社会情境中进行推理和进行决策的过程的领域。它结合了认知神经科学和社会心理学的理论框架,以揭示大脑在社会推理中发挥的角色。 社会推理是指人们对他人行为、观点、意图和情感的推理和解释。这种推理过程涉及到一系列的认知活动,包括注意力、记忆、情感处理和决策。通过研究认知神经科学,我们能够深入了解这些认知过程在大脑中的执行机制。 大脑中涉及社会推理的区域包括前额叶皮层、颞叶和扣带回等。例如,前额叶皮层特别与我们对他人的立场和意图的推理有关,颞叶则与面部表情和语言的理解相关。此外,研究还表明,神经系统中的一些神经递质,如奖赏系统中的多巴胺和仿内啡肽,也在社会推理中发挥重要作用。 通过使用功能性磁共振成像技术和其他神经影像学方法,研究者们能够观察大脑在不同的社会推理任务中的活动模式。例如,一些研究表明,在判断他人的情感时,大脑的扣带回和颞叶皮层活动增加。而在判断他人的动机和意图时,前额叶皮层的活动增加。 这些研究对于理解人类社会互动和决策行为的基础机制至关重要。它们也对心理健康问题,如自闭症谱系障碍和精神疾病等的研究具有重要的影响。通过深入研究社会推理的神经机制,我们可以为改善人类社会交往和心理健康提供更好的理解和治疗方法。 ### 回答2: 社会推理是我们理解和解释他人行为和意图的能力。认知神经科学致力于研究社会推理背后的神经基础。通过使用成像技术,如功能性磁共振成像(fMRI)和脑电图(EEG),研究人员能够观察到在进行社会推理时大脑不同区域的活动。 研究表明,社会推理涉及多个大脑网络。其中一个关键的网络是Theory of Mind(ToM)网络,它涉及前额叶、颞叶和顶叶的区域。这个网络在理解他人的信念、欲望和意图方面起着重要作用。通过观察这些区域的活动,研究人员能够了解他人意图和期望对我们自己的行为和决策产生的影响。 另一个与社会推理相关的网络是镜像神经网络。这个网络涉及到大脑的运动执行区域,主要与模仿和共情相关。研究表明,当我们观察他人的行为时,这个网络会被激活,使我们能够模拟和理解他们的行为意图。 此外,情绪和情感处理也在社会推理中起着重要作用。大脑的情感处理区域,如杏仁核和带状回,与理解他人情感和意向的情感共鸣有关。这些区域的活动可以影响我们对他人行为的解释和反应。 总的来说,社会推理是一个复杂的认知过程,涉及到多个大脑区域的协同作用。通过神经科学的研究,我们可以更好地理解社会推理的基础,并深入探索人类在社交互动中的行为和感知模式。 ### 回答3: 社交推理是指人们对他人的态度、意图和信念进行理解和推断的认知过程。社交推理是人类社会互动的重要组成部分,它涉及到我们如何理解他人的心理状态,以及用于解释和预测他人行为的能力。最近的神经科学研究揭示了社交推理的神经基础。 研究发现,社交推理涉及多个脑区的协同工作。前额叶皮层特别重要,负责处理社交情境中的认知冲突和不一致信息。我们的前额叶皮层还参与理解他人的意图、推断他人的目标和判断他人情感的能力。此外,扣带回皮层也被认为在社交推理中起着重要的作用,它与理解他人的信念、思考其他人的观点和评估他人社交行为的能力有关。 此外,神经科学研究还揭示了情感处理在社交推理中的重要性。边缘系统,尤其是扣带回和扣带回前部的活动,与理解和共感他人的情绪状态密切相关。此外,杏仁核被认为是社会情绪处理的关键结构,它对情绪信息进行加工和情绪体验的产生起着重要作用。 最近的研究还揭示了基因对社交推理的影响。例如,一些研究发现,基因对于影响情感识别、社交记忆和社交行为的能力具有重要作用。此外,某些神经发育相关基因也被发现与社交推理能力的差异有关。 总之,社交推理是人类社会交往中至关重要的认知过程。神经科学的研究揭示了脑区间的协同工作以及情感处理在社交推理中的重要性。此外,基因也被发现与个体社交推理能力的差异有关。这些研究为我们更好地理解人类社会互动的认知过程提供了重要线索。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值