[论文精读]BrainLM: A foundation model for brain activity recordings

夏莉莉iy

已于 2024-07-13 20:56:37 修改

阅读量994

点赞数 13

分类专栏：论文精读文章标签：人工智能笔记深度学习计算机视觉学习分类神经网络

于 2024-07-11 12:34:11 首次发布

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/Sherlily/article/details/140337372

版权

论文精读专栏收录该内容

169 篇文章

订阅专栏

论文网址：pdf (openreview.net)

英文是纯手打的！论文原文的summarizing and paraphrasing。可能会出现难以避免的拼写错误和语法错误，若有发现欢迎评论指正！文章偏向于笔记，谨慎食用

目录

1.2. 论文总结图

2. 论文逐段精读

2.2. Introduction

2.3. Related work

2.4.1. Datasets and preprocessing

2.4.2. Model architecture & training procedure

2.4.3. Clinical variable prediction

2.5.1. Model generalization

2.5.2. Prediction of clinical variables

2.5.3. Prediction of future brain states

2.5.4. Interpretability via attention analysis

2.5.5. Functional network prediction

2.6. Discussion

1. 省流版

1.1. 心得

（1）好简单的模型啊...

（2）预测fMRI“当下的未来”有什么意义？fMRI不是循环吗？就像呼吸一样。作者实际上不能预测患者真正病灶发展的未来吧？

1.2. 论文总结图

2. 论文逐段精读

2.1. Abstract

①Model name: Brain Language Model (BrainLM)

②Recording: 6700 hours fMRI

③Supervision method: self-supervised

④⭐Task: extracting functional connectivity (FC) without supervised network

2.2. Introduction

①⭐Previous work focus on specific and narrow task

②Plight: large amount unlabeled fMRI data

③Method of BrainLM: Transformer based

④Ability of BrainLM: prediction of future brain states, decoding cognitive variables, and discovery of functional networks

⑤Overview of BrainLM:

with 77298 samples and 6700 hours, they pretrained BrainLM by spatiotemporal masking and reconstruction

myriad n. 无数，大量；（多用于古典历史剧中）一万 adj. 无数的，大量的

2.3. Related work

作者觉得其他的要么太focus on specific task了要么就样本量太小，对于大语言模型的话其他工作也主要是再寻找brain recordings的表征相似性（我不知道为什么要找表征相似性我不是这个领域的）

2.4. Methods

2.4.1. Datasets and preprocessing

①Datasets: the UK Biobank (UKB) with 76,296 rs-fMRI recordings and the Human Connectome Project (HCP) with 1002 fMRI data

②80% UKB data for training. 20% UKB data and all the HCP data for testing.

③Preprocessing: standard

④Atlas: AAL-424

2.4.2. Model architecture & training procedure

①Task: predict the original signal of masked patches

②BrainLM:

③Training: randomly select 200 time points in each fMRI data, and divide them into 10 sections with 20 time points each. Converting each section to vector with 512 dimension, masking them as 20%, 75%, and 90%（我猜测是N*10个section中随机mask20%，75%或者90%）

④Order of ROI: change the order of ROI to the real y-axis of the ROI in brain based order

⑤Model framework: constructed by 4 self-attention layers and 4 heads for training unmasked data, and 2-layer Transformer decoder for predicting masked and unmasked vectors

⑥Batch: 512

⑦Optimizer: Adam

⑧Epoch: 100

⑨Goal: minimizing the MSE of original signal and reconstructed signal（只比较Mask部分）

2.4.3. Clinical variable prediction

①Enchancement of prediction: adding 3-layer MLP head in encoder

②Regression task: age, neuroticism, PTSD, and anxiety disorder scores

③Approach:

age	Z-score normalization
neuroticism	min-max scaling to [0, 1]
PTSD (PCL-5) and anxiety disorder (GAD-7) scores	distributeb them exponentially by log transformation

④Dropout rate: 10% for encoder and MLP head

2.5. Results

2.5.1. Model generalization

①The reconstruction performance on UKB and HCP. The red lines denote predicted result and the black points are the real recording:

（HCP是拿来证明泛化能力的）

2.5.2. Prediction of clinical variables

①Reconstruction performance:

②Latent encoding learning:

③Performance table:

delve vi. 钻研；探究；挖 vt. 钻研；探究；挖 n. 穴；洞

2.5.3. Prediction of future brain states

①They applied 180 time steps to train and 20 following to test

②MSE on each time step:

2.5.4. Interpretability via attention analysis

①Mean attention socre on each ROI:

glean vt. 收集（资料）；拾（落穗） vi. 收集；拾落穗

2.5.5. Functional network prediction

①7 subnetworks: visual, somatomotor, dorsal attention, ventral attention, limbic, frontoparietal, and default mode networks

②Region segmentation comparason table:

2.6. Discussion

①Predicting masked distribution

②Predicting mental disorders （？）（把解码的数据送去卷积？还是直接就有结果啊？）

③Recognizing FC（哪里？怎么感觉像脑区分割呢）

3. Reference

Caro, J. O. et al. (2024) 'BrainLM: A foundation model for brain activity recordings', ICLR.

博客等级

码龄3年

231
原创

3743
点赞

4128
收藏

2657
粉丝

关注

私信

热门文章

分类专栏

展开全部收起

最新评论

静息态功能磁共振成像(rs-fMRI)原理与数据分析学习笔记（1）：Resting-State fMRI
Mithlos: 感谢笔记，借帖记录一下自己在听课的时候想补充的 Functional connectivity：其实是correlation，没有物理意义上的连接。 DMN：task independent 体素voxel：大脑的最小单元，可设置不同的成分。体素脑镜像同伦功能连接VMHC：如果功能信号相似性降低，代表脑区之间的功能连接下降，尤其是对于那些一开始左右两区域体素的相关性强的地方。胼胝体受损会直接影响。局部一致性ReHo：表征脑部的活动，检测神经活动是否同步。与功能专一化有关。 ALFF：不同脑区的波频不同，通过fraction归一化抑制了噪声。 Regular图：局部效率高，全局效率低，无随机性。 Random图：局部效率低，全局效率高，有随机性。 Small-world图：局部上脑区内部紧密连接，全局上任意两个脑区只需少量“跳转”即可连通。 Hub：具有高度连接性和关键中介作用的节点。 Module：部连接紧密、外部连接稀疏的脑区子集。 hub连接module，module依赖hub
[arXiv 2024]BrainMAE: A Region-aware Self-supervised Learning Framework for Brain Signals
m0_71258492: 请问这个论文博主有没有复现
[ICLR 2025]Biologically Plausible Brain Graph Transformer
君莫笑∽GL: 博主你好，文中计算FM-Attn(i)的公式最右侧乘了个f(hi)，这个应该相当于是自注意力中的V吧，hi应该是功能模块提取器的输出吧？但是文中框架图的V是通过节点重要性编码NE模块的输出计算的。我就感觉这一点好像图和公式没对上。还一个点就是图中为什么V又和FM-Attn的输出拼接了？
[论文精读]Brain Network Transformer
qq_43025979: 你好，请问作者跑通了代码吗
[数据集]fMRI数据集汇总
魔猴悟菩提€: 您好，可以分享一下REST-meta-MDD数据集吗

大家在看

最新文章

2025

目录

展开全部

收起

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。