WGCNA | 不止一个组的WGCNA怎么分析嘞！？~（二）（共识网络分析-第二步-构建网络与模块-Blockwise）

最新推荐文章于 2024-08-31 11:04:07 发布

生信漫卷

最新推荐文章于 2024-08-31 11:04:07 发布

阅读量1k

点赞数

文章标签：后端

本文链接：https://blog.csdn.net/m0_72224305/article/details/130894381

版权

本文介绍了在临床工作日益繁忙的背景下，作者如何利用R语言中的WGCNA包进行超大数据集的网络构建与模块识别。首先，作者提出挑选软阈值的方法，通过计算和可视化来确定合适的参数。接着，采用Block-wise的方式构建网络，简化分析过程。最后，对比了一步法和Block-wise方法的结果，显示两者效果相近。

摘要由CSDN通过智能技术生成

1写在前面

临床工作越来越忙了，更新的频率也开始降低了，希望各位小伙伴可以理解一下。😭

最近看到塞尔达-王国之泪发售了，真的是想买一个，但想想根本没有时间去玩，买回来只能吃灰。🥲

想问问小伙伴们有什么想看的教程吗，可以给我留言，如果合适的话可以做上几期。🧐

OK，今天的教程是超大数据集的网络构建与模块识别，当然如果你的电脑或者服务器够好的话，可以跳过这期。🥳

2用到的包

rm(list = ls())
library(tidyverse)
library(WGCNA)

3示例数据

我们还是和之前一样，把之前清洗好的输入数据拿出来吧。😗

load("./Consensus-dataInput.RData")

4提取数据集个数

首先和之前一样提取一下我们的数据集个数，后面会用到。🤓

nSets <-  checkSets(multiExpr)$nSets

5挑选软阈值并可视化

5.1 创建power

先创建一下power向量吧。🐵

powers <-  c(seq(4,10,by=1), seq(12,20, by=2))

5.2 计算power

和之前一样，我们为每个数据集计算一下power，挑选软阈值。🥰

这里的机制是通过选择合适软阈值β来提高共表达相似度，进一步计算邻接度，趋近一个无尺度的拓扑结构。🤓

powerTables = vector(mode = "list", length = nSets)
for (set in 1:nSets){
  
powerTables[[set]] = list(data = pickSoftThreshold(multiExpr[[set]]$data, powerVector=powers,

                                                   verbose = 2)[[2]])
collectGarbage()
}

5.3 可视化相关参数设置

设置一下一会可视化出图的参数，提高颜值，嘿嘿！~😘

colors = c("black", "red")
plotCols = c(2,5,6,7)
colNames = c("Scale Free Topology Model Fit", "Mean connectivity", "Median connectivity",
"Max connectivity")

ylim = matrix(NA, nrow = 2, ncol = 4);
for (set in 1:nSets)
{
for (col in 1:length(plotCols))
{
ylim[1, col] = min(ylim[1, col], powerTables[[set]]$data[, plotCols[col]], na.rm = TRUE);
ylim[2, col] = max(ylim[2, col], powerTables[[set]]$data[, plotCols[col]], na.rm = TRUE);
}
}

5.4 可视化一下吧

来搞一下可视化看看吧，颜值还是高的诶。😂

sizeGrWindow(8, 6)
par(mfcol = c(2,2));
par(mar = c(4.2, 4.2 , 2.2, 0.5))
cex1 = 0.7;
for (col in 1:length(plotCols)) for (set in 1:nSets)
{
if (set==1)
{
plot(powerTables[[set]]$data[,1], -sign(powerTables[[set]]$data[,3])*powerTables[[set]]$data[,2],
xlab="Soft Threshold (power)",ylab=colNames[col],type="n", ylim = ylim[, col],
main = colNames[col]);
addGrid();
}
if (col==1)
{
text(powerTables[[set]]$data[,1], -sign(powerTables[[set]]$data[,3])*powerTables[[set]]$data[,2],
labels=powers,cex=cex1,col=colors[set]);
} else
text(powerTables[[set]]$data[,1], powerTables[[set]]$data[,plotCols[col]],
labels=powers,cex=cex1,col=colors[set]);
if (col==1)
{
legend("bottomright", legend = setLabels, col = colors, pch = 20) ;
} else
legend("topright", legend = setLabels, col = colors, pch = 20) ;
}

6Block-wise的方式构建网络

为简单起见，假设硬件极限是可以同时分析的基因数量为2000。😅

基本思想是使用two-level聚类。🤒

首先，我们使用快速、计算成本低且相对粗略的聚类方法，将基因预聚类到大小接近但不超过2000个基因block大小。😘

然后我们在每个block中执行完整的共识网络分析和模块识别。🤓

最后，合并特征基因在所有数据集中高度相关的模块。🥳

bnet <-  blockwiseConsensusModules(
  multiExpr, maxBlockSize = 2000, power = 6, minModuleSize = 30,
  deepSplit = 2,
  pamRespectsDendro = F,
  mergeCutHeight = 0.25, numericLabels = T,
  minKMEtoStay = 0,
  saveTOMs = T, verbose = 5)

7对比一下

为了对比一下Block-wise是否可以达到一步法的效果，我们把一步法计算的结果加载进来，对比一下。😗

7.1 一步法结果

加载一步法结果进来吧，把label和color匹配一下哦。🤨

load(file = "./Consensus-NetworkConstruction-auto.RData")
bwLabels <-  matchLabels(bnet$colors, moduleLabels, pThreshold = 1e-7)
bwColors <-  labels2colors(bwLabels)

table(bwLabels)

7.2 block分别可视化

我们对每个block都进行一下可视化。😘

sizeGrWindow(12,6)
layout(matrix(c(1:4), 2, 2), heights = c(0.8, 0.2), widths = c(1,1))

nBlocks = length(bnet$dendrograms)

for (block in 1:nBlocks){
  plotDendroAndColors(bnet$dendrograms[[block]], 
                      moduleColors[bnet$blockGenes[[block]]],
                      "Module colors",
                      main = paste("Gene dendrogram and module colors in block", block),
                      dendroLabels = F, hang = 0.03,
                      addGuide = T, guideHang = 0.05,
                      setLayout = F)
  }

7.3 对比一下结果

我们把一步法和Block-wise得到的结果画在一张图上，对比一下吧。🤓

差的不多，还是可以接受的。🐵

sizeGrWindow(12,9)
plotDendroAndColors(consTree,
                    cbind(moduleColors, bwColors),
                    c("Single block", "Blockwise"),
                    dendroLabels = F, hang = 0.03,
                    addGuide = T, guideHang = 0.05,
                    main = "Single block consensus gene dendrogram and module colors")

最后祝大家早日不卷!~

点个在看吧各位~ ✐.ɴɪᴄᴇ ᴅᴀʏ 〰

本文由 mdnice 多平台发布

生信漫卷

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫