homer分析pca之多样本比较

最新推荐文章于 2024-03-25 09:57:49 发布

qq_39306047

最新推荐文章于 2024-03-25 09:57:49 发布

阅读量749

点赞数

分类专栏： Linux

本文链接：https://blog.csdn.net/qq_39306047/article/details/107467425

版权

Linux 专栏收录该内容

78 篇文章 3 订阅

订阅专栏

homer分析pca之多样本比较https://wiki.bits.vib.be/index.php/Homer
》》》》》首先呀安装limma包
#######
!!! Warnining - something likely failed during R execution.
!!! R script that was used:
####### basic script for running limma for PC1/TADs (generated by HOMER) ########
#### input is matrix data - first column is TAD id, others are TAD inclusion scores
#### Exp1 Exp2
#### chr1:300000-400000 2.45 2.01
library(limma)
#Read Data in
countData <- read.delim(“0.560185876981453.edgeR.in.data.txt”,row.names=1)
colData <- read.delim(“0.560185876981453.edgeR.groups.data.txt”)

design <-model.matrix(~Treatment, data=colData)
fit <- lmFit(countData, design)
fit2 <- eBayes(fit)
res <- topTable(fit2,n=Inf)
tidyResult <- data.frame(TAD=rownames(res), res)
write.table(tidyResult,file="0.560185876981453.edgeR.out.data.txt",sep="\t",row.names=FALSE)

!!! R execution details:
Error in library(limma) : there is no package called ‘limma’
Execution halted

》》》》》》》

##step3
#If you have PC1 bedGraphs from replicate experiments across two conditions, you can identify significantly changing compartments using the following:
annotatePeaks.pl /data/zhangyong/yyp/yypold/hic/hicdata/validpairs/valid/WT_DM_BL-HiC_Rep1.allValidPairs-HiC/WT_DM_BL-HiC_Rep1.allValidPairs-HiC.25x50kb.PC1.txt mm10 -noblanks -bedGraph /data/zhangyong/yyp/yypold/hic/hicdata/pca/WT_DM_BL-HiC_Rep1.allValidPairs-HiC.25x50kb.PC1.bedGraph /data/zhangyong/yyp/yypold/hic/hicdata/pca/WT_DM_BL-HiC_Rep2.allValidPairs-HiC.25x50kb.PC1.bedGraph /data/zhangyong/yyp/yypold/hic/hicdata/pca/WT_GM_BL-HiC_Rep1.allValidPairs-HiC.25x50kb.PC1.bedGraph /data/zhangyong/yyp/yypold/hic/hicdata/pca/WT_GM_BL-HiC_Rep2.allValidPairs-HiC.25x50kb.PC1.bedGraph > outputstep3.txt
getDiffExpression.pl outputstep3.txt DM DM GM GM -pc1 -export outputPrefix > output2Diff.txt

Peak file = /data/zhangyong/yyp/yypold/hic/hicdata/validpairs/valid/WT_DM_BL-HiC_Rep1.allValidPairs-HiC/WT_DM_BL-HiC_Rep1.allValidPairs-HiC.25x50kb.PC1.txt
Genome = mm10
Organism = mouse
Will remove rows with data values of '' or 'NA'
bedGraph Files:
	/data/zhangyong/yyp/yypold/hic/hicdata/pca/WT_DM_BL-HiC_Rep1.allValidPairs-HiC.25x50kb.PC1.bedGraph
	/data/zhangyong/yyp/yypold/hic/hicdata/pca/WT_DM_BL-HiC_Rep2.allValidPairs-HiC.25x50kb.PC1.bedGraph
	/data/zhangyong/yyp/yypold/hic/hicdata/pca/WT_GM_BL-HiC_Rep1.allValidPairs-HiC.25x50kb.PC1.bedGraph
	/data/zhangyong/yyp/yypold/hic/hicdata/pca/WT_GM_BL-HiC_Rep2.allValidPairs-HiC.25x50kb.PC1.bedGraph
Peak/BED file conversion summary:
	BED/Header formatted lines: 0
	peakfile formatted lines: 103857
	Duplicated Peak IDs: 0

Peak File Statistics:
	Total Peaks: 103857
	Redundant Peak IDs: 0
	Peaks lacking information: 0 (need at least 5 columns per peak)
	Peaks with misformatted coordinates: 0 (should be integer)
	Peaks with misformatted strand: 0 (should be either +/- or 0/1)

Peak file looks good!

Reading Positions...
-----------------------
Finding Closest TSS...
Annotating:.............................
	Annotation	Number of peaks	Total size (bp)	Log2 Ratio (obs/exp)	LogP enrichment (+values depleted)
	3UTR	847.0	20769155	0.097	-3.631
	miRNA	1.0	31126	-0.247	0.404
	ncRNA	130.0	3444819	-0.015	0.741
	TTS	1041.0	27332353	-0.001	0.702
	pseudo	20.0	554446	-0.080	0.779
	Exon	1394.0	34653721	0.077	-3.760
	Intron	37785.0	950229103	0.061	-54.352
	Intergenic	61361.0	1653660826	-0.039	60.235
	Promoter	1143.0	30220057	-0.011	0.914
	5UTR	101.0	2376100	0.157	-1.904
	snoRNA	0.0	19	-0.001	0.001
	rRNA	0.0	5631	-0.281	0.215
NOTE: If this part takes more than 2 minutes, there is a good chance
	your machine ran out of memory: consider hitting ctrl+C and rerunning
	the command with "-noann"
To capture annotation stats in a file, use "-annStats <filename>" next time
Annotating:.........................................
	Annotation	Number of peaks	Total size (bp)	Log2 Ratio (obs/exp)	LogP enrichment (+values depleted)
	3UTR	847.0	20769155	0.098	-3.667
	Other	288.0	7952166	-0.073	1.596
	RC?	1.0	10979	1.257	-1.073
	RNA	5.0	113962	0.203	-0.826
	miRNA	1.0	31126	-0.246	0.404
	ncRNA	130.0	3444819	-0.014	0.735
	TTS	1041.0	27332353	-0.001	0.688
	LINE	21179.0	544526405	0.030	-7.739
	LINE?	1.0	8168	1.684	-1.319
	srpRNA	2.0	43307	0.277	-0.711
	SINE	7891.0	196555829	0.075	-13.912
	RC	4.0	65629	0.678	-1.416
	tRNA	13.0	266727	0.355	-1.495
	DNA?	8.0	142070	0.563	-1.715
	pseudo	20.0	554446	-0.079	0.776
	DNA	1137.0	28683535	0.057	-2.372
	Exon	1394.0	34653721	0.078	-3.807
	Intron	23990.0	602909260	0.062	-31.534
	Intergenic	29211.0	823289671	-0.103	111.499
	Promoter	1143.0	30220057	-0.011	0.896
	5UTR	101.0	2376100	0.158	-1.912
	snoRNA	0.0	19	-0.001	0.001
	LTR?	4.0	192563	-0.875	1.935
	scRNA	23.0	602525	0.003	-0.646
	CpG-Island	152.0	3122603	0.353	-6.205
	Low_complexity	754.0	19402023	0.028	-1.209
	LTR	12054.0	313371409	0.014	-1.994
	Simple_repeat	2232.0	57559904	0.025	-1.588
	snRNA	10.0	237324	0.145	-0.872
	Unknown	67.0	2405524	-0.452	5.461
	SINE?	3.0	29758	1.404	-2.240
	Satellite	149.0	4592921	-0.232	3.705
	rRNA	2.0	166488	-1.665	3.031
Counting Tags in Peaks from each directory...
Organism: mouse
Loading Gene Informaiton...
Outputing Annotation File...
Removed 0 with missing data (out of 103857)
Done annotating peaks file

Treating input as file generated by annotatePeaks.pl (-peaks)
Will report raw counts in output table (no normalization)
Using limma to calculate differential expression/enrichment...
Output Stats DM vs. GM:
	Total Genes: 103250
	Total Up-regulated in GM vs. DM: 313 (0.303%) [log2fold>1, FDR<0.05]
	Total Dn-regulated in GM vs. DM: 301 (0.292%) [log2fold<-1, FDR<0.05]

	Total continuous domains: 26 (Up) and 24 (Down)