Extracting Nucleotide Sequences

猿大人007

Category: Metagenomics

Article link: https://blog.csdn.net/rojyang/article/details/81223995

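First, align each fq file against the gene set and compute the abundance (relative abundance) of each gene from the alignment.

Then use that abundance (relative abundance) to extract the corresponding sequences from the gene set. getDataFromList.pl handles this second step: it reads a tab-delimited list, takes the key in the query column of each line, and prints every line of the target table whose target column holds the same key.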
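The post does not say how the relative abundance is calculated. As a minimal sketch, assuming the simplest definition (reads mapped to a gene divided by all mapped reads in the sample), it could look like the following. The helper name `relative_abundance` and the counts are hypothetical, and length-normalised definitions are also in common use.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper: turn per-gene read counts into relative abundances.
# Assumption: relative abundance = reads mapped to the gene / all mapped reads.
sub relative_abundance {
    my ($counts) = @_;          # hash ref: gene id => mapped read count
    my $total = 0;
    $total += $_ for values %$counts;
    return {} if $total == 0;   # nothing mapped: avoid dividing by zero

    my %rel;
    $rel{$_} = $counts->{$_} / $total for keys %$counts;
    return \%rel;
}

# Made-up counts for three genes from one sample.
my %counts = (gene_1 => 50, gene_2 => 30, gene_3 => 20);
my $abundance = relative_abundance(\%counts);
printf "%s\t%.2f\n", $_, $abundance->{$_} for sort keys %$abundance;
```

A table like this, filtered to the genes of interest, can then serve as the query list whose ID column selects records from the gene set.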

getDataFromList.pl
#!/usr/bin/perl
use strict;
use warnings;
use Getopt::Long;
use File::Basename qw(basename);

my $usage = <<_USAGE_;
getDataFromList.pl version 1.0
usage:
  getDataFromList.pl -i inputList -d target -q queryColumn -t targetColumn
                     -o outputDir -m mark -l log -td TMP_DONE -p perlDir

  -i   query list (tab-delimited); its keys select lines from the target
  -d   target table (tab-delimited)
  -q   1-based key column in the query list
  -t   1-based key column in the target table
  -o   output directory
  -m   tag for the output name: <outputDir>/<sample>.<mark>.output
  -l, -td, -p   required, but not used by this script
_USAGE_

my ($inputList, $totalRunLog, $TMP_DONE, $target, $queryColumn, $targetColumn, $perl_dir, $outputDir, $mark);
GetOptions(
	"i=s" => \$inputList,
	"d=s" => \$target,
	"l=s" => \$totalRunLog,
	"td=s" => \$TMP_DONE,
	"q=s" => \$queryColumn,
	"t=s" => \$targetColumn,
	"o=s" => \$outputDir,
	"p=s" => \$perl_dir,
	"m=s" => \$mark
) or die $usage;
# -m is now required as well: the output file name is built from it.
die $usage if (!$inputList || !$totalRunLog || !$TMP_DONE || !$target || !$queryColumn || !$targetColumn || !$outputDir || !$perl_dir || !defined $mark);

# Convert the 1-based column numbers to 0-based array indices.
$queryColumn--;
$targetColumn--;

# Index the target table: key => every line that carries this key.
my %target;
open(my $t, '<', $target) or die "$!:$target\n";
while (<$t>) {
	chomp; s/\r//g;          # strip Windows line endings before the empty check
	next if $_ eq '';
	my @entries = split /\t/;
	my $key = $entries[$targetColumn];
	next unless defined $key;   # line has fewer columns than expected
	push @{ $target{$key} }, $_;
}
close $t;

# Sample name: basename of the query list up to its first dot.
my ($sample) = basename($inputList) =~ /^(.+?)(?:\.|$)/;
my $outFile = "$outputDir/$sample.$mark.output";
open(my $in, '<', $inputList) or die "$!:$inputList\n";
open(my $out, '>', $outFile) or die "$!:$outFile\n";
while (<$in>) {
	chomp; s/\r//g;
	next if $_ eq '';
	my @entries = split /\t/;
	my $key = $entries[$queryColumn];
	next unless defined $key && $target{$key};
	# Print every target line that shares the key of this query line.
	print $out "$_\n" for @{ $target{$key} };
}
close $in;
close $out or die "$!:$outFile\n";
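The core of the script is a hash lookup between two tab-delimited tables. The sketch below isolates that step on in-memory data so it can be tried without any files. The function name `select_by_keys` and the sample records are made up for illustration, and the column indices are 0-based here, unlike the 1-based `-q` and `-t` options.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper mirroring the script's lookup on in-memory lines.
# $target_col and $query_col are 0-based in this sketch.
sub select_by_keys {
    my ($target_lines, $target_col, $query_lines, $query_col) = @_;

    # key => every target line carrying that key, in input order
    my %index;
    for my $line (@$target_lines) {
        my $key = (split /\t/, $line)[$target_col];
        push @{ $index{$key} }, $line if defined $key;
    }

    # Walk the query list and collect the matching target lines.
    my @hits;
    for my $line (@$query_lines) {
        my $key = (split /\t/, $line)[$query_col];
        push @hits, @{ $index{$key} } if defined $key && exists $index{$key};
    }
    return @hits;
}

# Made-up data: a gene table (id, sequence) and an abundance list (id, value).
my @genes = ("gene_1\tATGC", "gene_2\tGGCC", "gene_3\tTTAA");
my @list  = ("gene_3\t0.20", "gene_1\t0.50");
print "$_\n" for select_by_keys(\@genes, 0, \@list, 0);
```

The output follows the order of the query list, not of the target table, and a key that appears twice in the query list is printed twice. If unique records are needed, deduplicate the list first.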
