Merging multiple expression matrix files produced by featureCounts

终是蝶衣梦晓楼

Published 2024-05-27 16:09:06


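Tags: matrix, linear algebra

Copyright notice: This is an original article by the author, released under the CC 4.0 BY-SA license. Reposts must include a link to the original source and this notice.

Article link: https://blog.csdn.net/sumpluss/article/details/139240857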


import os
import pandas as pd

# Set the input file paths
file_paths = [
    "F:\\Desktop\\daily_project\\pycharm_project\\",
    "F:\\Desktop\\daily_project\\pycharm_project\\",
    "F:\\Desktop\\daily_project\\pycharm_project\\"
    # Use absolute paths here, one per featureCounts output file, and escape every backslash as \\
]

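# Read each featureCounts file and collect one two-column table per sample
dfs = []
for file_path in file_paths:
    # Print the file path to confirm it is correct
    print(f"Processing file: {file_path}")

    # Read the full table; comment='#' skips the leading comment line.
    # Only Geneid and the last (count) column are kept below.
    df = pd.read_csv(file_path, sep='\t', comment='#')

    # Get the name of the last column (the count column)
    count_col = df.columns[-1]

    # Keep Geneid and the count column, and name the count column
    # after the file (without the .count suffix) so the merge is readable
    df = df[['Geneid', count_col]]
    df.columns = ['Geneid', os.path.basename(file_path).replace('.count', '')]

    dfs.append(df)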

# Merge all data frames on Geneid
# (pd.merge defaults to an inner join, so only genes present in every file are kept)
merged_df = dfs[0]
for df in dfs[1:]:
    merged_df = pd.merge(merged_df, df, on='Geneid')

# Save the merged result
output_path = "F:\\Desktop\\daily_project\\pycharm_project\\featurecounts_merge\\merged_counts.txt"
merged_df.to_csv(output_path, sep='\t', index=False)
print(f"Merged file saved to: {output_path}")
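
The script above can also be wrapped into two small functions, which makes it easier to reuse and to check on toy input. The version below is only a sketch under a few assumptions: the names `read_counts` and `merge_counts` are hypothetical and not part of the original script, every input file is assumed to have a `Geneid` column with the counts in its last column, and the `how` parameter is an addition. With `how="outer"`, genes missing from a sample are kept and filled with 0 rather than dropped; pass `how="inner"` to match the behavior of the original loop.

```python
import os
from functools import reduce

import pandas as pd


def read_counts(path):
    """Read one featureCounts table and keep Geneid plus the last (count) column."""
    df = pd.read_csv(path, sep="\t", comment="#")
    # Name the count column after the file, without the .count suffix
    sample = os.path.basename(path).replace(".count", "")
    out = df[["Geneid", df.columns[-1]]].copy()
    out.columns = ["Geneid", sample]
    return out


def merge_counts(paths, how="outer"):
    """Merge per-sample tables on Geneid (hypothetical helper, not from the original post)."""
    tables = [read_counts(p) for p in paths]
    merged = reduce(
        lambda left, right: pd.merge(left, right, on="Geneid", how=how),
        tables,
    )
    # An outer join leaves NaN where a gene is absent from a sample; treat that as 0 reads
    count_cols = [c for c in merged.columns if c != "Geneid"]
    merged[count_cols] = merged[count_cols].fillna(0).astype(int)
    return merged
```

Usage would look like `merge_counts(file_paths).to_csv(output_path, sep='\t', index=False)`, with `file_paths` and `output_path` defined as in the script above.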
