计算序列之间的相似性矩阵

终是蝶衣梦晓楼

于 2024-07-24 16:42:51 发布

阅读量48

点赞数

文章标签：矩阵线性代数

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/SUMPLUSS/article/details/140667125

版权

# Let's read the sequences from the uploaded file and calculate the sequence similarity matrix.

# Read the sequences from the uploaded file
file_path = '/mnt/data/similar_seq.txt'
with open(file_path, 'r') as file:
    sequences = file.read().split('>')[1:]  # Split the sequences and ignore the first empty split

# Parse the sequences into a dictionary
seq_dict = {}
for seq in sequences:
    lines = seq.strip().split('\n')
    header = lines[0].split()[0]
    sequence = ''.join(lines[1:])
    seq_dict[header] = sequence

# Extract headers and sequences
headers = list(seq_dict.keys())
sequences = list(seq_dict.values())

# Function to calculate sequence similarity
def calculate_similarity(seq1, seq2):
    matches = sum(a == b for a, b in zip(seq1, seq2))
    return matches / min(len(seq1), len(seq2))  # Normalize by the length of the shorter sequence

# Calculate the similarity matrix
import numpy as np

num_sequences = len(sequences)
similarity_matrix = np.zeros((num_sequences, num_sequences))

for i in range(num_sequences):
    for j in range(num_sequences):
        similarity_matrix[i, j] = calculate_similarity(sequences[i], sequences[j])

similarity_matrix, headers

终是蝶衣梦晓楼

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
打赏
0
评论
计算序列之间的相似性矩阵

【代码】计算序列之间的相似性矩阵。
复制链接

扫一扫

终是蝶衣梦晓楼 CSDN认证博客专家 CSDN认证企业博客

码龄5年

109: 原创

12万+: 周排名

2万+: 总排名

7万+: 访问

: 等级

1629: 积分

128: 粉丝

186: 获赞

39: 评论

400: 收藏

私信

关注

热门文章

分类专栏

生物学 6篇
开发日志 3篇
程序代码 12篇
嵌入式 7篇
python 36篇
FPGA 1篇
笔记 39篇
stm32 1篇
Linux 4篇
蓝桥杯 2篇
学习笔记 30篇
操作系统 2篇
计算机二级 6篇
NCRE 2篇
Lua 1篇
游戏开发 1篇
数据分析 3篇
爬虫 3篇

最新评论

go注释！
CSDN-Ada助手: 不知道 Go 技能树是否可以帮到你：https://edu.csdn.net/skill/go?utm_source=AI_act_go
利用Python实现Smithwaterman算法
普通网友: 优质好文，支持支持。【我也写了一些相关领域的文章，希望能够得到博主的指导，共同进步！】
crossover2.py
Henrietta_L: 第一
SourceCode.py
Henrietta_L: 老公好棒
Python实现蚁群算法(样例）
终是蝶衣梦晓楼: import numpy as np class AntColony: def __init__(self, distances, n_ants, n_best, n_iterations, decay, alpha=1, beta=1): """ Args: distances (2D numpy.array): 两点之间的距离的二维数组 n_ants (int): 每次迭代运行的蚂蚁数量 n_best (int): 每次迭代中存放信息素的最佳蚂蚁数量 n_iteration (int): 迭代次数 decay (float): 信息素衰减速率。信息素值乘以衰减速率，因此0.95会导致衰减，0.5会导致更快的衰减。 alpha (int or float): 信息素的指数，较高的alpha给予信息素更大的权重。默认为1 beta (int or float): 距离的指数，较高的beta给予距离更大的权重。默认为1 """ self.distances = distances self.pheromone = np.ones(self.distances.shape) / len(distances) # 初始化信息素矩阵 self.all_inds = range(len(distances)) self.n_ants = n_ants self.n_best = n_best self.n_iterations = n_iterations self.decay = decay self.alpha = alpha self.beta = beta def run(self): shortest_path = None shortest_path_length = np.inf for i in range(self.n_iterations): all_paths = self.gen_all_paths() self.spread_pheronome(all_paths, self.n_best, shortest_path=shortest_path, shortest_path_length=shortest_path_length) shortest_path, shortest_path_length = self.pick_best_path(all_paths) self.pheromone * self.decay return shortest_path, shortest_path_length def spread_pheronome(self, all_paths, n_best, shortest_path, shortest_path_length): sorted_paths = sorted(all_paths, key=lambda x: x[1]) for path, path_length in sorted_paths[:n_best]: for move in path: self.pheromone[move] += 1.0 / self.distances[move] # 更新路径上的信息素 def pick_best_path(self, all_paths): # 按长度对所有路径进行排序 all_paths = sorted(all_paths, key=lambda x: x[1]) best_path = all_paths[0] # 获取最短路径 return best_path def gen_path_dist(self, path): total_dist = 0 for ele in path: total_dist += self.distances[ele] return total_dist def gen_all_paths(self): all_paths = [] for i in range(self.n_ants): path = self.gen_path(0) all_paths.append((path, self.gen_path_dist(path))) return all_paths def gen_path(self, start): path = [] visited = set() visited.add(start) prev = start for i in range(len(self.distances) - 1): move = self.pick_move(self.pheromone[prev], self.distances[prev], visited) path.append((prev, move)) prev = move visited.add(move) path.append((prev, start)) # 返回起始点 return path def pick_move(self, pheromone, dist, visited): pheromone = np.copy(pheromone) pheromone[list(visited)] = 0 row = pheromone ** self.alpha * (( 1.0 / dist) ** self.beta) norm_row = row / row.sum() move = np_choice(self.all_inds, 1, p=norm_row)[0] return move # 辅助函数 def np_choice(a, size, replace=True, p=None): """numpy的random.choice不允许选择不替换的选项，编写一个简单的函数来实现""" idx = np.random.choice(range(len(a)), size=size, replace=replace, p=p) return np.array(a)[idx] # 示例用法 if __name__ == '__main__': # 示例距离矩阵（对称） distances = np.array([[0, 2, 2, 5], [2, 0, 1, 2], [2, 1, 0, 1], [5, 2, 1, 0]]) # 参数设置 n_ants = 3 n_best = 2 n_iterations = 10 decay = 0.95 # 创建蚁群实例 ant_colony = AntColony(distances, n_ants, n_best, n_iterations, decay) # 运行优化 shortest_path, shortest_path_length = ant_colony.run() print("最短路径:", shortest_path) print("最短路径长度:", shortest_path_length)

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

打赏作者

终是蝶衣梦晓楼 你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20

扫码支付：¥1

获取中

扫码支付

您的余额不足，请更换扫码支付或充值

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。