Rosalind Exercise 14


jkl_bio

Tags: python

Article link: https://blog.csdn.net/weixin_44619692/article/details/130880156


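This program finds a longest common substring, also called a motif, in a collection of at most 100 DNA strings, each at most 1 kbp long. It uses a dynamic programming approach: starting from the shortest string, it checks candidate substrings one by one until it finds the longest substring shared by all strings. In the sample data, the input consists of three DNA sequences, and the longest common substring returned is AC.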
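The search described above can be sketched as follows. This is a minimal illustration and not necessarily the author's code: the function name `longest_common_substring` is an assumption, and the sketch uses plain enumeration of the shortest string's substrings (longest first) rather than a dynamic programming table.

```python
def longest_common_substring(seqs):
    """Return one longest substring shared by every string in seqs ("" if none)."""
    if not seqs:
        return ""
    # Any common substring must occur in the shortest string, so only its
    # substrings need to be considered as candidates.
    shortest = min(seqs, key=len)
    # Try candidate lengths from longest to shortest; the first hit is optimal.
    for length in range(len(shortest), 0, -1):
        for start in range(len(shortest) - length + 1):
            candidate = shortest[start:start + length]
            if all(candidate in s for s in seqs):
                return candidate
    return ""


print(longest_common_substring(["GATTACA", "TAGACCA", "ATACA"]))  # TA
```

On the sample data this sketch prints TA rather than AC. Both have length 2 and occur in all three sequences, and the problem accepts any single longest common substring.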


# Problem

# A common substring of a collection of strings is a substring of every member of the collection. We say that a common substring is a longest common substring if there does not exist a longer common substring. For example, "CG" is a common substring of "ACGTACGT" and "AACCGTATA", but it is not as long as possible; in this case, "CGTA" is a longest common substring of "ACGTACGT" and "AACCGTATA".

# Note that the longest common substring is not necessarily unique; for a simple example, "AA" and "CC" are both longest common substrings of "AACC" and "CCAA".

# Given: A collection of k (k≤100) DNA strings of length at most 1 kbp each in FASTA format.

# Return: A longest common substring of the collection. (If multiple solutions exist, you may return any single solution.)
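To make the definition and the non-uniqueness note concrete, here is a small sketch that collects every longest common substring instead of returning just one. The helper name `all_longest_common_substrings` is hypothetical and is used only for illustration on the two examples from the problem statement.

```python
def all_longest_common_substrings(seqs):
    """Return the set of all longest substrings shared by every string in seqs."""
    if not seqs:
        return set()
    shortest = min(seqs, key=len)
    # Scan lengths from longest to shortest and stop at the first length
    # for which at least one candidate occurs in every string.
    for length in range(len(shortest), 0, -1):
        candidates = {
            shortest[i:i + length]
            for i in range(len(shortest) - length + 1)
        }
        common = {c for c in candidates if all(c in s for s in seqs)}
        if common:
            return common
    return set()


print(all_longest_common_substrings(["ACGTACGT", "AACCGTATA"]))  # {'CGTA'}
print(sorted(all_longest_common_substrings(["AACC", "CCAA"])))   # ['AA', 'CC']
```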

# Sample Dataset

# >Rosalind_1

# GATTACA

# >Rosalind_2

# TAGACCA

# >Rosalind_3

# ATACA

# Sample Output

# AC
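Since the input arrives in FASTA format, the sequences have to be extracted before any search can run. The following is a minimal sketch of a parser, assuming each record starts with a `>` header line and that a sequence may span several lines. The name `parse_fasta` is an assumption for illustration, not part of the original post.

```python
def parse_fasta(text):
    """Parse FASTA-formatted text into a dict mapping record ID to sequence."""
    records = {}
    current_id = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # Header line: start a new record.
            current_id = line[1:]
            records[current_id] = ""
        elif current_id is not None:
            # Sequence line: append to the current record.
            records[current_id] += line
    return records


sample = """>Rosalind_1
GATTACA
>Rosalind_2
TAGACCA
>Rosalind_3
ATACA
"""

seqs = list(parse_fasta(sample).values())
# Check which two-letter candidates occur in every sample sequence.
for candidate in ("AC", "CA", "TA"):
    print(candidate, all(candidate in s for s in seqs))
```

All three candidates print `True`, so the sample output AC is one of several equally valid answers of length 2.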

# The problem asks us to find a longest common substring, i.e. a motif, of a collection of DNA strings.