今天来分享Rosalind第二道题:转录
Transcribing DNA into RNA
Problem
An RNA string is a string formed from the alphabet containing ‘A’, ‘C’, ‘G’, and ‘U’.
Given a DNA string t corresponding to a coding strand, its transcribed RNA string u is formed by replacing all occurrences of ‘T’ in t with ‘U’ in u.
Given: A DNA string t having length at most 1000 nt.
Sample input:
CTGTCCTGTCGATATTATCATCGTATCCGCAGAAAGCGTCTAATGTAATGTCCCGGAACGAGTTTCTGGAACCCATCTTCGCGTCCGCTTAAGAGCAGTGTTTGCACGAAGTCGTATAAGGTTTTTCATTGTGCTATCGGAGTCACATTTCCGTTGGCTAGTTTGCCTAAATATCATTAAGTAACTGTTATATTACAACGGGGAGTTCTCCCAGCTCTGTTTCCCGCCTACGATGGGAACCAGTACAGCGATTACGATCCTCTAGGGCCTGGGGATGACTCCGCCAACGCATCGTATGGGACCCACAGCGGTAGCGGATCCCTGTGCTCCCAGAAGGGATTCCGATGAGGGGGCAGCGGCAGTTATATGGAACCTGTCACATCGTTATCGGCTCATTATCCGCGAGGGGATGCAGCATACAATTCTGCCGAAATTCTATTCTCCCATCCCTCTCCTACCGGGTACCCAAGACCCGTCCTCCGAGTACCCCCAAAGTCGCCCAGTCGTTCATAAGTCGGACTGCCACTCACTTACGGGTAATGATAGGGTAATCCATAGGGTGGTCGTGCTGAAACTGGGTCTACCGATACTGAAAGAAGCATTAACTAGATTCAGGTTGCTGAAAGCCTGTAGCAGTCGTAGACCTAATGCAAGGATGCTCCTCTCTGGTGTTTCGTGGCCTCGTACTGTCACAGGACAACATAGCTAATTGATCGGCAACGTTCCGACATGGCTTAGCGCGAGCTGTAATGACAGGAGCCCGCGAATAGGCCAAGGGTCTCGTTAGAAATCCAAATTATACAGGCTTATGATTACCACCCACGAGACACTGTTAACGAGCGTTGGAGTAGATGACCAAATCTTGTAGGCTAGCCCACGGCCGGATGTCGGACAGTTAAGGCATACCGGAATTTCTTGGATGACTACCATGACGGG
Return: The transcribed RNA string of t.
Sample output:
CUGUCCUGUCGAUAUUAUCAUCGUAUCCGCAGAAAGCGUCUAAUGUAAUGUCCCGGAACGAGUUUCUGGAACCCAUCUUCGCGUCCGCUUAAGAGCAGUGUUUGCACGAAGUCGUAUAAGGUUUUUCAUUGUGCUAUCGGAGUCACAUUUCCGUUGGCUAGUUUGCCUAAAUAUCAUUAAGUAACUGUUAUAUUACAACGGGGAGUUCUCCCAGCUCUGUUUCCCGCCUACGAUGGGAACCAGUACAGCGAUUACGAUCCUCUAGGGCCUGGGGAUGACUCCGCCAACGCAUCGUAUGGGACCCACAGCGGUAGCGGAUCCCUGUGCUCCCAGAAGGGAUUCCGAUGAGGGGGCAGCGGCAGUUAUAUGGAACCUGUCACAUCGUUAUCGGCUCAUUAUCCGCGAGGGGAUGCAGCAUACAAUUCUGCCGAAAUUCUAUUCUCCCAUCCCUCUCCUACCGGGUACCCAAGACCCGUCCUCCGAGUACCCCCAAAGUCGCCCAGUCGUUCAUAAGUCGGACUGCCACUCACUUACGGGUAAUGAUAGGGUAAUCCAUAGGGUGGUCGUGCUGAAACUGGGUCUACCGAUACUGAAAGAAGCAUUAACUAGAUUCAGGUUGCUGAAAGCCUGUAGCAGUCGUAGACCUAAUGCAAGGAUGCUCCUCUCUGGUGUUUCGUGGCCUCGUACUGUCACAGGACAACAUAGCUAAUUGAUCGGCAACGUUCCGACAUGGCUUAGCGCGAGCUGUAAUGACAGGAGCCCGCGAAUAGGCCAAGGGUCUCGUUAGAAAUCCAAAUUAUACAGGCUUAUGAUUACCACCCACGAGACACUGUUAACGAGCGUUGGAGUAGAUGACCAAAUCUUGUAGGCUAGCCCACGGCCGGAUGUCGGACAGUUAAGGCAUACCGGAAUUUCUUGGAUGACUACCAUGACGGG
简单来说就是通过输入核酸序列,将T换成U即可。
我们可以采用replace() 方法。
replace() 方法通过用 newChar 字符替换字符串中出现的所有 oldChar 字符,并返回替换后的新字符串。
public String replace(char oldChar, char newChar)
public class Transcribing_DNA_into_RNA {
public static void main(String[] args) {
Scanner sc = new Scanner(System.in);
String s = sc.nextLine();
System.out.println(s.replace('T','U'));
}
}
可见使用replace()方法可以一步完成整条核苷酸序列的字符替换,非常方便且快速。
但是会有明眼的小伙伴提出问题:在此案例下可以完成如上碱基替换,因为输入的碱基序列全部为大写。但是如果输入的碱基序列是小写或者是大小写混用的情况,上述的方法其实并不完善,因此在下方完善了代码。
public class Transcribing_DNA_into_RNA {
public static void main(String[] args) {
Scanner sc = new Scanner(System.in);
String s = sc.nextLine();
for (int i = 0; i < s.length(); i++) {//遍历核酸序列中的每一个碱基
if (s.charAt(i) == 'T') {//若该碱基是大写T
s=s.replace(s.charAt(i), 'U');
} else if (s.charAt(i) == 't') {//若该碱基是小写t
s=s.replace(s.charAt(i), 'u');
}
}
System.out.println(s);
}
}
这一次的solution通过拆分字符串类型的输入数据,并且逐字符判断大小写的T/t。在使用replace进行碱基替换时,分别检验大小写的碱基T/t,从而保证哪怕输入碱基序列是大小写混合的方式也可以正常输出替换后的碱基。