[leetcode] 187. Repeated DNA Sequences

All DNA is composed of a series of nucleotides abbreviated as A, C, G, and T, for example: "ACGAATTCCG". When studying DNA, it is sometimes useful to identify repeated sequences within the DNA.

Write a function to find all the 10-letter-long sequences (substrings) that occur more than once in a DNA molecule.

Example:

Input: s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT"

Output: ["AAAAACCCCC", "CCCCCAAAAA"]

 

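Problem summary:

A DNA sequence is made up of the four nucleotides A, T, C, and G. Find every sequence of length 10 that occurs more than once.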

 

Solution 1: HashMap

1. Scan the given string and put each 10-letter-long substring, together with its frequency, into a map.

2. Loop over the map's entrySet and collect every entry for which entry.getValue() > 1.

 

 

 

Code:

import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/*
Time Complexity: O(n), treating each fixed-length (10-character) substring operation as constant time
Space Complexity: O(n)
*/
class Solution {
    public List<String> findRepeatedDnaSequences(String s) {
        List<String> result = new ArrayList<>();
        // corner case: the string is too short to contain any 10-letter substring
        if (s.length() < 10) return result;

        // count how many times each 10-letter substring occurs
        Map<String, Integer> map = new HashMap<>();
        for (int i = 0; i < s.length() - 9; ++i) {
            String key = s.substring(i, i + 10);
            if (map.containsKey(key)) {
                map.put(key, map.get(key) + 1);
            } else {
                map.put(key, 1);
            }
        }

        // keep only the substrings that occur more than once
        for (Map.Entry<String, Integer> entry : map.entrySet()) {
            if (entry.getValue() > 1) {
                result.add(entry.getKey());
            }
        }
        return result;
    }
}
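
As a possible variation on the HashMap approach, the frequency count can be replaced by two sets, since the problem only asks whether a substring occurs more than once, not how many times. The sketch below is not part of the original solution; the class name RepeatedDna and the method name findRepeated are made up for illustration. It relies on Set.add returning false when the element is already present.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper class, named here only for illustration.
class RepeatedDna {
    static List<String> findRepeated(String s) {
        Set<String> seen = new HashSet<>();           // every window met so far
        Set<String> repeated = new LinkedHashSet<>(); // windows met at least twice, in order of discovery
        for (int i = 0; i + 10 <= s.length(); i++) {
            String window = s.substring(i, i + 10);
            // add() returns false if the window was already in 'seen'
            if (!seen.add(window)) {
                repeated.add(window);
            }
        }
        return new ArrayList<>(repeated);
    }

    public static void main(String[] args) {
        // Input taken from the example in the problem statement
        System.out.println(findRepeated("AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT"));
    }
}
```

This version needs only one pass over the string and no explicit corner-case check, because the loop body never runs when the string is shorter than 10 characters. Time and space remain O(n).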

 

Reposted from: https://www.cnblogs.com/liuliu5151/p/10807445.html
