eetcode题解(10): Regular Expression Matching——DP解决正则匹配

18 篇文章 0 订阅

两个参考:

http://xiaohuiliucuriosity.blogspot.com/2014/12/regular-expression-matching.html

https://www.youtube.com/watch?v=l3hda49XcDE&list=PLrmLmBdmIlpuE5GEMDXWf0PWbBD9Ga1lO

问题

Implement regular expression matching with support for ‘.’ and ‘*’.

‘.’ Matches any single character. 
‘*’ Matches zero or more of the preceding element.

The matching should cover the entire input string (not partial).

The function prototype should be: 
bool isMatch(const char *s, const char *p)

Some examples: 
isMatch(“aa”,”a”) → false 
isMatch(“aa”,”aa”) → true 
isMatch(“aaa”,”aa”) → false 
isMatch(“aa”, “a*”) → true 
isMatch(“aa”, “.*”) → true 
isMatch(“ab”, “.*”) → true 
isMatch(“aab”, “c*a*b”) → true

思路

这里面最复杂的操作是”*”,这是个很可恶的操作,因为你永远不知道它多长。但是有一点,”*”不会单独出现,它一定是和前面一个字母或”.”配成一对。看成一对后”X*”,它的性质就是:要不匹配0个,要不匹配连续的“X”

题目的关键就是如何把这一对放到适合的位置。

考虑一个特殊的问题: 
情况1: 
“aaaaaaaaaaaaaaaa” 
“a*aa*”

情况2: 
“aaaaaaaaaaaaaaaa” 
“a*ab*”

在不知道后面的情况的时候,我如何匹配a*?

参考一个这个问题  ,思路会更明显 

 

  • 最长匹配 
    显然不合适,这样后面的a就无法匹配上了

  • 匹配到和后面长度一样的位置,比如情况1,就是留3个a不匹配,让后面3个字母尝试去匹配。 
    这样看似合适,但是遇到情况2就不行了。

  • 回溯,每种”*”的情况,看哪种情况能成功,如果其中出现了问题,马上回溯,换下一种情况
is problem has a typical solution using Dynamic Programming. 
We define the state P[i][j] to be true if s[0..i) matches p[0..j) 
and false otherwise. Then the state equations are:

P[i][j] = P[i - 1][j - 1], if p[j - 1] != 
'*' && (s[i - 1] == p[j - 1] || p[j - 1] == '.');

P[i][j] = P[i][j - 2], if p[j - 1] == '*' and the pattern repeats for 0 times;

P[i][j] = P[i - 1][j] && (s[i - 1] == p[j - 2] || p[j - 2] == '.'),
 if p[j - 1] == '*' and the pattern repeats for at least 1 times.

Putting these together, we will have the following code.

 

class Solution {
public:
    bool isMatch(string s, string p) {
        /**
         * f[i][j]: if s[0..i-1] matches p[0..j-1]
         * if p[j - 1] != '*'
         *      f[i][j] = f[i - 1][j - 1] && s[i - 1] == p[j - 1]
         * if p[j - 1] == '*', denote p[j - 2] with x
         *      f[i][j] is true iff any of the following is true
         *      1) "x*" repeats 0 time and matches empty: f[i][j - 2]
         *      2) "x*" repeats >= 1 times and matches "x*x": s[i - 1] == x && f[i - 1][j]
         * '.' matches any single character
         */
        int m = s.size(), n = p.size();
        vector<vector<bool>> f(m + 1, vector<bool>(n + 1, false));
        
        f[0][0] = true;
        for (int i = 1; i <= m; i++)
            f[i][0] = false;
        // p[0.., j - 3, j - 2, j - 1] matches empty iff p[j - 1] is '*' and p[0..j - 3] matches empty
        for (int j = 1; j <= n; j++)
            f[0][j] = j > 1 && '*' == p[j - 1] && f[0][j - 2];
        
        for (int i = 1; i <= m; i++)
            for (int j = 1; j <= n; j++)
                if (p[j - 1] != '*')
                    f[i][j] = f[i - 1][j - 1] && (s[i - 1] == p[j - 1] || '.' == p[j - 1]);
                else
                    // p[0] cannot be '*' so no need to check "j > 1" here
                    f[i][j] = f[i][j - 2] || (s[i - 1] == p[j - 2] || '.' == p[j - 2]) && f[i - 1][j];
        
        return f[m][n];
    }
};

 

  •  
  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值