Java regular expressions (java.util.regex): searching for a dollar symbol

I have a search string.

When it contains a dollar symbol, I want to capture all the characters that follow it, up to but not including a dot or a subsequent dollar symbol. The latter would start a new match.

So for either of these search strings:

"/bla/$V_N.$XYZ.bla";

"/bla/$V_N.$XYZ";

I would want to return:

V_N

XYZ

If the search string contains percent symbols, I also want to return what's between the pair of % symbols.

The following regex seems to do the trick for that:

"%([^%]*?)%";

Meaning:

Start and end with a %,

have a capture group, the (),

containing a character class that matches anything except a % symbol (a leading caret negates the class),

repeated, but not greedily: *?
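The breakdown above can be sketched as a small runnable program. The class name PercentPairs, the helper between and the sample input are illustrative assumptions, not part of the original post; only the regex itself comes from the question.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class PercentPairs {
    // Hypothetical helper: collects the text found between each pair of % symbols.
    static List<String> between(String input) {
        List<String> out = new ArrayList<>();
        // The regex from the question: %, a lazily repeated "anything but %", then %.
        Matcher m = Pattern.compile("%([^%]*?)%").matcher(input);
        while (m.find()) {
            out.add(m.group(1)); // group 1 is the part inside the parentheses
        }
        return out;
    }

    public static void main(String[] args) {
        // Sample input invented for illustration.
        System.out.println(between("/bla/%abc%/%de%")); // prints [abc, de]
    }
}
```

Because the class [^%] cannot match a %, the lazy *? and a greedy * should give the same result here; the closing % is what forces the group to extend.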

Where some languages allow %1, %2 for capture groups, Java uses backslash-number syntax (\1, \2) for backreferences inside a pattern instead. So, this string compiles and generates output.

I suspect the dollar symbol and dot need escaping, as they are special symbols:

$ usually matches the end of the string

. is a metacharacter that matches any character.

I have tried escaping them with double backslashes (\\),

both in character classes, e.g. [^\\.\\$%],

and with alternation, e.g. %|\\$,

in attempts to combine this logic, but I can't get anything to work.

I wonder if another pair of eyes can see how to solve this conundrum!

My attempts so far:

import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class Main {
    public static void main(String[] args) {
        String search = "/bla/$V_N.$XYZ.bla";
        String pattern = "([%\\$])([^%\\.\\$]*?)\\1?";
        /* Either % or $ in first capture group ([%\\$])
         * Second capture group - anything except %, dot or dollar sign
         * non greedy group ( *?)
         * then an optional backreference to the first capture group \\1?
         * Have to use two \, since you escape \ in a Java string.
         */
        Pattern r = Pattern.compile(pattern);
        Matcher m = r.matcher(search);
        // Typed list, so the for-each over String below compiles.
        List<String> results = new ArrayList<>();
        while (m.find()) {
            // Group 0 is the whole match; groups 1 and 2 are the capture groups.
            for (int i = 0; i <= m.groupCount(); i++) {
                results.add(m.group(i));
            }
        }
        for (String result : results) {
            System.out.println(result);
        }
    }
}


Solution

You may use

String search = "/bla/$V_N.$XYZ.bla";
String pattern = "[%$]([^%.$]*)";
Matcher matcher = Pattern.compile(pattern).matcher(search);
while (matcher.find()) {
    System.out.println(matcher.group(1));
} // => V_N, XYZ

NOTE

You do not need the optional \1? at the end of the pattern. As it is optional, it does not restrict the match context and is redundant (the negated character class already cannot match either $ or %).
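To see why the attempt in the question produced no useful captures, it can help to compare the lazy and greedy forms of its second group. The sketch below suggests the lazy *? is the likely culprit: with nothing mandatory after it, the group is satisfied by matching zero characters. The class name LazyVsGreedy and the helper groupTwo are illustrative assumptions, not part of the original post.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class LazyVsGreedy {
    // Hypothetical helper: returns group 2 of every match of regex in input.
    static List<String> groupTwo(String regex, String input) {
        List<String> out = new ArrayList<>();
        Matcher m = Pattern.compile(regex).matcher(input);
        while (m.find()) {
            out.add(m.group(2));
        }
        return out;
    }

    public static void main(String[] args) {
        String search = "/bla/$V_N.$XYZ.bla";

        // Pattern from the question: the lazy group stops at zero characters,
        // so group 2 is an empty string for both matches.
        List<String> lazy = groupTwo("([%\\$])([^%\\.\\$]*?)\\1?", search);
        System.out.println(lazy.size() + " matches, all empty: "
                + lazy.stream().allMatch(String::isEmpty));

        // Same pattern with a greedy group: group 2 now holds the names.
        System.out.println(groupTwo("([%\\$])([^%\\.\\$]*)\\1?", search)); // prints [V_N, XYZ]
    }
}
```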

[%$]([^%.$]*) matches % or $, then captures into Group 1 any zero or more chars other than %, . and $. You only need the Group 1 value, hence matcher.group(1) is used.

In a character class, neither . nor $ is special, so they do not need escaping in [^%.$] or [%$].
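As a quick check of the escaping point, the sketch below runs the unescaped and the escaped character classes over the same input and compares the results. The class name CharClassEscaping and the helper captures are illustrative assumptions. Skipping empty captures is also an assumption of this sketch rather than part of the answer: with a %...% pair, the closing % is itself matched by [%$] and yields an empty Group 1.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class CharClassEscaping {
    // Hypothetical helper: returns the non-empty group 1 values of every match.
    static List<String> captures(String regex, String input) {
        List<String> out = new ArrayList<>();
        Matcher m = Pattern.compile(regex).matcher(input);
        while (m.find()) {
            String value = m.group(1);
            // Assumption of this sketch: drop empty captures, e.g. the one
            // produced when the closing % of a %...% pair starts a new match.
            if (!value.isEmpty()) {
                out.add(value);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        String search = "/bla/$V_N.$XYZ.bla";
        // Unescaped and escaped character classes behave the same.
        System.out.println(captures("[%$]([^%.$]*)", search));         // prints [V_N, XYZ]
        System.out.println(captures("[%\\$]([^%\\.\\$]*)", search));   // prints [V_N, XYZ]
        // Sample input with a % pair, invented for illustration.
        System.out.println(captures("[%$]([^%.$]*)", "/bla/%abc%"));   // prints [abc]
    }
}
```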
