今天使用split()方法时,发现对于空字符串有丢弃的现象,充满好奇心的我就测试了一下
String test = ",1,,2,3,4,,,,";
String[] tests = test.split(",");
for (int i = 0; i < tests.length; i++) {
System.out.print(tests[i]);
if (i!=tests.length-1){
System.out.print(",");
}
}
输出结果
,1,,2,3,4
可以发现开头,中间的空字符串不会被丢弃,结尾的1~n个空字符串会被丢弃
ctrl+b跳到方法中看看
public String[] split(String regex) {
return split(regex, 0);
}
返回的是两个参数的split()方法的结果,继续跳
public String[] split(String regex, int limit) {
/* fastpath if the regex is a
(1)one-char String and this character is not one of the
RegEx's meta characters ".$|()[{^?*+\\", or
(2)two-char String and the first char is the backslash and
the second is not the ascii digit or ascii letter.
*/
char ch = 0;
if (((regex.value.length == 1 &&
".$|()[{^?*+\\".indexOf(ch = regex.charAt(0)) == -1) ||
(regex.length() == 2 &&
regex.charAt(0) == '\\' &&
(((ch = regex.charAt(1))-'0')|('9'-ch)) < 0 &&
((ch-'a')|('z'-ch)) < 0 &&
((ch-'A')|('Z'-ch)) < 0)) &&
(ch < Character.MIN_HIGH_SURROGATE ||
ch > Character.MAX_LOW_SURROGATE))
{
int off = 0;
int next = 0;
boolean limited = limit > 0;
ArrayList<String> list = new ArrayList<>();
while ((next = indexOf(ch, off)) != -1) {
if (!limited || list.size() < limit - 1) {
list.add(substring(off, next));
off = next + 1;
} else { // last one
//assert (list.size() == limit - 1);
list.add(substring(off, value.length));
off = value.length;
break;
}
}
// If no match was found, return this
if (off == 0)
return new String[]{this};
// Add remaining segment
if (!limited || list.size() < limit)
list.add(substring(off, value.length));
// Construct result
int resultSize = list.size();
if (limit == 0) {
while (resultSize > 0 && list.get(resultSize - 1).length() == 0) {
resultSize--;
}
}
String[] result = new String[resultSize];
return list.subList(0, resultSize).toArray(result);
}
return Pattern.compile(regex).split(this, limit);
}
一大段的看不懂,但找到原因所在了
//list长度是分隔符分割后的真正长度
int resultSize = list.size();
if (limit == 0) {
//如果list最后一个元素长度为0,resultSize就减一,所以对字符串前和中间的无影响
while (resultSize > 0 && list.get(resultSize - 1).length() == 0) {
resultSize--;
}
}
//返回的Array长度与减后的resultSize相同
String[] result = new String[resultSize];
return list.subList(0, resultSize).toArray(result);
经尝试,split(regex,limit)中limit参数只要不传0,就不会把最后的空字符串丢弃(等以后看懂了再更新吧)