JDK 1.8 Source Code Analysis (1): String

1. Class definition

public final class String implements java.io.Serializable, Comparable<String>, CharSequence


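  • String is declared final, so it cannot be subclassed.
  • String implements java.io.Serializable, so instances can be serialized.
  • String implements Comparable<String>, so strings can be ordered. The comparison goes character by character on UTF-16 code unit values, which coincides with ASCII order for ASCII text.
  • String implements CharSequence, meaning it is a readable sequence of char values. In JDK 1.8 it is backed by a char array.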

2. Fields

    /** Stores the characters of the string. */
    private final char value[];

    /** Caches the hash code of the string. */
    private int hash; // Default to 0

    /** Serialization version identifier. */
    private static final long serialVersionUID = -6849794470754667710L;

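The fields show that a String is essentially a wrapper around a private final char array. The final modifier only fixes the reference; the string is immutable because the array is private and the public API never modifies it or exposes it without copying.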
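The copying behavior can be observed from outside the class. The following is a minimal sketch; the class name ImmutabilityDemo and its helper methods are made up for illustration and are not part of the JDK.

```java
class ImmutabilityDemo {
    // Builds a String from an array, then overwrites the array.
    static String buildThenMutateSource() {
        char[] source = {'j', 'a', 'v', 'a'};
        String s = new String(source); // the constructor copies the array
        source[0] = 'X';               // changes only the caller's array
        return s;
    }

    // Overwrites the array returned by toCharArray().
    static String mutateExportedCopy(String s) {
        char[] copy = s.toCharArray(); // a fresh copy, not the internal array
        copy[0] = 'X';
        return s;
    }

    public static void main(String[] args) {
        System.out.println(buildThenMutateSource());     // java
        System.out.println(mutateExportedCopy("hello")); // hello
    }
}
```

In both cases the string keeps its original contents, because the internal array is never shared with the caller.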

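3. Constructors

  • The no-argument constructor creates the empty string "".
  • A String can be passed in. This looks like a deep copy, but in JDK 1.8 the new object shares the original's char array and cached hash; only the String object itself is new.
  • A char array can be passed in, optionally with an offset and a count; the characters are copied.
  • An int array of Unicode code points can be passed in, together with an offset and a count.
  • A byte array of ASCII characters can be passed in with a high-byte value, optionally with an offset and a count (these constructors are deprecated).
  • A byte array can be passed in, optionally with an offset, a length, and a charset. When no charset is given, the platform default charset is used, which is not necessarily UTF-8.
  • A StringBuilder can be passed in.
  • A StringBuffer can be passed in.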
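A few of these constructors in use, as a small sketch. The class name ConstructorDemo and its methods are hypothetical; the charset is passed explicitly so the result does not depend on the platform default.

```java
import java.nio.charset.StandardCharsets;

class ConstructorDemo {
    static String fromCharRange() {
        char[] chars = {'h', 'e', 'l', 'l', 'o'};
        return new String(chars, 1, 3); // offset 1, count 3
    }

    static String fromCodePoints() {
        int[] codePoints = {0x4A, 0x1F600}; // 'J' and U+1F600 (outside the BMP)
        return new String(codePoints, 0, codePoints.length);
    }

    static String fromBytes() {
        byte[] utf8 = {(byte) 0xE4, (byte) 0xB8, (byte) 0xAD}; // UTF-8 bytes of U+4E2D
        return new String(utf8, StandardCharsets.UTF_8);
    }

    static String fromBuilder() {
        return new String(new StringBuilder("abc"));
    }

    public static void main(String[] args) {
        System.out.println(fromCharRange());           // ell
        System.out.println(fromCodePoints().length()); // 3: 'J' plus a surrogate pair
        System.out.println(fromBytes().length());      // 1
        System.out.println(fromBuilder());             // abc
    }
}
```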

4. Methods

hashCode method

public int hashCode() {
    int h = hash;
    // Compute only if the hash is not cached yet and the string is non-empty
    if (h == 0 && value.length > 0) {
        char val[] = value;

        for (int i = 0; i < value.length; i++) {
            h = 31 * h + val[i];
        }
        hash = h;
    }
    return h;
}


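The core of String's hash algorithm is the statement h = 31 * h + val[i] inside the for loop. Expanded, it computes s[0]*31^(n-1) + s[1]*31^(n-2) + ... + s[n-1]. Why is 31 chosen as the multiplier?

  • 31 is a prime of moderate size, one of the commonly preferred multipliers for hash codes.
  • The JVM can optimize multiplication by 31, since 31 * i == (i << 5) - i, and a shift plus a subtraction has traditionally been cheaper than a multiplication.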
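Both points can be checked with a short sketch. HashDemo, polyHash and times31 are illustrative names, not JDK code; polyHash simply repeats the loop shown above without the caching.

```java
class HashDemo {
    // Recomputes s[0]*31^(n-1) + ... + s[n-1] the same way the loop above does.
    static int polyHash(String s) {
        int h = 0;
        for (int i = 0; i < s.length(); i++) {
            h = 31 * h + s.charAt(i);
        }
        return h;
    }

    // Multiplication by 31 expressed as a shift and a subtraction.
    static int times31(int i) {
        return (i << 5) - i;
    }

    public static void main(String[] args) {
        System.out.println(polyHash("abc"));  // 96354
        System.out.println("abc".hashCode()); // 96354
        System.out.println(times31(7));       // 217
    }
}
```

For "abc" the sum is 97*961 + 98*31 + 99 = 96354. The identity in times31 also holds when the multiplication overflows, because both sides wrap around in 32-bit arithmetic.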

equals method

public boolean equals(Object anObject) {
    if (this == anObject) {
        return true;
    }
    if (anObject instanceof String) {
        String anotherString = (String)anObject;
        int n = value.length;
        if (n == anotherString.value.length) {
            char v1[] = value;
            char v2[] = anotherString.value;
            int i = 0;
            while (n-- != 0) {
                if (v1[i] != v2[i])
                    return false;
                i++;
            }
            return true;
        }
    }
    return false;
}

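  • If both references point to the same object, return true.
  • If the argument is a String of the same length and every character matches, return true.
  • Otherwise return false.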
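The three cases can be illustrated as follows. This is a sketch with made-up names (EqualsDemo, sameReference, sameContent).

```java
class EqualsDemo {
    static boolean sameReference(String a, String b) {
        return a == b;
    }

    static boolean sameContent(String a, Object b) {
        return a.equals(b);
    }

    public static void main(String[] args) {
        String literal = "java";
        String copy = new String("java"); // distinct object, same characters

        System.out.println(sameReference(literal, copy));                     // false
        System.out.println(sameContent(literal, copy));                       // true
        System.out.println(sameContent(literal, new StringBuilder("java")));  // false: not a String
        System.out.println(sameContent(literal, null));                       // false
    }
}
```

The StringBuilder case fails the instanceof check, so equal characters alone are not enough.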

length method

public int length() {
    return value.length;
}

  • length simply returns the length of the char array, that is, the number of UTF-16 code units.

isEmpty method

public boolean isEmpty() {
    return value.length == 0;
}

  • A string is empty exactly when the length of its char array is 0.

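charAt method

public char charAt(int index) {
    if ((index < 0) || (index >= value.length)) {
        throw new StringIndexOutOfBoundsException(index);
    }
    return value[index];
}

  • Returns the char at the given index, or throws StringIndexOutOfBoundsException if the index is out of range.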
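Because length, isEmpty and charAt all work directly on the char array, they count UTF-16 code units rather than Unicode code points. A small sketch, with CodeUnitDemo and charAtThrows as illustrative names:

```java
class CodeUnitDemo {
    // "a" followed by U+1F600, which needs a surrogate pair in UTF-16.
    static final String S = "a\uD83D\uDE00";

    // Reports whether charAt rejects the index.
    static boolean charAtThrows(String s, int index) {
        try {
            s.charAt(index);
            return false;
        } catch (StringIndexOutOfBoundsException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println(S.length());                           // 3 code units
        System.out.println(S.codePointCount(0, S.length()));      // 2 code points
        System.out.println(Character.isHighSurrogate(S.charAt(1))); // true
        System.out.println("".isEmpty());                         // true
        System.out.println(" ".isEmpty());                        // false
        System.out.println(charAtThrows(S, 3));                   // true
    }
}
```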

compareTo method

public int compareTo(String anotherString) {
    int len1 = value.length;
    int len2 = anotherString.value.length;
    int lim = Math.min(len1, len2);
    char v1[] = value;
    char v2[] = anotherString.value;

    int k = 0;
    while (k < lim) {
        char c1 = v1[k];
        char c2 = v2[k];
        if (c1 != c2) {
            return c1 - c2;
        }
        k++;
    }
    return len1 - len2;
}

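  • The two strings are compared in a loop that runs over the length of the shorter one.
  • If two characters at the same position differ, the difference of their Unicode (UTF-16) values is returned.
  • If all compared characters are equal, the difference of the two lengths is returned.
  • compareToIgnoreCase follows the same process, except that a differing pair of characters is first folded to upper case, and then to lower case, before it is treated as a real difference.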
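The rules above can be restated through the public API. compareLikeJdk8 below is a hypothetical re-implementation for illustration only; it is expected to agree with String.compareTo.

```java
class CompareDemo {
    // Same logic as the listing above, written against charAt() and length().
    static int compareLikeJdk8(String a, String b) {
        int lim = Math.min(a.length(), b.length());
        for (int k = 0; k < lim; k++) {
            char c1 = a.charAt(k);
            char c2 = b.charAt(k);
            if (c1 != c2) {
                return c1 - c2; // first differing position decides
            }
        }
        return a.length() - b.length(); // one string is a prefix of the other
    }

    public static void main(String[] args) {
        System.out.println(compareLikeJdk8("apple", "apricot")); // -2 ('p' - 'r')
        System.out.println(compareLikeJdk8("app", "apple"));     // -2 (3 - 5)
        System.out.println(compareLikeJdk8("B", "a"));           // -31 ('B' - 'a')
        System.out.println("B".compareToIgnoreCase("a") > 0);    // true
    }
}
```

The last two lines show why case matters: "B" sorts before "a" by raw char value, but after it once case is ignored.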

startsWith method

public boolean startsWith(String prefix, int toffset) {
    char ta[] = value;
    int to = toffset;
    char pa[] = prefix.value;
    int po = 0;
    int pc = prefix.value.length;
    // Note: toffset might be near -1>>>1.
    if ((toffset < 0) || (toffset > value.length - pc)) {
        return false;
    }
    while (--pc >= 0) {
        if (ta[to++] != pa[po++]) {
            return false;
        }
    }
    return true;
}

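  • If the offset is negative, return false.
  • If the offset plus the length of prefix exceeds the length of this string's char array, return false.
  • If any pair of characters differs during the loop, return false.
  • Otherwise return true.
  • startsWith(String prefix) simply calls startsWith(prefix, 0).
  • endsWith(String suffix) simply calls startsWith(suffix, value.length - suffix.value.length).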
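The endsWith delegation can be reproduced with the public API. This is a sketch; endsWithViaStartsWith is an illustrative helper, not a JDK method.

```java
class StartsWithDemo {
    // Mirrors the delegation described above using length() instead of the private array.
    static boolean endsWithViaStartsWith(String s, String suffix) {
        return s.startsWith(suffix, s.length() - suffix.length());
    }

    public static void main(String[] args) {
        System.out.println("hello".startsWith("ll", 2));                 // true
        System.out.println("hello".startsWith("he", -1));                // false: negative offset
        System.out.println(endsWithViaStartsWith("report.pdf", ".pdf")); // true
        System.out.println(endsWithViaStartsWith("a", "abc"));           // false: offset becomes negative
    }
}
```

When the suffix is longer than the string, the computed offset is negative, so the first check in startsWith rejects it without any extra length test.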

indexOf method

public int indexOf(int ch, int fromIndex) {
    final int max = value.length;
    // If fromIndex is negative, start from 0
    if (fromIndex < 0) {
        fromIndex = 0;
    } else if (fromIndex >= max) {
        // Note: fromIndex might be near -1>>>1.
        return -1;
    }

    if (ch < Character.MIN_SUPPLEMENTARY_CODE_POINT) {
        // handle most cases here (ch is a BMP code point or a
        // negative value (invalid code point))
        final char[] value = this.value;
        for (int i = fromIndex; i < max; i++) {
            if (value[i] == ch) {
                return i;
            }
        }
        return -1;
    } else {
        return indexOfSupplementary(ch, fromIndex);
    }
}

private int indexOfSupplementary(int ch, int fromIndex) {
    if (Character.isValidCodePoint(ch)) {
        final char[] value = this.value;
        final char hi = Character.highSurrogate(ch);
        final char lo = Character.lowSurrogate(ch);
        final int max = value.length - 1;
        for (int i = fromIndex; i < max; i++) {
            if (value[i] == hi && value[i + 1] == lo) {
                return i;
            }
        }
    }
    return -1;
}

  • indexOf(int ch) simply calls indexOf(ch, 0).
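The two branches (BMP character versus supplementary code point) and the fromIndex clamping can be exercised like this. IndexOfDemo and find are illustrative names.

```java
class IndexOfDemo {
    // 'x', then U+1F600 as a surrogate pair, then 'y'.
    static final String S = "x\uD83D\uDE00y";

    static int find(int codePoint, int fromIndex) {
        return S.indexOf(codePoint, fromIndex);
    }

    public static void main(String[] args) {
        System.out.println(find('y', 0));     // 3: BMP path, plain char comparison
        System.out.println(find(0x1F600, 0)); // 1: supplementary path, matches the pair
        System.out.println(find(0x1F600, 2)); // -1: index 2 is the low surrogate
        System.out.println(find('x', -5));    // 0: negative fromIndex is treated as 0
        System.out.println(find('x', 100));   // -1: fromIndex beyond the end
    }
}
```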

substring method

public String substring(int beginIndex) {
    if (beginIndex < 0) {
        throw new StringIndexOutOfBoundsException(beginIndex);
    }
    int subLen = value.length - beginIndex;
    if (subLen < 0) {
        throw new StringIndexOutOfBoundsException(subLen);
    }
    // Reuse this object when the whole string is requested
    return (beginIndex == 0) ? this : new String(value, beginIndex, subLen);
}

public String substring(int beginIndex, int endIndex) {
    if (beginIndex < 0) {
        throw new StringIndexOutOfBoundsException(beginIndex);
    }
    if (endIndex > value.length) {
        throw new StringIndexOutOfBoundsException(endIndex);
    }
    int subLen = endIndex - beginIndex;
    if (subLen < 0) {
        throw new StringIndexOutOfBoundsException(subLen);
    }
    return ((beginIndex == 0) && (endIndex == value.length)) ? this
            : new String(value, beginIndex, subLen);
}
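The bounds checks in both overloads translate into the following observable behavior. This is a sketch; SubstringDemo and sliceThrows are names invented for the example.

```java
class SubstringDemo {
    // Reports whether substring(begin, end) rejects the range.
    static boolean sliceThrows(String s, int begin, int end) {
        try {
            s.substring(begin, end);
            return false;
        } catch (StringIndexOutOfBoundsException e) {
            return true;
        }
    }

    public static void main(String[] args) {
        System.out.println("hello".substring(1, 3));      // el (end index is exclusive)
        System.out.println("hello".substring(5).isEmpty()); // true: beginIndex == length is allowed
        System.out.println(sliceThrows("hello", 2, 1));   // true: negative length
        System.out.println(sliceThrows("hello", 0, 6));   // true: end beyond length
        System.out.println(sliceThrows("hello", -1, 2));  // true: negative begin
    }
}
```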


concat method

public String concat(String str) {
    int otherLen = str.length();
    if (otherLen == 0) {
        return this;
    }
    int len = value.length;
    // Copy this string's chars into a larger array, then append str's chars
    char buf[] = Arrays.copyOf(value, len + otherLen);
    str.getChars(buf, len);
    return new String(buf, true);
}
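A short usage sketch of concat, including the empty-argument shortcut visible in the listing. ConcatDemo and join are illustrative names.

```java
class ConcatDemo {
    static String join(String a, String b) {
        return a.concat(b);
    }

    public static void main(String[] args) {
        String s = "ab";
        System.out.println(join(s, "cd"));    // abcd
        System.out.println(join(s, "") == s); // true: an empty argument returns this
        System.out.println(s);                // ab: the receiver is never modified
    }
}
```

Since the first statement calls str.length(), passing null throws a NullPointerException.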


replace method

public String replace(char oldChar, char newChar) {
    if (oldChar != newChar) {
        int len = value.length;
        int i = -1;
        char[] val = value; /* avoid getfield opcode */

        // Find the first occurrence of oldChar
        while (++i < len) {
            if (val[i] == oldChar) {
                break;
            }
        }
        if (i < len) {
            char buf[] = new char[len];
            // Copy the unchanged part before the first occurrence
            for (int j = 0; j < i; j++) {
                buf[j] = val[j];
            }
            while (i < len) {
                char c = val[i];
                buf[i] = (c == oldChar) ? newChar : c;
                i++;
            }
            return new String(buf, true);
        }
    }
    return this;
}
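The listing allocates a new array only when oldChar actually occurs. A small sketch of the resulting behavior, with ReplaceDemo and swap as illustrative names:

```java
class ReplaceDemo {
    static String swap(String s, char oldChar, char newChar) {
        return s.replace(oldChar, newChar);
    }

    public static void main(String[] args) {
        String s = "banana";
        System.out.println(swap(s, 'a', 'o'));      // bonono: every occurrence is replaced
        System.out.println(swap(s, 'x', 'y') == s); // true: no occurrence, this is returned
        System.out.println(swap(s, 'a', 'a') == s); // true: oldChar == newChar, nothing to do
    }
}
```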


matches method

public boolean matches(String regex) {
    return Pattern.matches(regex, this);
}


contains method

public boolean contains(CharSequence s) {
    return indexOf(s.toString()) > -1;
}


replaceAll method

public String replaceAll(String regex, String replacement) {
    return Pattern.compile(regex).matcher(this).replaceAll(replacement);
}
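Both matches and replaceAll delegate to java.util.regex, so their first argument is a regular expression, not a literal. A sketch of the practical consequences; RegexDemo and its methods are illustrative names.

```java
class RegexDemo {
    static boolean isIsoDate(String s) {
        // matches() must cover the whole input, not just a part of it
        return s.matches("\\d{4}-\\d{2}-\\d{2}");
    }

    static String dotsToDashes(String s) {
        // The dot is escaped; an unescaped "." would match every character
        return s.replaceAll("\\.", "-");
    }

    public static void main(String[] args) {
        System.out.println(isIsoDate("2024-01-15"));         // true
        System.out.println("2024-01-15".matches("\\d{4}"));  // false: only a prefix matches
        System.out.println(dotsToDashes("a.b.c"));           // a-b-c
        System.out.println("a.b.c".replaceAll(".", "-"));    // -----
        System.out.println("a.b.c".replace(".", "-"));       // a-b-c (literal replacement)
    }
}
```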


split method

public String[] split(String regex, int limit) {
    /* fastpath if the regex is a
     (1)one-char String and this character is not one of the
        RegEx's meta characters ".$|()[{^?*+\\", or
     (2)two-char String and the first char is the backslash and
        the second is not the ascii digit or ascii letter.
     */
    char ch = 0;
    if (((regex.value.length == 1 &&
         ".$|()[{^?*+\\".indexOf(ch = regex.charAt(0)) == -1) ||
         (regex.length() == 2 &&
          regex.charAt(0) == '\\' &&
          (((ch = regex.charAt(1))-'0')|('9'-ch)) < 0 &&
          ((ch-'a')|('z'-ch)) < 0 &&
          ((ch-'A')|('Z'-ch)) < 0)) &&
        (ch < Character.MIN_HIGH_SURROGATE ||
         ch > Character.MAX_LOW_SURROGATE))
    {
        int off = 0;
        int next = 0;
        boolean limited = limit > 0;
        ArrayList<String> list = new ArrayList<>();
        while ((next = indexOf(ch, off)) != -1) {
            if (!limited || list.size() < limit - 1) {
                list.add(substring(off, next));
                off = next + 1;
            } else {    // last one
                //assert (list.size() == limit - 1);
                list.add(substring(off, value.length));
                off = value.length;
                break;
            }
        }
        // If no match was found, return this
        if (off == 0)
            return new String[]{this};

        // Add the remaining segment when there is no limit (limit <= 0)
        // or the list still holds fewer than limit entries
        if (!limited || list.size() < limit)
            list.add(substring(off, value.length));

        // Construct result
        int resultSize = list.size();
        if (limit == 0) {
            // Drop trailing empty strings
            while (resultSize > 0 && list.get(resultSize - 1).length() == 0) {
                resultSize--;
            }
        }
        String[] result = new String[resultSize];
        return list.subList(0, resultSize).toArray(result);
    }
    return Pattern.compile(regex).split(this, limit);
}

  • split(String regex) simply calls split(regex, 0).
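The effect of the limit argument, and of the regex interpretation of the separator, is easiest to see on concrete inputs. SplitDemo and show are illustrative names in this sketch.

```java
import java.util.Arrays;

class SplitDemo {
    static String show(String s, String regex, int limit) {
        return Arrays.toString(s.split(regex, limit));
    }

    public static void main(String[] args) {
        System.out.println(show("a,b,,", ",", 0));   // [a, b]       trailing empty strings dropped
        System.out.println(show("a,b,,", ",", -1));  // [a, b, , ]   trailing empty strings kept
        System.out.println(show("a,b,,", ",", 2));   // [a, b,,]     at most 2 pieces
        System.out.println(show("abc", ",", 0));     // [abc]        no match returns the whole string
        System.out.println(show("a.b", "\\.", 0));   // [a, b]       escaped dot
        System.out.println("a.b".split(".").length); // 0: "." is a regex matching every character
    }
}
```

The separators "," and "\\." both satisfy the fastpath conditions in the listing, while "." is a regex metacharacter and goes through Pattern.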

trim method

public String trim() {
    int len = value.length;
    int st = 0;
    char[] val = value;    /* avoid getfield opcode */

    // While the start index is within bounds and the char there is <= ' ',
    // move the start index forward by one
    while ((st < len) && (val[st] <= ' ')) {
        st++;
    }
    // While the start index is within bounds and the last char is <= ' ',
    // move the end index back by one
    while ((st < len) && (val[len - 1] <= ' ')) {
        len--;
    }
    return ((st > 0) || (len < value.length)) ? substring(st, len) : this;
}
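Since the test is val[i] <= ' ', trim removes every leading and trailing char up to U+0020, including control characters, but leaves other Unicode whitespace alone. A sketch with illustrative names (TrimDemo, trimmed):

```java
class TrimDemo {
    static String trimmed(String s) {
        return s.trim();
    }

    public static void main(String[] args) {
        String plain = "hi";
        System.out.println(trimmed(" \t hi \n"));                // hi
        System.out.println(trimmed("\u0000hi").length());        // 2: NUL is <= ' ' and is removed
        System.out.println(trimmed("\u3000hi\u3000").length());  // 4: ideographic space is kept
        System.out.println(trimmed(plain) == plain);             // true: nothing to trim, this is returned
    }
}
```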





