offsetByCodePoints()与codePointAt()

最新推荐文章于 2021-06-12 09:33:58 发布

dajiangqingzhou

最新推荐文章于 2021-06-12 09:33:58 发布

阅读量3.6k

点赞数 7

分类专栏： Java 文章标签： Java api lang

本文链接：https://blog.csdn.net/dajiangqingzhou/article/details/77605965

版权

Java 专栏收录该内容

32 篇文章 1 订阅

订阅专栏

在Java api文档中对这两个方法的解释为

offsetByCodePoints

public int offsetByCodePoints(int index,
                              int codePointOffset)
返回此 String 中从给定的 index 处偏移 codePointOffset 个代码点的索引。文本范围内由 index 和 codePointOffset 给定的未配对代理项各计为一个代码点。
参数：
index - 要偏移的索引
codePointOffset - 代码点中的偏移量
返回：
String 的索引
抛出：
IndexOutOfBoundsException - 如果 index 为负或大于此 String 的长度；或者 codePointOffset 为正，且以 index 开头子字符串的代码点比 codePointOffset 少；如果 codePointOffset 为负，且 index 前面子字符串的代码点比 codePointOffset 的绝对值少。

这里的index就是你指定的任意第 i 个码点，假如你想知道距离第 i 个码点 x 个码点（x可正可负）是相对于第0个码点第几个码点，则可以用offsetByCodePoints( i , x )得到的你想要的值。比方说

String astring = "abcdABCD";
int cpcount = astring.offsetByCodePoints(7,-4)

cpcount的值应该是3（即表示d距离a 3个码点，也就是距离第0个位置3个位置）；

codePointAt

public int codePointAt(int index)
返回指定索引处的字符（Unicode 代码点）。索引引用 char 值（Unicode 代码单元），其范围从 0 到 length() - 1。
如果给定索引指定的 char 值属于高代理项范围，则后续索引小于此 String 的长度；如果后续索引处的 char 值属于低代理项范围，则返回该代理项对相应的增补代码点。否则，返回给定索引处的 char 值。

参数：
index - char 值的索引
返回：
index 处字符的代码点值
抛出：
IndexOutOfBoundsException - 如果 index 参数为负或小于此字符串的长度。

还是以上面那个例子，d距离a，有3个位置，这个3通过offsetByCodePoints()得到，并且就可以看成是一个索引值，通过他你就能找到对应位置上是d

String astring = "abcdABCD";
int cpcount = astring.offsetByCodePoints(7,-4)
int dddunicode = atest.codePointAt(cpcount);

dddunicode = 100；这个其实是 ASCII码值

为了更好地理解这两个方法，下面再举例：

public class Program2 
{
	public static void main(String[] args)
	{
		String atest = "abcABCa";
		if(atest.equals("abcABCa"))
		{
			int cp1 = atest.offsetByCodePoints(0,0);
			int cp4 = atest.offsetByCodePoints(0, 3);
			int cp7 = atest.offsetByCodePoints(0, 6);
			System.out.println("cp1 is "+cp1+"  cp2 is "+ cp4+"  cp7 is "+cp7);
			int cp1unicode = atest.codePointAt(cp1);
			int cp4unicode = atest.codePointAt(cp4);						int cp7unicode = atest.codePointAt(cp7);
 System.out.println(cp1unicode+" "+cp4unicode+" "+cp7unicode);}}}



结果是：

cp1 is 0  cp2 is 3  cp7 is 6
97 65 97

dajiangqingzhou

关注

7
点赞
踩
14

收藏

觉得还不错? 一键收藏
3
评论
offsetByCodePoints()与codePointAt()

在Java api文档中对这两个方法的解释为offsetByCodePointspublic int offsetByCodePoints(int index, int codePointOffset)返回此 String 中从给定的 index 处偏移 codePointOffset 个代码点的索引。文本范围内由 index
复制链接

扫一扫

专栏目录