Java判断中文符号 — Character.UnicodeBlock中的cjk说明

版权声明:本文为博主原创文章,遵循 CC 4.0 by-sa 版权协议,转载请附上原文出处链接和本声明。
本文链接:https://blog.csdn.net/yangdan1025/article/details/86586578

01. Java判断中文符号

private static final boolean isChinese(char c) {   
    Character.UnicodeBlock ub = Character.UnicodeBlock.of(c); 
    //获取此字符的UniCodeBlock 
    if (ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS  
            || ub == Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS  
            || ub == Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A  
            || ub == Character.UnicodeBlock.GENERAL_PUNCTUATION  
            || ub == Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION  
            || ub == Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS) {  
        return true;  
    }  
    return false;  
}  

02. Character.UnicodeBlock中的cjk说明

CJK的意思是“Chinese,Japanese,Korea”的简写 ,实际上就是指中日韩三国的象形文字的Unicode编码 。
jdk API中Character.UnicodeBlock中的cjk说明:
在这里插入图片描述

  • Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS : 4E00-9FBF:CJK 统一表意符号
  • Character.UnicodeBlock.CJK_COMPATIBILITY_IDEOGRAPHS :F900-FAFF:CJK 兼容象形文字
  • Character.UnicodeBlock.CJK_UNIFIED_IDEOGRAPHS_EXTENSION_A :3400-4DBF:CJK 统一表意符号扩展A
  • Character.UnicodeBlock.GENERAL_PUNCTUATION :2000-206F:常用标点
  • Character.UnicodeBlock.CJK_SYMBOLS_AND_PUNCTUATION :3000-303F:CJK 符号和标点
  • Character.UnicodeBlock.HALFWIDTH_AND_FULLWIDTH_FORMS :FF00-FFEF:半角及全角形式

小注:
Character.isLetter( c ):判断字符是否是字母
Character.isDigit( c ):判断字符是否是数字

展开阅读全文

没有更多推荐了,返回首页