项目开发过程中有很多场景使用到汉语转化成拼音、获取ASCII码等的需求,以避免字符集编码不一致造成中文乱码问题。下面开始分享:
1、引入jar包:
<dependency>
<groupId>net.sourceforge.jexcelapi</groupId>
<artifactId>jxl</artifactId>
<version>2.6.12</version>
</dependency>
<dependency>
<groupId>com.belerweb</groupId>
<artifactId>pinyin4j</artifactId>
<version>2.5.0</version>
</dependency>
2、工具类伪代码:
获取汉语拼音:
/**
* 获取中文的拼音
* @param src
* @return
*/
public static String getPingYin(String src) {
char[] t1 = null;
t1 = src.toCharArray();
String[] t2 = new String[t1.length];
HanyuPinyinOutputFormat t3 = new HanyuPinyinOutputFormat();
t3.setCaseType(HanyuPinyinCaseType.LOWERCASE);
t3.setToneType(HanyuPinyinToneType.WITHOUT_TONE);
t3.setVCharType(HanyuPinyinVCharType.WITH_V);
String t4 = "";
int t0 = t1.length;
try {
for (int i = 0; i < t0; i++) {
if (Character.toString(t1[i]).matches(
"[\\u4E00-\\u9FA5]+")) {
t2 = PinyinHelper.toHanyuPinyinStringArray(t1[i], t3);
t4 += t2[0];
} else
t4 += Character.toString(t1[i]);
}
return t4;
} catch (BadHanyuPinyinOutputFormatCombination e1) {
e1.printStackTrace();
}
return t4;
}
获取汉语拼音的首字母:
/**
* 获取汉语拼音的首字母
* @param str
* @return
*/
public static String getPinYinHead(String str) {
String convert = "";
for (int j = 0; j < str.length(); j++) {
char word = str.charAt(j);
String[] pinyinArray = PinyinHelper.toHanyuPinyinStringArray(word);
if (pinyinArray != null) {
convert += pinyinArray[0].charAt(0);
} else {
convert += word;
}
}
return convert;
}
获取中文的ASCII码:
/**
* 获取中文的ASCII码
* @param cnStr
* @return
*/
public static String getCnStrASCII(String cnStr) {
StringBuffer strBuf = new StringBuffer();
byte[] bGBK = cnStr.getBytes();
for (int i = 0; i < bGBK.length; i++) {
strBuf.append(Integer.toHexString(bGBK[i] & 0xff));
}
return strBuf.toString();
}
3、客户端测试代码:
String pingYin = ChineseSpellingUtil.getPingYin("南道");
String pingYin1 = ChineseSpellingUtil.getCnStrASCII("南道");
String pingYin2 = ChineseSpellingUtil.getPinYinHead("南道");
log.info("[{}]:[{}]:[{}]",pingYin,pingYin1,pingYin2);
4、测试结果:
[nandao]:[e58d97e98193]:[nd]
到此,汉语转化成拼音的使用分享完毕。