Java String Applications: Converting String Encodings (Repost)


weixin_39667797


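I. Key technical points:

1. Commonly used character encodings include US-ASCII, ISO-8859-1, UTF-8, UTF-16BE, UTF-16LE, UTF-16, GBK, and GB2312. GBK and GB2312 are designed for encoding Chinese text.

2. String's getBytes method encodes a string into a byte array. Its argument names the charset to encode with; if no charset is given, the platform default charset is used.

3. The String constructor "String(byte[] bytes, String charsetName)" decodes a byte array with the named charset and builds a String object from the result.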
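To make points 2 and 3 concrete, here is a minimal sketch of encoding and decoding with an explicit charset. The class name EncodeDecodeDemo and the helper roundTripUtf8 are illustrative names, not part of the original article, and the sample text is written with Unicode escapes so the result does not depend on the source file's encoding.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; the name is not from the original article.
final class EncodeDecodeDemo {

    /** Encodes text to UTF-8 bytes, then decodes the bytes with the same charset. */
    static String roundTripUtf8(String text) {
        byte[] bytes = text.getBytes(StandardCharsets.UTF_8); // chars -> bytes (encode)
        return new String(bytes, StandardCharsets.UTF_8);     // bytes -> chars (decode)
    }

    public static void main(String[] args) {
        String text = "\u4e2d\u6587\u7684"; // the three Chinese characters used in the article's test string
        // The same three characters occupy a different number of bytes per charset.
        System.out.println(text.getBytes(StandardCharsets.UTF_8).length);    // 9
        System.out.println(text.getBytes(StandardCharsets.UTF_16BE).length); // 6
        // US-ASCII cannot represent these characters, so each one is encoded as '?'.
        System.out.println(text.getBytes(StandardCharsets.US_ASCII).length); // 3
        System.out.println(roundTripUtf8(text).equals(text));                // true
    }
}
```

Passing a Charset constant instead of a charset name also avoids the checked UnsupportedEncodingException, which is one reason this sketch uses StandardCharsets.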

II. Example:

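package book.String;

import java.io.UnsupportedEncodingException;

/**
 * Converts the encoding of strings.
 *
 * @author joe
 */
public class ChangeCharset {

    /** 7-bit ASCII, also known as ISO646-US, the Basic Latin block of the Unicode character set */
    public static final String US_ASCII = "US-ASCII";

    /** ISO Latin Alphabet No. 1, also known as ISO-LATIN-1 */
    public static final String ISO_8859_1 = "ISO-8859-1";

    /** 8-bit UCS Transformation Format */
    public static final String UTF_8 = "UTF-8";

    /** 16-bit UCS Transformation Format, big-endian byte order (high-order byte at the lowest address) */
    public static final String UTF_16BE = "UTF-16BE";

    /** 16-bit UCS Transformation Format, little-endian byte order (low-order byte at the lowest address) */
    public static final String UTF_16LE = "UTF-16LE";

    /** 16-bit UCS Transformation Format, byte order identified by an optional byte order mark */
    public static final String UTF_16 = "UTF-16";

    /** Extended Chinese character set */
    public static final String GBK = "GBK";

    /** Simplified Chinese character set (GBK is a superset of it) */
    public static final String GB2312 = "GB2312";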

    /** Converts the string's encoding to US-ASCII. */
    public String toASCII(String str) throws UnsupportedEncodingException {
        return this.changeCharset(str, US_ASCII);
    }

    /** Converts the string's encoding to ISO-8859-1. */
    public String toISO_8859_1(String str) throws UnsupportedEncodingException {
        return this.changeCharset(str, ISO_8859_1);
    }

    /** Converts the string's encoding to UTF-8. */
    public String toUTF_8(String str) throws UnsupportedEncodingException {
        return this.changeCharset(str, UTF_8);
    }

    /** Converts the string's encoding to UTF-16BE. */
    public String toUTF_16BE(String str) throws UnsupportedEncodingException {
        return this.changeCharset(str, UTF_16BE);
    }

    /** Converts the string's encoding to UTF-16LE. */
    public String toUTF_16LE(String str) throws UnsupportedEncodingException {
        return this.changeCharset(str, UTF_16LE);
    }

    /** Converts the string's encoding to UTF-16. */
    public String toUTF_16(String str) throws UnsupportedEncodingException {
        return this.changeCharset(str, UTF_16);
    }

    /** Converts the string's encoding to GBK. */
    public String toGBK(String str) throws UnsupportedEncodingException {
        return this.changeCharset(str, GBK);
    }

    /** Converts the string's encoding to GB2312. */
    public String toGB2312(String str) throws UnsupportedEncodingException {
        return this.changeCharset(str, GB2312);
    }

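    /**
     * Implementation of the string encoding conversion.
     *
     * @param str        the string to convert
     * @param newCharset the target charset
     */
    public String changeCharset(String str, String newCharset) throws UnsupportedEncodingException {
        if (str != null) {
            // Encode the string with the platform default charset. This is system dependent;
            // on Chinese Windows the default has traditionally been GBK, a superset of GB2312.
            byte[] bs = str.getBytes();
            // Decode the bytes with the new charset to build the result string.
            return new String(bs, newCharset);
        }
        return null;
    }

    /**
     * Implementation of the string encoding conversion.
     *
     * @param str        the string to convert
     * @param oldCharset the source charset
     * @param newCharset the target charset
     */
    public String changeCharset(String str, String oldCharset, String newCharset) throws UnsupportedEncodingException {
        if (str != null) {
            // Encode the string with the source charset.
            byte[] bs = str.getBytes(oldCharset);
            // Decode the bytes with the target charset.
            return new String(bs, newCharset);
        }
        return null;
    }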

    public static void main(String[] args) throws UnsupportedEncodingException {
        ChangeCharset test = new ChangeCharset();
        String str = "This is a 中文的 String!";
        System.out.println("str: " + str);
        String gbk = test.toGBK(str);
        System.out.println("Converted to GBK: " + gbk);
        System.out.println();
        String ascii = test.toASCII(str);
        System.out.println("Converted to US-ASCII: " + ascii);
        System.out.println();
        String iso88591 = test.toISO_8859_1(str);
        System.out.println("Converted to ISO-8859-1: " + iso88591);
        System.out.println();
        gbk = test.changeCharset(iso88591, ISO_8859_1, GBK);
        System.out.println("ISO-8859-1 string converted back to GBK: " + gbk);
        System.out.println();
        String utf8 = test.toUTF_8(str);
        System.out.println();
        System.out.println("Converted to UTF-8: " + utf8);
        String utf16be = test.toUTF_16BE(str);
        System.out.println("Converted to UTF-16BE: " + utf16be);
        gbk = test.changeCharset(utf16be, UTF_16BE, GBK);
        System.out.println("UTF-16BE string converted back to GBK: " + gbk);
        System.out.println();
        String utf16le = test.toUTF_16LE(str);
        System.out.println("Converted to UTF-16LE: " + utf16le);
        gbk = test.changeCharset(utf16le, UTF_16LE, GBK);
        System.out.println("UTF-16LE string converted back to GBK: " + gbk);
        System.out.println();
        String utf16 = test.toUTF_16(str);
        System.out.println("Converted to UTF-16: " + utf16);
        String gb2312 = test.changeCharset(utf16, UTF_16, GB2312);
        System.out.println("UTF-16 string converted to GB2312: " + gb2312);
    }
}

Output:

str: This is a 中文的 String!
Converted to GBK: This is a 中文的 String!

Converted to US-ASCII: This is a ?????? String!

Converted to ISO-8859-1: This is a ?????? String!

ISO-8859-1 string converted back to GBK: This is a 中文的 String!

Converted to UTF-8: This is a ????? String!
Converted to UTF-16BE: 周楳?猠愠????瑲楮朡
UTF-16BE string converted back to GBK: This is a 中文的 String!

Converted to UTF-16LE: 桔獩椠?????匠牴湩Ⅷ
UTF-16LE string converted back to GBK: This is a 中文的 String!

Converted to UTF-16: 周楳?猠愠????瑲楮朡
UTF-16 string converted to GB2312: ?This is a 中文的 String!
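The output above raises two questions: why the detour through ISO-8859-1 can be undone, and where the stray leading "?" in the last line comes from. The following sketch is one way to explore both. It assumes UTF-8 as a stand-in for the platform default charset so that the result is the same on every machine, and the class and method names (RoundTripDemo, viaLatin1, viaAscii) are illustrative rather than taken from the article.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo class. UTF-8 stands in for the platform default charset
// (an assumption made here so the result does not depend on the machine).
final class RoundTripDemo {

    /** Decodes UTF-8 bytes as ISO-8859-1, then reverses the step. Lossless: every byte maps to one char. */
    static String viaLatin1(String text) {
        byte[] bytes = text.getBytes(StandardCharsets.UTF_8);
        String garbled = new String(bytes, StandardCharsets.ISO_8859_1);
        return new String(garbled.getBytes(StandardCharsets.ISO_8859_1), StandardCharsets.UTF_8);
    }

    /** Same detour through US-ASCII. Lossy: bytes above 0x7F decode to U+FFFD and cannot be recovered. */
    static String viaAscii(String text) {
        byte[] bytes = text.getBytes(StandardCharsets.UTF_8);
        String garbled = new String(bytes, StandardCharsets.US_ASCII);
        return new String(garbled.getBytes(StandardCharsets.US_ASCII), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String text = "This is a \u4e2d\u6587\u7684 String!";
        System.out.println(viaLatin1(text).equals(text)); // true
        System.out.println(viaAscii(text).equals(text));  // false
        // Encoding with "UTF-16" prepends a byte order mark (0xFE 0xFF), which is a
        // likely source of the leading "?" in the last output line of the article.
        byte[] utf16 = "A".getBytes(StandardCharsets.UTF_16);
        System.out.println(utf16.length); // 4: two BOM bytes plus two bytes for 'A'
    }
}
```

UTF-16BE and UTF-16LE do not write a byte order mark when encoding, which is consistent with the clean results of those two round trips in the output.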

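III. Source analysis:

Changing a string's encoding takes two steps:

1. Call String's getBytes method to encode the string into a byte array. A byte array carries no information about the encoding that produced it; it is only a sequence of bytes.

2. Construct a new String object from the byte array and the new charset. The constructor decodes the bytes with that charset, so the text survives only if the charset matches the one used in step 1 or, like ISO-8859-1, maps every byte to a character and can therefore be reversed. Otherwise the result is garbled, as the "?" characters in the output above show.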
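The two steps above reinterpret bytes rather than translate them. When the actual goal is to turn bytes stored in one encoding into bytes in another, a common approach is to decode with the source charset and then encode with the target charset. The sketch below illustrates this under two assumptions: the helper name transcode is hypothetical, and the runtime is assumed to provide the GBK charset (a full JDK normally does).

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper; not part of the original ChangeCharset class.
final class Transcoder {

    /** Re-encodes bytes from one charset into another by decoding to a String first. */
    static byte[] transcode(byte[] input, Charset from, Charset to) {
        String text = new String(input, from); // decode with the charset that produced the bytes
        return text.getBytes(to);              // encode with the target charset
    }

    public static void main(String[] args) {
        Charset gbk = Charset.forName("GBK"); // assumes the runtime provides GBK
        // The GBK encoding of the two characters U+4E2D U+6587.
        byte[] gbkBytes = {(byte) 0xD6, (byte) 0xD0, (byte) 0xCE, (byte) 0xC4};
        byte[] utf8Bytes = transcode(gbkBytes, gbk, StandardCharsets.UTF_8);
        System.out.println(utf8Bytes.length); // 6: three UTF-8 bytes per character
    }
}
```

The String in the middle is the same text in both cases; only the byte representation on either side changes.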

Posted on 2012-02-16 20:48 by fly. Category: Java study
