java解析十六进制编码字符串

最新推荐文章于 2024-08-25 04:07:55 发布

始于千里之外

最新推荐文章于 2024-08-25 04:07:55 发布

阅读量1.6k

点赞数

文章标签：十六进制字符串 hbase shell中文乱码

本文链接：https://blog.csdn.net/u013816347/article/details/130204443

版权

在使用hbase shell等命令时，若输出的内容包含中文，经常会出现乱码等现象，我们可以在命令后面加上 {formatter => 'tostring'}来处理例如：

scan 'test', {formatter => 'tostring'}那么，在java中如何来解析这些字符串，使之能正常显示中文呢？可以参考下面的代码：

带解析字符串为：

{
	"first_name":"\xE4\xB8\x89\xE5\x8F\xB6\xE4\xB8\x9C\xE8\xB7\xAF",
	"second_name":"\xE4\xB8\x89\xE5\x8F\xB6\xE4\xB8\x9C\xE8\xB7\xAF"
}


import java.io.UnsupportedEncodingException;
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class Demo {
    private static final Pattern UNICODE_PATTERN = Pattern.compile("\\\\x([0-9a-fA-F]{2})");

    public static String decodeJson(String json) {
        Matcher matcher = UNICODE_PATTERN.matcher(json);
        StringBuffer sb = new StringBuffer();
        while (matcher.find()) {
            char c = (char) Integer.parseInt(matcher.group(1), 16);
            matcher.appendReplacement(sb, String.valueOf(c));
        }
        matcher.appendTail(sb);
        return new String(sb.toString().getBytes(StandardCharsets.ISO_8859_1), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws UnsupportedEncodingException {
        String json = "{\"first_name\":\"\\xE4\\xB8\\x89\\xE5\\x8F\\xB6\\xE4\\xB8\\x9C\\xE8\\xB7\\xAF\",\"second_name\":\"\\xE4\\xB8\\x89\\xE5\\x8F\\xB6\\xE4\\xB8\\x9C\\xE8\\xB7\\xAF\"}";
        String decodedStr = decodeJson(json);
        System.out.println(decodedStr);  // 输出解码后的字符串
    }
}

输出结果为：

{
  "first_name": "三叶东路",
  "second_name": "三叶东路"
}

始于千里之外

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫