Hadoop的Text类型实现

最新推荐文章于 2022-09-12 17:50:12 发布

yongjian_luo

最新推荐文章于 2022-09-12 17:50:12 发布

阅读量1.9k

点赞数

分类专栏： Hadoop相关

本文链接：https://blog.csdn.net/yongjian_luo/article/details/16808185

版权

Hadoop相关专栏收录该内容

70 篇文章 1 订阅

订阅专栏

Hadoop的Text类型是将字符串用UTF-8编码转换成bytes位数组。

/**

* Converts the provided String to bytes using the
* UTF-8 encoding. If <code>replace</code> is true, then
* malformed input is replaced with the
* substitution character, which is U+FFFD. Otherwise the
* method throws a MalformedInputException.
* @return ByteBuffer: bytes stores at ByteBuffer.array()
* and length is ByteBuffer.limit()
*/
public static ByteBuffer encode(String string, boolean replace)
throws CharacterCodingException {
CharsetEncoder encoder = ENCODER_FACTORY.get();
if (replace) {
encoder.onMalformedInput(CodingErrorAction.REPLACE);
encoder.onUnmappableCharacter(CodingErrorAction.REPLACE);
}
ByteBuffer bytes =
encoder.encode(CharBuffer.wrap(string.toCharArray()));
if (replace) {
encoder.onMalformedInput(CodingErrorAction.REPORT);
encoder.onUnmappableCharacter(CodingErrorAction.REPORT);
}
return bytes;
}

yongjian_luo

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
Hadoop的Text类型实现

Hadoop的Text类型是将字符串用UTF-8编码转换成bytes位数组。 /** * Converts the provided String to bytes using the * UTF-8 encoding. If replace is true, then * malformed input is replaced with the *
复制链接

扫一扫