java获取inputstream,Java:如何从inputStream获取编码?

I want get the encoding from a stream.

1st method - to use the InputStreamReader.

But it always return OS encode.

InputStreamReader reader = new InputStreamReader(new FileInputStream("aa.rar"));

System.out.println(reader.getEncoding());

output:GBK

2nd method - to use the UniversalDetector.

But it always return null.

FileInputStream input = new FileInputStream("aa.rar");

UniversalDetector detector = new UniversalDetector(null);

byte[] buf = new byte[4096];

int nread;

while ((nread = input.read(buf)) > 0 && !detector.isDone()) {

detector.handleData(buf, 0, nread);

}

// (3)

detector.dataEnd();

// (4)

String encoding = detector.getDetectedCharset();

if (encoding != null) {

System.out.println("Detected encoding = " + encoding);

} else {

System.out.println("No encoding detected.");

}

// (5)

detector.reset();

output:null

How can I get the right? :(

解决方案

Let's resume the situation:

InputStream delivers bytes

*Readers deliver chars in some encoding

new InputStreamReader(inputStream) uses the operating system encoding

new InputStreamReader(inputStream, "UTF-8") uses the given encoding (here UTF-8)

So one needs to know the encoding before reading. You did everything right using first a charset detecting class.

Reading http://code.google.com/p/juniversalchardet/ it should handle UTF-8 and UTF-16.

You might use the editor JEdit to verify the encoding, and see whether there is some problem.

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值