Parsing Chinese with POI: How to parse UTF-8 characters in an Excel file using POI

I have been using POI to parse XLS and XLSX files successfully. However, I am unable to correctly extract non-ASCII text, such as Chinese or Japanese characters, from an Excel spreadsheet. I have figured out how to extract data from a UTF-8 encoded CSV or tab-delimited file, but have had no luck with the Excel file. Can anyone help?

(Edit: Code snippet from comments)

HSSFSheet sheet = workbook.getSheet(worksheet);
HSSFEvaluationWorkbook ewb = HSSFEvaluationWorkbook.create(workbook);

while (rowCtr <= lastRow && !rowBreakOut)
{
    Row row = sheet.getRow(rowCtr); //rows.next();

    // The loop header was cut off after "col" in the original post;
    // the bound "lastCell" is an assumed name, not the asker's.
    for (int col = firstCell; col < lastCell; col++) {
        Cell cell;
        cell = row.getCell(col, Row.RETURN_BLANK_AS_NULL);

        if (ctype == Cell.CELL_TYPE_STRING) {
            sValue = cell.getStringCellValue();
            log.warn("String value = " + sValue);

            // Percent-encode the UTF-8 bytes to inspect what was actually read
            String encoded = URLEncoder.encode(sValue, "UTF-8");
            log.warn("URL-encoded with UTF-8: " + encoded);
            ....

Solution

I had the same problem while extracting Persian text from an Excel file. I was using Eclipse, and simply going to Project -> Properties and changing the "text file encoding" to UTF-8 solved the problem.
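
The fix above suggests the text was read correctly and only garbled when it was displayed. A minimal sketch of how one might check that, assuming the value returned by cell.getStringCellValue() is already a correct Java String: the class name EncodingCheck and its helper methods are hypothetical, POI is left out so the snippet runs on the JDK alone, and the sample text is written with Unicode escapes so the result does not depend on the source file's encoding.

```java
import java.io.PrintStream;
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingCheck {

    // Percent-encodes the UTF-8 bytes of a string, like the URLEncoder call
    // in the question. The output is pure ASCII, so it survives any console.
    public static String urlEncodeUtf8(String value) {
        try {
            return URLEncoder.encode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            // UTF-8 is required on every JVM, so this should not happen.
            throw new IllegalStateException(e);
        }
    }

    // Simulates an output sink with the given charset: encode, then decode.
    // Characters the charset cannot represent come back as '?'.
    public static String roundTrip(String value, Charset charset) {
        return new String(value.getBytes(charset), charset);
    }

    public static void main(String[] args) throws Exception {
        // Stand-in for cell.getStringCellValue(); these escapes spell "Chinese" in Chinese.
        String sValue = "\u4e2d\u6587";

        // If this prints %E4%B8%AD%E6%96%87, the string itself is intact.
        System.out.println(urlEncodeUtf8(sValue));

        // A Latin-1 sink cannot represent the text and yields "??".
        System.out.println(roundTrip(sValue, StandardCharsets.ISO_8859_1));

        // Writing through an explicitly UTF-8 stream avoids relying on the
        // platform or IDE default encoding.
        PrintStream out = new PrintStream(System.out, true, "UTF-8");
        out.println(sValue);
    }
}
```

If the percent-encoded form is correct but the plain log line shows question marks or mojibake, the problem most likely lies in the console, log file, or IDE encoding rather than in how POI read the cell.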
