Reading the contents of TXT and Word files in Java and displaying them on a page
Note: Word 2003 files (.doc) and Word 2007 or later files (.docx) must be read with different JAR packages!
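One way to act on this note is to pick the extractor from the file extension. The sketch below is only an illustration: `WordFormat` and `extractorFor` are hypothetical names, and it assumes Apache POI, where `WordExtractor` (HWPF, typically in the poi-scratchpad JAR) handles .doc and `XWPFWordExtractor` (XWPF, typically in the poi-ooxml JAR) handles .docx. It returns class names as strings so that it runs without POI on the classpath.

```java
import java.util.Locale;

final class WordFormat {
    // Hypothetical helper: returns the name of the Apache POI extractor class
    // that matches the file extension. Assumes .doc -> HWPF and .docx -> XWPF.
    static String extractorFor(String fileName) {
        String lower = fileName.toLowerCase(Locale.ROOT);
        if (lower.endsWith(".docx")) {
            return "org.apache.poi.xwpf.extractor.XWPFWordExtractor";
        }
        if (lower.endsWith(".doc")) {
            return "org.apache.poi.hwpf.extractor.WordExtractor";
        }
        throw new IllegalArgumentException("Not a Word file: " + fileName);
    }

    public static void main(String[] args) {
        System.out.println(extractorFor("report.doc"));
        System.out.println(extractorFor("report.docx"));
    }
}
```

The extension is only a hint. A renamed file can still be in the other format, so real code may also want to catch the exception the extractor throws on a mismatch.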
1. TXT files:
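import java.io.BufferedReader;
import java.io.FileReader;
// `file` is the java.io.File (or path string) of the text file to read
StringBuilder texts = new StringBuilder();
// FileReader decodes the file with the platform default charset
BufferedReader br = new BufferedReader(new FileReader(file));
String line = null;
// readLine() strips the line terminator, so the lines are joined without line breaks
while ((line = br.readLine()) != null) {
    texts.append(line);
}
br.close();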
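Note: with the approach above, Chinese text can come out garbled, because FileReader decodes the file with the platform default charset, which may not match the file's actual encoding.
Fix: specify the encoding explicitly when creating the reader.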
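import java.io.BufferedReader;
import java.io.FileInputStream;
import java.io.InputStreamReader;
StringBuilder texts = new StringBuilder();
// Decode the bytes as UTF-8 explicitly instead of relying on the platform default charset
InputStreamReader isr = new InputStreamReader(new FileInputStream(file), "UTF-8");
BufferedReader br = new BufferedReader(isr);
String line = null;
while ((line = br.readLine()) != null) {
    texts.append(line);
}
br.close();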
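The encoding fix above can be wrapped in a small reusable method. The following is a minimal sketch, not a definitive implementation: `TextFileReader` and `readText` are hypothetical names, the file is assumed to be UTF-8 in the demo, and a line break is appended after each line (a deliberate difference from the snippet above, which joins lines without breaks). try-with-resources closes the reader even if reading fails.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

final class TextFileReader {
    // Hypothetical helper: reads the whole file using the given charset
    // and returns its lines, each followed by '\n'.
    static String readText(Path file, Charset charset) throws IOException {
        StringBuilder texts = new StringBuilder();
        try (BufferedReader br = new BufferedReader(
                new InputStreamReader(Files.newInputStream(file), charset))) {
            String line;
            while ((line = br.readLine()) != null) {
                texts.append(line).append('\n');
            }
        }
        return texts.toString();
    }

    public static void main(String[] args) throws IOException {
        // Write a small UTF-8 sample file, then read it back.
        Path tmp = Files.createTempFile("sample", ".txt");
        Files.write(tmp, "\u4f60\u597d\nworld\n".getBytes(StandardCharsets.UTF_8));
        System.out.print(readText(tmp, StandardCharsets.UTF_8));
        Files.delete(tmp);
    }
}
```

Passing a `Charset` constant such as `StandardCharsets.UTF_8` instead of the string "UTF-8" avoids a misspelled encoding name failing only at run time.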
2. Word 2003 (.doc format):
import java.io.FileInputStream;
import org.apache.poi.hwpf.extractor.WordExtractor;
try {
FileInputStream inputStream =