一、问题描述
有个需求就是读取word文档里的内容,使用到了poi这个包,代码如下:
/**
* 读取doc文件内容
*
* @param fs 想要读取的文件对象
* @return 返回文件内容
* @throws IOException
*/
public static String doc2String(BufferedInputStream fs) throws IOException {
String text = "";
if (FileMagic.valueOf(fs) == FileMagic.OLE2) {
WordExtractor ex = new WordExtractor(fs);
text = ex.getText();
ex.close();
} else if (FileMagic.valueOf(fs) == FileMagic.OOXML) {
XWPFDocument doc = new XWPFDocument(fs);
XWPFWordExtractor extractor = new XWPFWordExtractor(doc);
text = extractor.getText();
extractor.close();
}
return text;
}
public static String doc2String(File file) throws IOException {
return doc2String(new BufferedInputStream(new FileInputStream(file)));
}
public static void main(String[] args) {
File file = new File("D:\\xxx\\xxx\\1\\file\\2021\\04\\28\\34a58ac4faa4222712a4329ac60f34f9\\34a58ac4faa4222712a4329ac60f34f9.docx");
try {
System.out.println(doc2String(file));
} catch (IOException e) {
e.printStackTrace();
}
}
运行报错:
Exception in thread "main" java.lang.NoSuchMethodError: org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTRImpl.getXmlObjectArray(Ljavax/xml/namespace/QName;[Lorg/apache/xmlbeans/XmlObject;)[Lorg/apache/xmlbeans/XmlObject;
at org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTRImpl.getDrawingArray(CTRImpl.java:3979)
at org.apache.poi.xwpf.usermodel.XWPFRun.<init>(XWPFRun.java:96)
at org.apache.poi.xwpf.usermodel.XWPFRun.<init>(XWPFRun.java:146)
at org.apache.poi.xwpf.usermodel.XWPFParagraph.buildRunsInOrderFromXml(XWPFParagraph.java:118)
at org.apache.poi.xwpf.usermodel.XWPFParagraph.<init>(XWPFParagraph.java:67)
at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:178)
at org.apache.poi.ooxml.POIXMLDocument.load(POIXMLDocument.java:169)
at org.apache.poi.xwpf.usermodel.XWPFDocument.<init>(XWPFDocument.java:126)
二、解决方法
找不到org.openxmlformats.schemas.wordprocessingml.x2006.main.impl.CTRImpl.getXmlObjectArray类,出现这种原因,肯定是少引入了相关的jar或者版本错误导致的,像我这里就是错误的引入了包导致的:
<dependency>
<groupId>org.apache.poi</groupId>
<artifactId>poi-ooxml-schemas</artifactId>
<version>4.1.2</version>
</dependency>
我这里引入的是poi-ooxml-schemas,这个包是个精简过的,所以有些类没有,官方的说明如下:http://poi.apache.org/help/faq.html
引入poi-ooxml包就行了,完整依赖如下:
<dependency>
<groupId>org.apache.poi</groupId>
<artifactId>poi</artifactId>
<version>5.0.0</version>
</dependency>
<dependency>
<groupId>org.apache.poi</groupId>
<artifactId>poi-ooxml</artifactId>
<version>5.0.0</version>
</dependency>
<dependency>
<groupId>org.apache.poi</groupId>
<artifactId>poi-scratchpad</artifactId>
<version>5.0.0</version>
</dependency>
<!-- <dependency>
<groupId>org.apache.poi</groupId>
<artifactId>poi-ooxml-schemas</artifactId>
<version>4.1.2</version>
</dependency>-->
<dependency>
<groupId>org.apache.poi</groupId>
<artifactId>poi-ooxml-full</artifactId>
<version>5.0.0</version>
</dependency>
<dependency>
<groupId>org.apache.poi</groupId>
<artifactId>poi</artifactId>
<version>5.0.0</version>
</dependency>