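A DOCX file is essentially a ZIP archive, and that archive contains a docProps folder. The word-count information is stored there, in the part docProps/app.xml.
The contents of app.xml look like this: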
<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<Properties xmlns="http://schemas.openxmlformats.org/officeDocument/2006/extended-properties" xmlns:vt="http://schemas.openxmlformats.org/officeDocument/2006/docPropsVTypes">
<Template>Normal.dotm</Template>
<TotalTime>1</TotalTime>
<Pages>1</Pages>
<Words>16</Words>
<Characters>101</Characters>
<Application>Microsoft Office Word</Application>
<DocSecurity>0</DocSecurity>
<Lines>4</Lines>
<Paragraphs>4</Paragraphs>
<ScaleCrop>false</ScaleCrop>
<Company />
<LinksUpToDate>false</LinksUpToDate>
<CharactersWithSpaces>112</CharactersWithSpaces>
<SharedDoc>false</SharedDoc>
<HyperlinksChanged>false</HyperlinksChanged>
<AppVersion>15.0000</AppVersion>
</Properties>
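Because the package is an ordinary ZIP archive, the same values can also be read without the Open XML SDK. The following is only a minimal sketch under some assumptions: the class name AppXmlReader and the method ReadProperty are made up for illustration, the part is assumed to live at docProps/app.xml as in the listing above, and on .NET Framework the project is assumed to reference System.IO.Compression.FileSystem so that ZipFile is available.

```csharp
using System;
using System.IO;
using System.IO.Compression;
using System.Xml.Linq;

// Hypothetical helper, not part of any library.
static class AppXmlReader
{
    // Namespace of the extended-properties part, copied from the listing above.
    static readonly XNamespace Ep =
        "http://schemas.openxmlformats.org/officeDocument/2006/extended-properties";

    // Returns the text of one child element of <Properties> (for example
    // "Words"), or null if the part or the element is missing.
    public static string ReadProperty(string docxPath, string elementName)
    {
        using (ZipArchive archive = ZipFile.OpenRead(docxPath))
        {
            // Assumption: the part is stored under this exact entry name.
            ZipArchiveEntry entry = archive.GetEntry("docProps/app.xml");
            if (entry == null) return null;

            using (Stream stream = entry.Open())
            {
                XDocument doc = XDocument.Load(stream);
                return doc.Root?.Element(Ep + elementName)?.Value;
            }
        }
    }
}
```

A call such as AppXmlReader.ReadProperty(filepath, "Words") would then return the stored word count as a string. Either way, these are the values the producing application wrote when it last saved the file; they are not recomputed when the file is read.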
These properties can be read with the Open XML SDK.
The code is as follows:
// Requires: using DocumentFormat.OpenXml.Packaging;
// Open read-only (second argument false), since the values are only read.
using (WordprocessingDocument wordprocessingDocument =
    WordprocessingDocument.Open(filepath, false))
{
    // The part and each element may be absent, hence the null-conditional access.
    var props = wordprocessingDocument.ExtendedFilePropertiesPart?.Properties;
    string wordCount = props?.Words?.Text;
    string pageCount = props?.Pages?.Text;
    string lineCount = props?.Lines?.Text;
}