使用.NET 将PDF转成Word

使用Solid Framework可以很方便的已编程方式将PDF转换成Word文件格式

  1. 首先准备一套Solid Framework
    在这里插入图片描述
  2. 在Visual Studio中建立一个项目并引用SolidFramework.dll
    在这里插入图片描述
  3. 添加命名空间 SolidFramework
using SolidFramework;
using SolidFramework.Configuration;
using SolidFramework.Converters;
using SolidFramework.Converters.Plumbing;
  1. 前期准备, 设置路径个许可证
string directoryName = Path.GetDirectoryName(Assembly.GetExecutingAssembly().Location);
char directorySeparatorChar = Path.DirectorySeparatorChar;
string str = string.Concat(directoryName, directorySeparatorChar.ToString(), "SolidFramework");

Installer.NativePlatformDirectory = str;
Installer.ForceUnpack = false;
License.Import("Solid Framework", "xxxx", "xxx", "xxxxxxxxxxxxxxx", "NOCALL");
  1. 初始化SolidFrame Pdf Converter
//Add the PDF file to convert
pdfToWordConverter.AddSourceFile(path);
//Settings
pdfToWordConverter.ReconstructionMode = option.C_ReconstructionMode;
pdfToWordConverter.DetectTables = option.Table_Detection;
pdfToWordConverter.OutputType = WordDocumentType.DocX;
pdfToWordConverter.HeaderAndFooterMode = option.C_HeaderAndFooterMode;
pdfToWordConverter.ImageAnchoringMode=option.C_ImageAnchoringMode;
pdfToWordConverter.OverwriteMode = SolidFramework.Plumbing.OverwriteMode.ForceOverwrite;
pdfToWordConverter.KeepCharacterSpacing = false;
FileInfo fileInfo = new FileInfo(path);
pdfToWordConverter.TextRecoveryType = option.Recognize_Text;
pdfToWordConverter.OutputDirectory= fileInfo.DirectoryName;
pdfToWordConverter.SupportRightToLeftWritingDirection = true;
pdfToWordConverter.DetectLists = true;
pdfToWordConverter.DetectStyles = true;
pdfToWordConverter.DetectToc = true;
pdfToWordConverter.MarkupAnnotConversionType = MarkupAnnotConversionType.Never;
pdfToWordConverter.TextRecoveryNseType = TextRecoveryNSE.Never;
  1. OCR识别引擎, 这里使用内置引擎
pdfToWordConverter.TextRecoveryEngine = TextRecoveryEngine.SolidOCR;
  1. 一切就绪开始转换
pdfToWordConverter.Convert();
ConversionStatus status = pdfToWordConverter.Results[0].Status;
  1. 关于ConversionStatus
    ConversionStatus 定义了多种转换状态IO错误密码错误等
 public enum ConversionStatus
    {
        Success = 0,
        Canceled = 1,
        InternalError = 2,
        Unknown = 200,
        Fail = 3,
        BadData = 5,
        IOError = 6,
        IOFileLocked = 7,
        NotEnoughMemory = 9,
        FileHasCopyProtection = 10,
        InvalidPagesRange = 8,
        UnsupportedEncryptionHandler = 11,
        MissingCertificate = 12,
        OCRCanceled = 13,
        NoTablesToExtract = 0xF,
        NoImagesToExtract = 0x10,
        NoBppConversion = 150,
        NoGrayscale = 151,
        PSDUnsupportedMode = 152,
        PdfAError = 20,
        PdfAFatalError = 21,
        CanceledExists = 14,
        WrongPassword = 0x1F,
        NoUserNoOwner = 0x20,
        NoUserOwner = 33,
        UserNoOwner = 34,
        UserOwner = 35,
        InvalidLicense = 36,
        AlreadyLoaded = 30,
        UnavailableAction = 4
    }

关于输出格式
pdfToWordConverter.OutputType = WordDocumentType.DocX;
可以是Doc或Docx

执行后
就可以将PDF转换成Word了

  • 1
    点赞
  • 1
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值