Aspose.Pdf破解版,读取PDF内容,转换为文本格式,亲自测试Aspose.Pdf.dll可用。
string dataDir = @"D:\XXX.pdf";
// Open PDF document
Aspose.Pdf.Document pdfDocument = new Aspose.Pdf.Document(dataDir);
// Create TextAbsorber object to extract text
TextAbsorber textAbsorber = new TextAbsorber();
// Accept the absorber for all pages
pdfDocument.Pages.Accept(textAbsorber);
// Get the extracted text
string extractedText = RemoveWhiteSpaces(textAbsorber.Text);
// Create a writer and open the file
TextWriter tw = new StreamWriter(dataDir + "extracted-text.txt");
// Write a line of text to the file
tw.WriteLine(extractedText);
// Close the stream
tw.Close();
官方连接: