thepi.pe 开源项目教程

贾雁冰

于 2024-09-13 07:18:17 发布

阅读量810

点赞数 23

本文链接：https://blog.csdn.net/gitblog_00052/article/details/142191091

版权

thepi.pe 开源项目教程

thepipe Feed PDFs, URLs, Slides, YouTube, and more into Vision-Language models with one line of code⚡ 项目地址: https://gitcode.com/gh_mirrors/th/thepipe

1. 项目介绍

thepi.pe 是一个强大的 API，旨在从各种来源（如 PDF、URL、文档、幻灯片等）中提取 Markdown 和图像，并准备用于多模态大型语言模型（LLMs）。该项目支持多种文件类型和数据源，能够进行多模态数据抓取和结构化数据提取。

主要功能

Markdown 和图像提取：从任何文档或网页中提取 Markdown、表格和图像。
结构化数据提取：从任何文档或网页中提取复杂的结构化数据。
多模态抓取：支持视频、音频和图像源的多模态抓取。
AI 原生文件类型检测：自动检测文件类型并进行布局分析。

2. 项目快速启动

安装

使用 pip 安装

pip install thepipe-api

获取 API 密钥

注册并获取 API 密钥。
设置环境变量 THEPIPE_API_KEY 为你的 API 密钥。

示例代码

from thepipe.scraper import scrape_file
from thepipe.core import chunks_to_messages
from openai import OpenAI

# 抓取干净的 Markdown 块
chunks = scrape_file(filepath="paper.pdf", ai_extraction=False)

# 使用抓取的块调用 LLM
client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o",
    messages=chunks_to_messages(chunks)
)