LLMParser 开源项目教程

最新推荐文章于 2024-08-31 09:44:15 发布

毛彤影

最新推荐文章于 2024-08-31 09:44:15 发布

阅读量490

点赞数 21

本文链接：https://blog.csdn.net/gitblog_00094/article/details/141734452

版权

LLMParser 开源项目教程

llmparserClassify and extract structured data with LLMs项目地址:https://gitcode.com/gh_mirrors/ll/llmparser

1、项目介绍

LLMParser 是一个简单且灵活的工具，用于通过大型语言模型（LLMs）从文本中分类和提取结构化数据。尽管大型语言模型非常强大，但生成可靠的 JSON 输出仍然具有挑战性。LLMParser 旨在通过强制执行一致的 JSON 输入和输出格式来解决这一问题，从而实现对文本的分类和提取。

2、项目快速启动

安装

首先，通过 npm 安装 LLMParser：

npm install llmparser

使用示例

以下是一个简单的使用示例，展示了如何使用 LLMParser 解析一个 PDF 文件并提取信息：

import { LLMParser } from 'llmparser';

const categories = [
  {
    name: "MSA",
    description: "Master service agreement"
  },
  {
    name: "NDA",
    description: "Non disclosure agreement",
    fields: [
      {
        name: "effective_date",
        description: "effective date or start date",
        type: "string"
      },
      {
        name: "company",
        description: "name of the company",
        type: "string"
      },
      {
        name: "counterparty",
        description: "name of the counterparty",
        type: "string"
      }
    ]
  }
];

const parser = new LLMParser({
  categories,
  apiKey: process.env.OPENAI_API_KEY
});

const ndaText = await loadPDFAsText("src/nda.pdf"); // 获取 PDF 文本
const extraction = await parser.parse(ndaText);

console.log(extraction);