DeepSeek-R1 8B 模型

最新推荐文章于 2025-05-02 18:57:07 发布

Leon_Jinhai_Sun

最新推荐文章于 2025-05-02 18:57:07 发布

阅读量463

点赞数 5

文章标签：服务器运维

本文链接：https://blog.csdn.net/Leon_Jinhai_Sun/article/details/146245149

版权

！DeepSeek-R1 8B 模型确实是一个非常重要的模型大小选项，尤其是在平衡性能和资源需求方面。以下是对 8B 模型的详细分析，以及它在 PDF 和图片关键内容提取任务中的适用性。

1. 8B 模型的特点

参数数量：80 亿参数，介于 7B 和 13B 之间。
性能：比 7B 模型更强，适合中等复杂度的任务。
资源需求：比 13B 模型更轻量，适合资源有限的环境。

2. 8B 模型的适用场景

中等复杂度任务：如生成中等长度的摘要、提取关键信息、回答简单问题。
资源有限的环境：如本地 CPU 或中端 GPU。
实时性要求较高的场景：8B 模型在性能和速度之间取得了较好的平衡。

3. 8B 模型与其他模型的对比

模型大小	参数数量	适用场景	资源需求
7B	70 亿	简单任务（如关键词提取、短摘要）	低
8B	80 亿	中等复杂度任务（如中等长度摘要）	中低
13B	130 亿	中等复杂度任务（如复杂问答）	中
32B	320 亿	高复杂度任务（如长文本摘要）	高

4. 示例：使用 8B 模型提取关键内容

以下是一个使用 8B 模型从 PDF 和图片中提取关键内容的示例代码：

步骤 1：安装依赖

pip install paddleocr fitz requests

步骤 2：编写代码

import fitz  # PyMuPDF
from paddleocr import PaddleOCR
import requests

# 初始化 PaddleOCR
ocr = PaddleOCR(use_angle_cls=True, lang="ch")

def extract_text_from_pdf(pdf_path):
    # 打开 PDF 文件
    pdf_document = fitz.open(pdf_path)
    text = ""

    # 遍历每一页
    for page_num in range(len(pdf_document)):
        page = pdf_document.load_page(page_num)
        page_text = page.get_text()

        if page_text.strip():  # 如果是文本型 PDF
            text += page_text
        else:  # 如果是图像型 PDF，使用 OCR
            pix = page.get_pixmap()
            image_path = f"page_{page_num + 1}.png"
            pix.save(image_path)
            result = ocr.ocr(image_path, cls=True)
            page_text = "\n".join([line[1][0] for line in result[0]])
            text += page_text

    return text

def extract_text_from_image(image_path):
    # 调用 PaddleOCR 识别图片中的文字
    result = ocr.ocr(image_path, cls=True)
    text = "\n".join([line[1][0] for line in result[0]])
    return text

def call_deepseek_model(text, model_size="8B"):
    # 调用 DeepSeek 模型处理文本
    url = "http://localhost:11434/api/generate"  # ollama 的 API 地址
    payload = {
        "model": f"deepseek-r1:{model_size}",
        "prompt": f"请从以下文本中提取关键内容：{text}",
        "stream": False
    }
    response = requests.post(url, json=payload)
    return response.json()["response"]

def process_file(file_path):
    # 处理文件（PDF 或图片），提取文本并调用 DeepSeek 模型
    if file_path.endswith(".pdf"):
        text = extract_text_from_pdf(file_path)
    elif file_path.endswith((".jpg", ".png", ".jpeg")):
        text = extract_text_from_image(file_path)
    else:
        raise ValueError("不支持的文件格式")

    # 调用 DeepSeek 模型提取关键内容
    result = call_deepseek_model(text, model_size="8B")
    return result

# 示例：处理文件
file_path = "example.pdf"  # 或 "example.jpg"
result = process_file(file_path)
print("提取的关键内容：", result)