Outlines: Making LLM Structured Output Controllable and LLM Applications More Stable

Latest recommended article published on 2025-02-12 08:45:35

程序员笑武



Article link: https://blog.csdn.net/m0_59164304/article/details/141175433


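When building applications on top of an LLM, one major advantage over a traditional API service is that the model produces natural-language output that reads the way people expect. For system integration, however, this is an obstacle, because systems normally exchange structured data. We therefore need the LLM to respond in a specific format, such as JSON, so that downstream code can process the result. The usual approach is to state the format requirements in the prompt, ideally with an example, but this is not 100% reliable, which in turn hurts the stability of the application. A typical prompt of this kind: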

```
Provide 3 suggestions for specific places to go to in Seattle on a rainy day. Respond in the form of JSON. The JSON should have the following format:

[
    { "venue": "...", "description": "..." },
    { "venue": "...", "description": "..." }
]
```
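Because a format request in the prompt can be ignored, the calling code usually has to validate the reply itself. The following is a minimal sketch of such a check using only the standard library. The function name `parse_venues` and both sample replies are hypothetical and are not part of the original article.

```python
import json


def parse_venues(reply):
    """Parse a reply that should be a JSON list of venue objects.

    Raises ValueError if the reply is not valid JSON or has the wrong shape.
    """
    try:
        data = json.loads(reply)
    except json.JSONDecodeError as exc:
        raise ValueError(f"reply is not valid JSON: {exc}") from exc

    if not isinstance(data, list):
        raise ValueError("expected a JSON list")
    for item in data:
        # Each entry must be an object with exactly the two requested keys.
        if not isinstance(item, dict) or set(item) != {"venue", "description"}:
            raise ValueError(f"unexpected item: {item!r}")
    return data


# Hypothetical replies: one well-formed, one wrapped in chatty prose.
good_reply = '[{"venue": "Pike Place Market", "description": "Covered stalls."}]'
bad_reply = 'Sure! Here are my suggestions: [{"venue": "Pike Place Market"}]'

print(parse_venues(good_reply)[0]["venue"])
try:
    parse_venues(bad_reply)
except ValueError as err:
    print("rejected:", err)
```

In a real application the rejected case typically leads to a retry or a fallback, and that extra handling is exactly the instability the paragraph above describes.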

Outlines lets you constrain the output directly. Multiple choice:

```python
import outlines

model = outlines.models.transformers("mistralai/Mistral-7B-Instruct-v0.2")

prompt = """You are a sentiment-labelling assistant.
Is the following review positive or negative?

Review: This restaurant is just awesome!
"""

# Restrict the answer to exactly one of the listed strings.
generator = outlines.generate.choice(model, ["Positive", "Negative"])
answer = generator(prompt)
```

Type constraints:

```python
from outlines import models, generate

model = models.transformers("mistralai/Mistral-7B-v0.1")
# Constrain the output to text that parses as an int.
generator = generate.format(model, int)
answer = generator("When I was 6 my sister was half my age. Now I'm 70 how old is my sister?")
print(answer)
# 67
```
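The expected answer of 67 follows from the fixed age gap between the two siblings. A quick arithmetic check, with variable names chosen only for illustration:

```python
# At age 6 the sister was half that age, so the age gap is fixed at 3 years.
my_age_then = 6
sister_age_then = my_age_then // 2
age_gap = my_age_then - sister_age_then

# The gap does not change over time.
my_age_now = 70
sister_age_now = my_age_now - age_gap
print(sister_age_now)  # 67
```

The type constraint only guarantees that the model returns an integer. Whether that integer is the right one still depends on the model.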

Regex:

```python
import outlines

model = outlines.models.transformers("mistralai/Mistral-7B-Instruct-v0.2")

prompt = "What is the IP address of the Google DNS servers? "

# Unconstrained generation, for comparison.
generator = outlines.generate.text(model)
unstructured = generator(prompt, max_tokens=30)

# Constrained generation: the output must match an IPv4 pattern.
generator = outlines.generate.regex(
    model,
    r"((25[0-5]|2[0-4]\d|[01]?\d\d?)\.){3}(25[0-5]|2[0-4]\d|[01]?\d\d?)",
)
structured = generator(prompt, max_tokens=30)

print(unstructured)
# What is the IP address of the Google DNS servers?
#
# Passive DNS servers are at DNS servers that are private.
# In other words, both IP servers are private. The database
# does not contain Chelsea Manning

print(structured)
# What is the IP address of the Google DNS servers?
# 2.2.6.1
```
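The regex constraint guarantees the shape of the output, not its correctness: `2.2.6.1` is a well-formed IPv4 address, but Google Public DNS is actually served at 8.8.8.8 and 8.8.4.4. The sketch below uses the same pattern with the standard `re` module to show what the constraint does and does not accept. The helper name `is_ipv4` is hypothetical.

```python
import re

# Same pattern as in the Outlines example above.
IPV4_PATTERN = r"((25[0-5]|2[0-4]\d|[01]?\d\d?)\.){3}(25[0-5]|2[0-4]\d|[01]?\d\d?)"


def is_ipv4(text):
    """Return True if the whole string matches the IPv4 pattern."""
    return re.fullmatch(IPV4_PATTERN, text) is not None


print(is_ipv4("2.2.6.1"))    # True: well-formed, even though it is not Google's DNS
print(is_ipv4("8.8.8.8"))    # True
print(is_ipv4("256.1.1.1"))  # False: 256 is out of range for an octet
```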

JSON from a Pydantic model:

```python
from pydantic import BaseModel

from outlines import models, generate


class User(BaseModel):
    name: str
    last_name: str
    id: int


model = models.transformers("mistralai/Mistral-7B-v0.1")
# Constrain the output to JSON that matches the User model.
generator = generate.json(model, User)
result = generator(
    "Create a user profile with the fields name, last_name and id"
)
print(result)
# User(name="John", last_name="Doe", id=11)
```
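To make concrete what "matching the `User` model" means, here is a standard-library sketch that checks a JSON string for the same three fields and types. It uses a plain dataclass as a stand-in and is not how Outlines or Pydantic implement validation. The helper `user_from_json` is hypothetical.

```python
import json
from dataclasses import dataclass


@dataclass
class User:
    name: str
    last_name: str
    id: int


def user_from_json(text):
    """Build a User from a JSON object, rejecting missing, extra, or mistyped fields."""
    data = json.loads(text)
    expected = {"name": str, "last_name": str, "id": int}
    if not isinstance(data, dict) or set(data) != set(expected):
        raise ValueError(f"expected exactly the fields {sorted(expected)}")
    for field, field_type in expected.items():
        value = data[field]
        # bool is a subclass of int in Python, so exclude it explicitly.
        if isinstance(value, bool) or not isinstance(value, field_type):
            raise ValueError(f"field {field!r} must be {field_type.__name__}")
    return User(**data)


print(user_from_json('{"name": "John", "last_name": "Doe", "id": 11}'))
# User(name='John', last_name='Doe', id=11)
```

With a prompt-only approach, a check like this runs after generation and may fail. With a schema constraint, the shape is enforced while the text is generated.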

Outlines also provides prompt templates, which let you manage prompts as ordinary functions and make them simpler to use, in a way that fits Python coding habits.

```python
import outlines

examples = [
    ("The food was disgusting", "Negative"),
    ("We had a fantastic night", "Positive"),
    ("Recommended", "Positive"),
    ("The waiter was rude", "Negative")
]

# The docstring is a Jinja2 template; calling the function renders it.
@outlines.prompt
def labelling(to_label, examples):
    """You are a sentiment-labelling assistant.

    {% for example in examples %}
    {{ example[0] }} // {{ example[1] }}
    {% endfor %}
    {{ to_label }} //
    """

model = outlines.models.transformers("mistralai/Mistral-7B-Instruct-v0.2")
prompt = labelling("Just awesome", examples)
answer = outlines.generate.text(model)(prompt, max_tokens=100)
```
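To see roughly what text the template expands to, the sketch below builds the same few-shot prompt in plain Python, without Jinja2 or Outlines. The function `labelling_plain` is hypothetical, and the exact whitespace produced by `outlines.prompt` may differ from this approximation.

```python
examples = [
    ("The food was disgusting", "Negative"),
    ("We had a fantastic night", "Positive"),
    ("Recommended", "Positive"),
    ("The waiter was rude", "Negative"),
]


def labelling_plain(to_label, examples):
    """Build a few-shot sentiment prompt: one 'text // label' line per example."""
    lines = ["You are a sentiment-labelling assistant.", ""]
    for text, label in examples:
        lines.append(f"{text} // {label}")
    # The final line is left open so the model completes the label.
    lines.append(f"{to_label} //")
    return "\n".join(lines)


print(labelling_plain("Just awesome", examples))
```

The template version keeps the prompt text and its parameters together in a single function, which makes prompts easier to review and reuse than strings assembled ad hoc.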

As the examples above show, Outlines is simple to use, well suited to Python developers, and handles the task of constraining LLM output formats well.

