【AI大模型学习路线】第一阶段之大模型开发基础——第五章（提示工程技术-2）Tree of thought prompting。-CSDN博客

本文链接：https://blog.csdn.net/gaoxiaoxiao1209/article/details/147562920

【AI大模型学习路线】第一阶段之大模型开发基础——第五章（提示工程技术-2）Tree of thought prompting。

文章目录

【AI大模型学习路线】第一阶段之大模型开发基础——第五章（提示工程技术-2）Tree of thought prompting。
前言

欢迎宝子们点赞、关注、收藏！欢迎宝子们批评指正！
祝所有的硕博生都能遇到好的导师！好的审稿人！好的同门！顺利毕业！

大多数高校硕博生毕业要求需要参加学术会议，发表EI或者SCI检索的学术论文会议论文。详细信息可关注VX “学术会议小灵通”或参考学术信息专栏：https://fighting.blog.csdn.net/article/details/146692330

前言

本文概述了 Tree of Thoughts（ToT）提示在大模型中的地位与前景，然后从定义与机制、提示结构、Python 实操、效果评估及最佳实践与未来趋势五个方面展开详述。

1 背景与前景

大型语言模型在“模型即服务”（MaaS）模式下广泛应用，但在需要探索和回溯的复杂推理任务中，传统的 Chain-of-Thought（CoT）线性思路仍显不足。
Tree of Thoughts（ToT）提示通过在推理过程中构建“思路树”，允许模型并行探索多个思路分支、进行自我评估与回溯，从而显著提升诸如数学运算、跨词创作与解谜等多步骤任务的成功率。
在 Game of 24 等基准测试中，GPT-4 使用 CoT 仅有 4% 成功率，而采用 ToT 后成功率飙升至 74%，显示出 ToT 在工业级大模型中的巨大潜力。

2 定义与机制

2.1 ToT 的核心理念

ToT 将“思考步骤”视为节点（thoughts），并在每一步生成多个候选节点，然后基于评估函数选出最优分支继续扩展。
与 CoT 只沿单一路径生成推理不同，ToT 在每个节点均可“回溯”至父节点并尝试其他分支，具备类似搜索算法的全局决策能力。

2.2 组件与流程

生成（Generation）：在当前节点，根据 Prompt 生成若干候选“思路”分支。
评估（Evaluation）：对每个分支应用启发式或模型打分，衡量其解决问题的潜力。
选择（Selection）：保留得分最高的若干分支作为下轮扩展基础。
回溯/终止（Backtrack/Terminate）：若达到深度或找到满足条件的解，则停止或回溯至上层探索其他分支。

3 提示结构与模板

3.1 基本模板

You are a problem solver.  
Task: <描述问题>  

# Step 1: generate thoughts  
Thoughts:  
1. <candidate thought A>  
2. <candidate thought B>  
3. <candidate thought C>  

# Step 2: evaluate thoughts  
Evaluate each thought and pick the best to expand.  

# Step 3: expand chosen thought  
…

此模板显式分离“生成—评估—扩展”阶段，引导模型按树形搜索逻辑执行。

3.2 典型示例：24 点游戏

Solve 24 Game: [3, 3, 8, 8]  
Thoughts:  
1. (3+3)×(8÷8)=6×1=6  
2. (8÷(3−3/8))=…  
3. …  
Evaluate: Thought 2 yields 24 → select and terminate.

该示例展示了如何在 Prompt 中手动构造分支并评估。

4 Python 实操示例

下面以 OpenAI GPT-4 为后端，演示一个简化的 ToT 搜索框架：

import os, openai
openai.api_key = os.getenv("OPENAI_API_KEY")

def generate_thoughts(prompt, k=3):
    resp = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[{"role":"user","content":prompt}],
        n=k, temperature=0.7
    )
    return [choice.message.content for choice in resp.choices]

def evaluate_thought(thought):
    # 简单用模型打分评价
    resp = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[
            {"role":"system","content":"Score the following thought on a scale 1–10 for solving the task."},
            {"role":"user","content":thought}
        ],
        temperature=0.0
    )
    return float(resp.choices[0].message.content.strip())

def tree_of_thought(task, depth=2, breadth=3):
    nodes = [task]
    for d in range(depth):
        candidates = []
        for node in nodes:
            prompt = f"{node}\nGenerate next thoughts:"
            thoughts = generate_thoughts(prompt, k=breadth)
            candidates.extend(thoughts)
        # evaluate all candidates
        scored = [(thought, evaluate_thought(thought)) for thought in candidates]
        # select top branches
        nodes = [t for t,_ in sorted(scored, key=lambda x: -x[1])[:breadth]]
    return nodes

result = tree_of_thought("Solve 15 ÷ 5 + 6 = ?", depth=2, breadth=3)
print("Final candidate thoughts:", result)