怎么通过OpenAI API调用其多模态大模型（GPT-4o）

最新推荐文章于 2025-03-22 00:15:00 发布

蛐蛐蛐

最新推荐文章于 2025-03-22 00:15:00 发布

阅读量8.9k

点赞数 11

分类专栏： Python技巧深度学习科研工具文章标签：人工智能深度学习

本文链接：https://blog.csdn.net/qysh123/article/details/139048339

版权

现在只要有额度，大家都可以调用OpenAI的多模态大模型了，例如GPT-4o和GPT-4 Turbo，我一年多前总结过一些OpenAI API的用法，发现现在稍微更新了一下。主要参考了这里：https://platform.openai.com/docs/guides/vision

其实也是比较简单的，就是本地图片需要用base 64先编码，然后再上传。我举个例子，大家应该一看就清楚（图片放在Processed文件夹里面）：

from openai import OpenAI
import os
import base64

client = OpenAI(
    api_key="Your_API_Key"
)

# Function to encode the image
def encode_image(image_path):
  with open(image_path, "rb") as image_file:
    return base64.b64encode(image_file.read()).decode('utf-8')

fig_path='Processed'

for filename in os.listdir(fig_path):
    if filename.endswith('.png'):
       image_path=os.path.join(fig_path, filename)
       print(image_path)
       base64_image = encode