调用百度ai接口批量读取图片上的文字（代码没有优化，不喜勿喷，部分需要隐藏）

最新推荐文章于 2022-04-21 17:50:49 发布

weixin_42653353

最新推荐文章于 2022-04-21 17:50:49 发布

阅读量332

点赞数

分类专栏： python 文章标签：数据挖掘机器学习

本文链接：https://blog.csdn.net/weixin_42653353/article/details/105863959

版权

python 专栏收录该内容

10 篇文章 0 订阅

订阅专栏

欢迎评论席交流学习原创内容

import requests
import base64
import json
import pandas as pd
import os

#百度申请的百度智能云上找到对应的文字识别（要找对）创建应用，记得勾选需要的，然后记下下面的3个key

APP_ID=''
API_KEY=''
SECRET_KEY=''

#获取access_token

host = 'https://aip.baidubce.com/oauth/2.0/token?grant_type=client_credentials&client_id=API_KEY&client_secret='
response = requests.get(host)
if response:
print(response.json())

price=[列表内容]
list=[]
t=0
def static_info():
# url = "https://aip.baidubce.com/rest/2.0/ocr/v1/accurate_basic" #高精度接口每天500次调用限制
url = "https://aip.baidubce.com/rest/2.0/ocr/v1/general_basic" #普通接口好像是每天50000次限制
params = {"image": img}
access_token = 这里填写上面获取到的access_token
request_url = url + "?access_token=" + access_token
headers = {'content-type': 'application/x-www-form-urlencoded'}
response = requests.post(request_url, data=params, headers=headers)
return response

for d in price:
file_num=0

for root, dirs, files in os.walk('路径'+str(d)):

file=[i for i in files if len(i)>15]
for p in file:
print(p)
try:
f = open('路径‘+p, 'rb')
except IOError:
print('此处没有文件了')
else:
img = base64.b64encode(f.read())

response=static_info()

if response:
js=response.json()
words_dic=js.get('words_result')
if words_dic:
for i in words_dic:
data=[[d],words_dic[0]['words'],i['words']]
list.append(data)
list.append([])

p= pd.DataFrame(list)
p.to_excel('路径/文件名',header=False)
print('数据读取完毕')

weixin_42653353

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
调用百度ai接口批量读取图片上的文字（代码没有优化，不喜勿喷，部分需要隐藏）

欢迎评论席交流学习原创内容import requestsimport base64import jsonimport pandas as pdimport os#百度申请的百度智能云上找到对应的文字识别（要找对）创建应用，记得勾选需要的，然后记下下面的3个keyAPP_ID=''API_KEY=''SECRET_KEY=''#获取acce...
复制链接

扫一扫

专栏目录