python - 来自 Google Vision API OCR 的响应 400，带有指定图像的 base64 字符串-6ren

python - 来自 Google Vision API OCR 的响应 400，带有指定图像的 base64 字符串

转载作者：行者123 更新时间：2023-12-02 11:32:06

25

4

我已阅读 How to use the Google Vision API for text detection from base64 encoded image?但这根本没有帮助。 Cloud client library这对我来说是不可取的，因为我在 OCR 之前和期间进行了许多图像处理(例如旋转、裁剪、调整大小等)。将它们保存为新文件并重新读取它们作为 Google Vision API 的输入，效率相当低。

因此，我直接查看了发布请求的文档:

这里是导致失败的最少代码:

import base64
import requests
import io

# Read the image file and transform it into a base64 string
with io.open("photos/foo.jpg", 'rb') as image_file:
    image = image_file.read()
content = base64.b64encode(image)

# Prepare the data for request
# Format copied from https://cloud.google.com/vision/docs/ocr
sending_request = {
  "requests": [
    {
      "image": {
        "content": content
      },
      "features": [
        {
          "type": "TEXT_DETECTION"
        }
      ]
    }
  ]
}

# Send the request and get the response
# Format copied from https://cloud.google.com/vision/docs/using-python
response = requests.post(
    url='https://vision.googleapis.com/v1/images:annotate?key={}'.format(API_KEY),
    data=sending_request,
    headers={'Content-Type': 'application/json'}
)

# Then get 400 code
response
# <Response [400]>
print(response.text)
{
  "error": {
    "code": 400,
    "message": "Invalid JSON payload received. Unexpected token.\nrequests=image&reque\n^",
    "status": "INVALID_ARGUMENT"
  }
}

我转到控制台，发现 google.cloud.vision.v1.ImageAnnotator.BatchAnnotateImages 确实存在请求错误，但我不知道发生了什么。是不是因为requests.post中发送的data格式错误？

最佳答案

错误，“message”:“收到无效的 JSON 有效负载。意外的 token 。\nrequests=image&reque\n^”， 表明您正在传递非 json 格式，该格式必须是json。因此，您应该将其转换为 json 并将其传递给请求，如下所示。

response = requests.post(
url='https://vision.googleapis.com/v1/images:annotate?key={}'.format(API_KEY),
# import json module
# dumps the object to JSON
data=json.dumps(sending_request), 
headers={'Content-Type': 'application/json'}

它将触发 typeError: Object of type 'bytes' is not JSON Serialized 在 json.dumps([sending_request]) 行，因为您没有解码 b64encode 图像。因此，首先执行此操作并发送请求

content = base64.b64encode(image).decode('UTF-8')

关于python - 来自 Google Vision API OCR 的响应 400，带有指定图像的 base64 字符串，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/49918950/

25

4

0

文章推荐： opencl - clEnqueueBarrier 和 clFinish 之间有什么区别？

文章推荐： spring - 如何在 Gradle 构建脚本中加载 .yml 属性？

c++ - `Base *b = new Base;` 与 `Base *b = new Base();` 没有定义我自己的构造函数
如果我不定义自己的构造函数，Base *b = new Base; 与 Base *b = new Base(); 之间有什么区别吗？最佳答案初始化是标准中要遵循的一种 PITA...然而，这两个
c# - 在 C# 中将 base-27(或 base-X)转换为 base-10？
是否有现成的函数可以在 C# 中进行基本转换？我希望将以 26 为基数和以 27 为基数的数字转换为以 10 为基数。我可以在纸上完成，但我不是一个非常有经验的程序员，如果可能的话，我宁愿不要从头开始
java - JNA 的 Pointer.getPointerArray(long base) 和 Pointer.getStringArray(long base) 中的 'base' 是什么意思？
JNA 中'base'是什么意思 Pointer.getPointerArray(long base) Pointer.getStringArray(long base) ? JNA Document
C++ base 10 to base 2逻辑错误
我正在做一个将数字从 10 进制转换为 2 进制的基本程序。我得到了这段代码: #include #include #include #include using namespace std;
c# - 从 "base.base"类调用方法？
“假设以下代码: public class MultiplasHerancas { static GrandFather grandFather = new GrandFather();
三进制计算机与其他基于二进制的算法分析，4th based 5th based
当我分析算法的时候，我突然问自己这个问题，如果我们有三元计算机时间复杂度会更便宜吗？还是有任何基础可以让我们构建计算机，这样时间复杂度分析就无关紧要了？我在互联网上找不到太多，但是基于三元的计算机在给
c# - Base Base Constructor C# 初始化
一个简化的场景。三个类，GrandParent，Parent 和 Child。我想要做的是利用 GrandParent 和 Parent 构造函数来初始化一个 Child 实例。 class Gran
javascript - 评估javascript base 10 to base 2转换函数
我编写了一个简单的函数来将基数为 10 的数字转换为二进制数。我编写的函数是我使用我所知道的简单工具的最佳尝试。我已经在这个网站上查找了如何执行此操作的其他方法，但我还不太了解它。我确定我编写的函数非
c++ - 将数字从 base-10 转换为另一个 base
我尝试了以下代码将数字从 base-10 转换为另一个 base。如果目标基地中没有零(0)，它就会工作。检查 79 和 3 并正确打印正确的 2221。现在尝试数字 19 和 3，结果将是 21 而
algorithm - 分析时间复杂度时log base 2等于log base 3？
这个问题在这里已经有了答案: Is Big O(logn) log base e? (7 个答案) 关闭 8 年前。 Intro 练习 4.4.6 的大多数解决方案。算法第三版说，n*log3(n)
c++ - 运行时检查实例 (Base*) 是否覆盖父函数 (Base::f())
如何判断基类(B)的指针是否(多态)重写了基类的某个虚函数？ class B{ public: int aField=0; virtual void f(){}; }; class C
c# - 为什么 C# 不支持 base.base？
我测试了这样的代码: class A { public A() { } public virtual void Test () { Console.WriteL
html - WPF的grid based layout和html中禁忌的table based layout不一样吗？
两者都采用相同的概念:定义一些行和列并将内容添加到特定位置。但是 Grid 是最常见的 WPF 布局容器，而 html 中基于表格的布局是 very controversial .那么，为什么 WPF
javascript - JS中的继承 : this. base = Class(); this.base() 还是……？
我试图在 JS 中“获得”继承。我刚刚发现了一种基本上可以将所有属性从一个对象复制到另一个对象的简洁方法: function Person(name){ this.name="Mr or Miss
c# - 如何调用像 base.base.GetHashCode() 这样的二级基类方法
class A { public override int GetHashCode() { return 1; } } class B : A { pu
php - 如何将比特种子信息哈希从 Base 32 转换为 Base 16
我有一个 Base32 信息哈希。例如IXE2K3JMCPUZWTW3YQZZOIB5XD6KZIEQ ，我需要将其转换为base16。我怎样才能用 PHP 做到这一点？我的代码如下所示: $ha
google-analytics - 谷歌分析内容实验 : Session Based or User-Based?
我已经使用其实验界面对 Google Analytics 进行了一些实验，一切似乎都运行良好，但我无法找到 Google Analytics 属性如何达到变体目标的答案，即归因 session - 基
flutter - 为什么 "base is derivedA || base is derivedB"没有按预期工作？
if (state is NoteInitial || state is NewNote) return ListView.builder(
c++ - Derived1::Base 和 Derived2::Base 是否指代相同的类型？
MSVC、Clang 和 GCC 不同意此代码: struct Base { int x; }; struct Der1 : public Base {}; struct Der2 : public
javascript - Base 10 到 Base 2 转换器
我已经尝试构建一个 Base 10 到 Base 2 转换器... var baseTen = window.prompt("Put a number from Base 10 to conver

首页

博学

6Ren·AI

商城

python - 来自 Google Vision API OCR 的响应 400，带有指定图像的 base64 字符串