python - Tensorflow:如何编码和读取bmp图像？-6ren

python - Tensorflow:如何编码和读取bmp图像？

转载作者：行者123 更新时间：2023-12-01 01:49:32

24

4

我正在尝试读取 .bmp 图像，对这些图像进行一些增强，将它们保存到 .tfrecords 文件中，然后打开 .tfrecords 文件并使用这些图像进行图像分类。我知道有一个 tf.image.encode_jpeg() 和一个 tf.image.encode_png() 函数，但没有 tf.image.encode_bmp() 函数。我知道 .bmp 图像是未压缩的，因此我尝试简单地对图像进行 Base64 编码、np.tostring() 和 np.tobytes()，但在尝试解码这些格式时出现以下错误:

tensorflow.python.framework.errors_impl.InvalidArgumentError: channels attribute 3 does not match bits per pixel from file <some long number>

我的看法是，tensorflow 在编码为 jpeg 或 png 时，对图像的字节编码做了一些额外的事情；保存有关数组维数等的信息。但是，我对此一无所知，所以任何帮助都会很棒!

一些代码来展示我想要实现的目标:

with tf.gfile.FastGFile(filename, 'rb') as f:
    image_data = f.read()
    bmp_data = tf.placeholder(dtype=tf.string)
    decode_bmp = tf.image.decode_bmp(self._decode_bmp_data, channels=3)
    augmented_bmp = <do some augmentation on decode_bmp>
    sess = tf.Session()
    np_img = sess.run(augmented_bmp, feed_dict={bmp_data: image_data})
    byte_img = np_img.tostring()

    # Write byte_img to file using tf.train.Example
    writer = tf.python_io.TFRecordWriter(<output_tfrecords_filename>)
    example = tf.train.Example(features=tf.train.Features(feature={
        'encoded_img': tf.train.Feature(bytes_list=tf.train.BytesList(value=[byte_img])}))
    writer.write(example.SerializeToString())

    # Read img from file
    dataset = tf.data.TFRecordDataset(<img_file>)
    dataset = dataset.map(parse_img_fn)

parse_img_fn可以简化为以下内容:

def parse_img_fn(serialized_example):
    features = tf.parse_single_example(serialized_example, feature_map)
    image = features['encoded_img']
    image = tf.image.decode_bmp(image, channels=3) # This is where the decoding fails
    features['encoded_img']

    return features

最佳答案

^{在您的评论中，您的意思肯定是编码而不是加密}

BMP file format非常简单，由一堆标题和几乎原始像素数据组成。这就是 BMP 图像如此大的原因。我想这也是 TensorFlow 开发人员懒得编写函数来将数组(表示图像)编码为这种格式的原因。现在仍然很少有人使用它。建议使用 PNG 来代替，它对图像执行无损压缩。或者，如果您可以处理有损压缩，请使用 JPG。

TensorFlow 没有对图像编码做任何特殊的事情。它只是返回表示该格式图像的字节，类似于 matplotlib 在执行 save_fig 时所做的操作(除了 MPL 还将字节写入文件)。

假设您生成一个 numpy 数组，其中顶行为 0，底行为 255。这是一个数字数组，如果您将其视为图片，那么这是一个数字数组>，代表 2 个水平带，顶部一个黑色，底部一个白色。

如果您想在另一个程序 (GIMP) 中查看此图片，您需要以标准格式编码此信息，例如 PNG。编码意味着添加一些 header 和元数据，并且可以选择压缩数据。

<小时/>

现在更清楚什么是编码，我建议您使用 PNG 图像。

with tf.gfile.FastGFile('image.png', 'rb') as f:
    # get the bytes representing the image
    # this is a 1D array (string) which includes header and stuff
    raw_png = f.read()

    # decode the raw representation into an array
    # so we have 2D array representing the image (3D if colour) 
    image = tf.image.decode_png(raw_png)

    # augment the image using e.g.
    augmented_img = tf.image.random_brightness(image)

    # convert the array back into a compressed representation
    # by encoding it into png
    # we now end up with a string again
    augmented_png = tf.image.encode_png(augmented_img, compression=9) 

    # Write augmented_png to file using tf.train.Example
    writer = tf.python_io.TFRecordWriter(<output_tfrecords_filename>)
    example = tf.train.Example(features=tf.train.Features(feature={
        'encoded_img': tf.train.Feature(bytes_list=tf.train.BytesList(value=[augmented_png])}))
    writer.write(example.SerializeToString())

    # Read img from file
    dataset = tf.data.TFRecordDataset(<img_file>)
    dataset = dataset.map(parse_img_fn)

有一些重要的建议:

不要使用numpy.tostring。这会返回一个HUUGE表示，因为每个像素都表示为 float ，并且它们都是连接在一起的。没有压缩，什么都没有。尝试检查文件大小:)
无需使用 tf.Session 传回 python。您可以在TF端执行所有操作。这样您就拥有了一个输入图，您可以将其重新用作输入管道的一部分。

关于python - Tensorflow:如何编码和读取bmp图像？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/50871281/

24

4

0

文章推荐： apache-spark - Spark Thrift 服务器与 Apache Thrift 的关系

文章推荐： javascript - for 循环中的 getElementById 仅显示第一项

文章推荐： r - 多组(类型)的 Rmarkdown block 的多组全局选项

文章推荐： Javascript 在将数字相加时删除小数

c++ - 我尝试用 C++ 编写一个 bmp 文件，但是 bmp 文件的宽度和高度似乎会影响结果，有时我会得到一个错误的 bmp 文件
正如标题中所描述的那样。代码如下所示 #include #include using namespace std; #define WIDTH 90 #define HEIGHT 180 cla
bmp - BMP 文件的方向 : upright or upside-down?
我正在开发一个程序来调整 BMP 文件的大小并将其存储在一个新文件中。我注意到有些 BMP 完全颠倒存储，而其他 BMP 则只是直立存储。所以我做了两个解决方案来处理这两种情况。我的问题是如何事先知道
bmp - 使用 ZPL 打印 BMP
如何使用 ZPL II 打印位图 (BMP) 图像？我使用 ~DY 将 BMP 下载到打印机: ~DYR:PRINT,B,B, , , 我正在使用 PHP 将原始数据发送到打印机，因此 <>
c++ - 将彩色 bmp 转换为灰度 bmp？
我正在尝试将彩色 BMP 文件转换为灰度 BMP。输入 bmp 是 24 位，我在输出端生成相同的 24 位 bmp，只是这次是灰度。我使用的代码是 for(int x = 0; x < max;
bmp - 试图了解如何绘制 1 位 BMP 图像
正如在 this example 中所见, BMP 文件中的每个 channel (R, G, B) 接受一个输入。一个 24 位的 BMP 图像有 8 位的-R、8 位的 G 和 8 位的 B。我在
image - 如何区分原始.BMP/.EMF图片文件与处理后的.BMP/.EMF文件之间的区别？
总结: 直接计算表明:一张3289 X 4570 X 32bpp的.BMP图片大约需要53MB。同样大小但24bpp的图片大约需要43MB，16bpp的图片大约需要28MB。在我的情况下，ChemDr
c++ - 从 RGB BMP 创建灰度 BMP
我有 24 位图像，我读取位图并将其转换为灰度并像 8 位一样保存。 RGBTRIPLE temp; unsigned char t; ... t = (temp.rgbtBlue * 0.114 +
c++ - 在较大的 bmp 文件中查找较小的 bmp 文件
我想从另一个更大的 bmp 文件中找到一个小的 bmp 文件(更大的一个是从屏幕上捕获的并称为 Sample.bmp ，小的 bmp 文件称为 Button.bmp 。事情是在比较图像时文件可以随处可
c - .bmp 图像比较显示 image.bmp 上的 EOF
我需要使用 .bmp 类型的图像。其格式为: struct bmp_fileheader { unsigned char fileMarker1; /* 'B' */ unsigne
c - 从 BMP 文件读取到 C 中的 BMP 头结构
我正在尝试获取一个 BMP 文件并将其读入，然后对其中的像素执行操作以更改其颜色。我的问题是我无法将文件中的数据读入两个 BMP header 结构。我能够很好地将所有数据读入第一个结构，但在读入第二
如果 'dialog-bmp' 是 'dialog-bmp'，Python 将返回 false
这个问题已经有答案了: Why does comparing strings using either '==' or 'is' sometimes produce a different resul
python - 不支持的 BMP 压缩 - BMP 到 JPEG - PIL - Python
我正在尝试使用以下代码将 bmp 图像转换为 jpeg。 from PIL import Image img = Image.open('/Desktop/xyz.bmp') new_img = im
c - 24 位 BMP 与 256 色 BMP 中的像素存储
在 24 位 bmp 中，像素存储为 BGR，每种颜色仅占用 1 个字节。那个可以读 for(i=0;i
c++ - 在 C++ 中图像 bmp 中的 Alpha 混合水印 bmp
我在学校有一个任务，要将水印 bmp 图像添加到其他 bmp 图像中。该任务称为 alpha 混合。我必须在用户将在启动时通过程序参数设置的特定坐标插入水印，以及水印混合的 alpha 值。我几乎成功
c# - 将 8 位彩色 bmp 图像转换为 8 位灰度 bmp
这是我的位图对象 Bitmap b = new Bitmap(columns, rows, PixelFormat.Format8bppIndexed); BitmapData bmd = b.Loc
php - ImageMagick:无法将 24 位 bmp 图像转换为 8 位 bmp 图像
我的编程平台是: Ubuntu 3.19 ImageMagick 6.7.7 编程语言:php 我的代码: $img = new Imagick($image_input_24bit_bmp); $i
c - 从图像 RGB bmp 纯 C 编写灰度 bmp 的这段代码有什么问题 - Windows 操作系统
这是我的函数，我根据维基百科 BITMAPINFOHEADER 使用标题 BMP。但是，我得到的文件没有任何图像...当放置填充时，该过程停止。 // Structures for header i
icons - Ico 格式 - 32 位 PNG 与 32 位 BMP 与 24 位 BMP
我正在发布一个 .NET 程序，并且正处于完善阶段。我需要在项目属性中设置图标。根据我的研究，我想创建以下像素大小的图像以适应所有图标大小:16、32、48、96、256。我还找到了一个程序，可以将
php - 无法在 PHP 中从 MIME 类型为 'image/x-ms-bmp' 的 BMP 创建 GD 图像资源
我正在尝试从 BMP 图像创建 GD 图像资源，但是我没有运气。有问题的 BMP 图像是用 Photoshop 创建和保存的。我也尝试了几个在网上找到的 BMP，它们给出了相同的结果。 getima
bmp - BMP文件结构中数据类型FXPT2DOT30是如何定义的？
类型 FXPT2DOT30 出现在为 BMP 文件定义 struct CIEXYZ 时，根据微软提供的定义: http://msdn.microsoft.com/en-us/library/windo

首页

博学

6Ren·AI

商城

python - Tensorflow:如何编码和读取bmp图像？