python - C# 和 Python 中的 JPEG 压缩差异-6ren

python - C# 和 Python 中的 JPEG 压缩差异

转载作者：行者123 更新时间：2023-12-04 11:56:27

25

4

我正在将一些图像处理功能从 .NET 转移到 Python，限制条件是输出图像必须以与在 .NET 中完全相同的方式进行压缩。但是，当我比较 .jpg 时在类似 text-compare 的工具上输出文件并选择 Ignore nothing ，文件的压缩方式存在显着差异。
例如:
python

bmp = PIL.Image.open('marbles.bmp')

bmp.save(
    'output_python.jpg',
    format='jpeg',
    dpi=(300,300),
    subsampling=2,
    quality=75
)

.NET

ImageCodecInfo jgpEncoder = ImageCodecInfo.GetImageDecoders().First(codec => codec.FormatID == ImageFormat.Jpeg.Guid);
EncoderParameters myEncoderParameters = new EncoderParameters(1);
myEncoderParameters.Param[0] = new EncoderParameter(Encoder.Quality, 75L);

Bitmap bmp = new Bitmap(directory + "marbles.bmp");

bmp.Save(directory + "output_net.jpg", jgpEncoder, myEncoderParameters);

exiftool output_python.jpg -a -G1 -w txt

[ExifTool]      ExifTool Version Number         : 12.31
[System]        File Name                       : output_python.jpg
[System]        Directory                       : .
[System]        File Size                       : 148 KiB
[System]        File Modification Date/Time     : 2021:09:28 09:19:20-06:00
[System]        File Access Date/Time           : 2021:09:28 09:19:21-06:00
[System]        File Creation Date/Time         : 2021:09:27 21:33:35-06:00
[System]        File Permissions                : -rw-rw-rw-
[File]          File Type                       : JPEG
[File]          File Type Extension             : jpg
[File]          MIME Type                       : image/jpeg
[File]          Image Width                     : 1419
[File]          Image Height                    : 1001
[File]          Encoding Process                : Baseline DCT, Huffman coding
[File]          Bits Per Sample                 : 8
[File]          Color Components                : 3
[File]          Y Cb Cr Sub Sampling            : YCbCr4:2:0 (2 2)
[JFIF]          JFIF Version                    : 1.01
[JFIF]          Resolution Unit                 : inches
[JFIF]          X Resolution                    : 300
[JFIF]          Y Resolution                    : 300
[Composite]     Image Size                      : 1419x1001
[Composite]     Megapixels                      : 1.4

exiftool output_net.jpg -a -G1 -w txt

[ExifTool]      ExifTool Version Number         : 12.31
[System]        File Name                       : output_net.jpg
[System]        Directory                       : .
[System]        File Size                       : 147 KiB
[System]        File Modification Date/Time     : 2021:09:28 09:18:05-06:00
[System]        File Access Date/Time           : 2021:09:28 09:18:52-06:00
[System]        File Creation Date/Time         : 2021:09:27 21:32:19-06:00
[System]        File Permissions                : -rw-rw-rw-
[File]          File Type                       : JPEG
[File]          File Type Extension             : jpg
[File]          MIME Type                       : image/jpeg
[File]          Image Width                     : 1419
[File]          Image Height                    : 1001
[File]          Encoding Process                : Baseline DCT, Huffman coding
[File]          Bits Per Sample                 : 8
[File]          Color Components                : 3
[File]          Y Cb Cr Sub Sampling            : YCbCr4:2:0 (2 2)
[JFIF]          JFIF Version                    : 1.01
[JFIF]          Resolution Unit                 : inches
[JFIF]          X Resolution                    : 300
[JFIF]          Y Resolution                    : 300
[Composite]     Image Size                      : 1419x1001
[Composite]     Megapixels                      : 1.4

marbles.bmp sample image
文本比较差异

问题

假设这两种 JPEG 压缩实现可以产生相同的输出文件是否合理？

如果是这样，要么是 PIL或 System.Drawing.Image做任何额外的步骤，比如抗锯齿，使结果不同？

或者 PIL 是否有其他参数.save()让它表现得更像 C# 中的 JPEG 编码器？

谢谢
更新
基于 Jeremy's recommendation , 我用了 JPEGsnoop比较文件之间的更多细节，发现亮度和色度表是不同的。我修改了代码:

bmp = PIL.Image.open('marbles.bmp')

output_net = PIL.Image.open('output_net.jpg')

bmp.save(
    'output_python.jpg',
    format='jpeg',
    dpi=(300,300),
    subsampling=2,
    qtables=output_net.quantization,
    #quality=75
)

现在表是相同的，但文件之间的差异没有改变。 JPEGsnoop 现在显示的唯一区别在于 Compression stats和 Huffman code histogram stats . output_net.jpeg

*** Decoding SCAN Data ***
  OFFSET: 0x0000026F
  Scan Decode Mode: Full IDCT (AC + DC)

  Scan Data encountered marker   0xFFD9 @ 0x00024BE7.0

  Compression stats:
    Compression Ratio: 28.43:1
    Bits per pixel:     0.84:1

  Huffman code histogram stats:
    Huffman Table: (Dest ID: 0, Class: DC)
      # codes of length 01 bits:        0 (  0%)
      # codes of length 02 bits:     1664 (  7%)
      # codes of length 03 bits:    18238 ( 81%)
      # codes of length 04 bits:     1807 (  8%)
      # codes of length 05 bits:      715 (  3%)
      # codes of length 06 bits:        4 (  0%)
      # codes of length 07 bits:        0 (  0%)
      ...

output_python.jpg

*** Decoding SCAN Data ***
  OFFSET: 0x0000026F
  Scan Decode Mode: Full IDCT (AC + DC)

  Scan Data encountered marker   0xFFD9 @ 0x00025158.0

  Compression stats:
    Compression Ratio: 28.17:1
    Bits per pixel:     0.85:1

  Huffman code histogram stats:
    Huffman Table: (Dest ID: 0, Class: DC)
      # codes of length 01 bits:        0 (  0%)
      # codes of length 02 bits:     1659 (  7%)
      # codes of length 03 bits:    18247 ( 81%)
      # codes of length 04 bits:     1807 (  8%)
      # codes of length 05 bits:      711 (  3%)
      # codes of length 06 bits:        4 (  0%)
      # codes of length 07 bits:        0 (  0%)
      ...

我现在正在寻找一种通过 PIL 同步这些值的方法。 .

最佳答案

Is it reasonable to assume that these two implementations of JPEG compression could yield identical output files?

答案并非如此。
JPEG 压缩的要点是有损失的高压缩。即使质量设置为 100，损失也是不可避免的，因为该算法需要无限精度来精确复制源图像。
如果使用相同的参数对两种算法进行相同的编码:精度、边界选择和填充/偏移规范以提供 FFT 的 2 次幂大小，则可以生成相同的文件。
JPEG 算法的实现可以使用预传递来优化算法的参数。
鉴于两种实现的参数优化不同，输出不太可能相同。

Are there additional parameters to PIL .save() to make it behave more like the JPEG encoder in C#?

我不能直接回答这个问题，但是，你可以使用这个包: Python for.NET从 Python 访问 C# JPEG 编码器。该解决方案将提供一致的相同结果。

为什么除了教育值(value)之外，还有人需要二进制兼容性吗？
在我认为解决这个问题的所有实际场景中，唯一的需要是保存图像的附加散列:将新散列保存在单独的字段中。
选择一种技术并使用它，直到它不再适合您的需要/要求。
如果没有(最好是之前)，找到垫片来填补空白并重写代码以利用新技术。

关于python - C# 和 Python 中的 JPEG 压缩差异，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/69365037/

25

4

0

文章推荐： javascript - 弹出框没有出现在 React 中

文章推荐： python - 运行 shutdown 的 Cloud Run Flask API 容器进入休眠循环

文章推荐： reactjs - 如何使变量的类型为未知或 'Movie' 分配？

CUDA替代__syncthreads而不是__threadfence()差异
我从NVIDIA手册Eg中复制了以下代码:__threadfence()。他们为什么有在以下代码中使用了__threadfence()。我认为使用__syncthreads()而不是__thread
带有修订范围和更改列表的 SVN 差异
我在使用 SVN 更改列表和 svn diff 时遇到了一些麻烦.特别是我想获取特定修订范围的特定文件列表的更改历史记录。 SVN 变更列表似乎是完美的解决方案，所以我的方法是: svn change
shell - 差异/合并两个文件
我有两个 IP 地址列表。我需要将它们合并到三个文件中，交集，仅来自 list1 的文件和仅来自 list2 的文件。我可以用 awk/diff 或任何其他简单的 unix 命令来做到这一点吗？如何
上一个和新工作副本之间的 svn 差异
假设自上次更新(恢复)到我的 a.b 文件以来我做了一些更改。此 a.b 文件也在存储库中更改。现在我想将我所做的更改与 repos 更改进行比较。如果我 svn revert 文件，我可以看到
JavaBeans 比较器/差异
关闭。这个问题不符合Stack Overflow guidelines .它目前不接受答案。我们不允许提问寻求书籍、工具、软件库等的推荐。您可以编辑问题，以便用事实和引用来回答。关闭 7 年前。
openssl sha256 差异
我使用的是 openssl 1.0.1c , linux x86_64 我正在创建包含“hello”的文件(没有换行符) openssl dgst -sha256 hello_file i get :
naming - 共同与核心 - 差异
假设我们有几个库。有什么区别核心和普通图书馆？他们应该如何被认可，我们是否组织了两者的职责？ +Common -Class1 +Core -Class2 +Lib1 has : Comm
以毫秒为单位的日期之间的 SQLite 差异
如何在 SQLite 中计算以毫秒为单位的最小时间间隔？好的，提供一些背景信息，这是我的 table 的样子: link_budget table 所以有这个时间列，我想发出一个请求，以毫秒为单位
concurrency - 乐观与多版本并发控制 - 差异？
我想知道，乐观并发控制 (OCC) 和多版本并发控制 (MVCC) 之间的区别是什么？到目前为止，我知道两者都是基于更新的版本检查。在 OCC 中，我读到了没有获取读取访问锁的事务，仅适用于以后的
c# - SignalR 差异
说到 SignalR，我有点菜鸟。刚刚开始四处探索和谷歌搜索它，我想知道是否有人可以向我解释完成的事情之间的一些差异。在我见过的一些示例中，人们需要创建一个 Startup 类并定义 app.Map
math - 两个四元数之间的“差异”
我在 Ogre 工作，但这是一个一般的四元数问题。我有一个对象，我最初对其应用旋转四元数 Q1。后来，我想让它看起来好像我最初通过不同的四元数 Q2 旋转了对象。我如何计算四元数，该四元数将采用已
Javascript 模块模式 - 差异
我了解 javascript 模块模式，但我使用两种类型的模块模式，并且想从架构 Angular 了解它们之间的区别。 // PATTERN ONE var module = (function()
Scala JSON 差异
我有两个具有完全相同键的 JSON。 val json1 = """{ 'name': 'Henry', 'age' : 26, 'activities' : {
vba - 文件复制与名称函数？差异？
我发现使用 VBA 在 Excel 中复制单个文件有两种不同的方法。一是文件复制: FileCopy (originalPath), (pathToCopyTo) 另一个是名称: Name (orig
java - float[] 差异
我想知道查找两个 float 组之间差异的绝对值的最有效方法是什么？是否是以下内容: private float absDifference(float[] vector1, float[] vec
Wicket:getApplication 差异
我有一个关于 wicket getApplication 的问题。 getApplication() 和 getSession().getApplication 有什么区别？部署 wicket 应用
使用和不使用追溯模式的持久订阅之间的 activemq 差异
我刚刚开始使用activemq，我有一个关于追溯消费者的问题，为了启用这个功能，你需要有一个持久的订阅。但是在主题上启用和不启用追溯的持久订阅有什么区别？ activemq 文档说。 http://a
Scala JSON 差异
我有两个具有完全相同键的 JSON。 val json1 = """{ 'name': 'Henry', 'age' : 26, 'activities' : {
types - 浮点和整数的Erlang二进制表示，差异？
得到另一个 Erlang 二进制表示查询('因为这就是我最近正在阅读的内容，并且需要二进制协议(protocol)实现)。如果我正确理解了类型说明符，那么对于“浮点”类型值，8 字节表示似乎很好(这
java - 重载和隐藏 - 差异
关闭。这个问题需要多问focused 。目前不接受答案。想要改进此问题吗？更新问题，使其仅关注一个问题 editing this post . 已关闭 4 年前。 Improve this ques

首页

博学

6Ren·AI

商城

python - C# 和 Python 中的 JPEG 压缩差异