Reading and writing sections of a custom binary file format in Python(在Python中读取和写入自定义二进制文件格式的部分)-6ren

Reading and writing sections of a custom binary file format in Python(在Python中读取和写入自定义二进制文件格式的部分)

转载作者：bug小助手更新时间：2023-10-25 14:33:12

31

4

I have not worked with an entirely custom file format yet, but the project I am working on requires an entirely new, custom binary file format. I don't know all the best practices for the same (like using Identification bytes aka "magic numbers") and how to implement them in Python. Here are the basic requirements:

我还没有使用过完全定制的文件格式，但我正在处理的项目需要一种全新的定制二进制文件格式。我不知道这方面的所有最佳实践(比如使用标识字节，也就是“魔术数字”)，也不知道如何在Python中实现它们。以下是基本要求：

I have a dictionary which must be used as metadata regarding the file (which will probably be serialized, I guess).

The main body of the file will contain random bytes because they are the result of encryption.

I have to read the metadata whenever such a file is provided, and get back my original Python dictionary, and then I need to retrieve the body i.e., the random bytes, for decryption. Kindly provide a basic implementation or an idea to read and write such a file in Python along with the best practices to create the custom file format.

每当提供这样的文件时，我都必须读取元数据，并取回我的原始Python词典，然后我需要检索正文，即随机字节，以进行解密。请提供一个基本的实现或一个想法，以读写这样的文件在Python中，以及创建自定义文件格式的最佳实践。

Currently, as a temporary solution, I am serializing the dictionary using ormsgpack and prepending it to the output file, then using a custom delimiter b"\xFF\xFF\xFF\xFF" to separate the serialized metadata from the main body.

目前，作为临时解决方案，我使用ormsgpack序列化字典并将其前置到输出文件，然后使用定制分隔符b“\xff\xff”将序列化的元数据与主体分开。

|‾‾‾‾‾‾‾‾‾‾‾|
|  metadata |
|___________|
|           |
| delimiter |
|___________|
|           |
|   body    |
|___________|

However, this might be an issue since if this particular sequence occurs somewhere in the serialized metadata, the full metadata will not be read and cause errors.

但是，这可能是一个问题，因为如果此特定序列出现在序列化的元数据中的某个位置，则不会读取完整的元数据并导致错误。

更多回答

优秀答案推荐

Using msgpack is a good idea.

使用msgpack是个好主意。

Right after serializing, check the length of the output, and prepend it:

在序列化之后，立即检查输出的长度，并将其添加到前面：

|‾‾‾‾‾‾‾‾‾‾‾|
|  length   |
| (8 bytes) |
|           |
|‾‾‾‾‾‾‾‾‾‾‾|
|  metadata |
|___________|
|           |
|   body    |
|___________|

That's the way most protocols work. Decoding this will then be easier.

这是大多数协议的工作方式。这样一来，破译就更容易了。

If the metadata is potentially too big for memory, you can set the length to 0, write the metadata, and then seek back to change the length.

如果元数据可能太大而无法存储，您可以将长度设置为0，写入元数据，然后返回以更改长度。

A different option would be to escape the delimiter sequence, but that's more complex and won't be as useful in this scenario.

另一种选择是转义分隔符序列，但这更复杂，在此场景中不会那么有用。

更多回答

31

4

0

文章推荐： unity mouse rotation lag(单位鼠标旋转延迟)

java - Vector C++、List 和 Vector Java
我在大学学习C++时学习了这段代码..后来我在C#中使用了同样的东西...但现在我想在Java中使用它...我在互联网上寻找类似的东西，但我什至不知道如何表达它，以便我得到正确的结果。所以嗯，请让我
ruby-on-rails - 事件记录错误 : Couldn't find Customer with 'id' =1 [WHERE (`customers` .`company_id` IS NOT NULL) AND `customers` .`company_id` = ?] RUBY ON RAILS
我正在我的 Ruby on Rails Controller 上运行 RSPEC 测试，这是我正在测试的 Controller 操作: Controller 代码: class Customers::
custom-controls - UITabBarItem 选定的选项卡背景 : Custom?
想为我选择的选项卡设置自定义背景，到目前为止，子类化是我自定义 UITAbBar/UITabBarItem 的方式。问题是:有谁知道(或知道我在哪里可以找到)设置背景的属性是什么？所选选项卡周围有
customization - Hybris 产品数据上的 Hybris Custom WSDTO
您好，我在 commerefacades-beans.xml 中创建了 eProductForm bean，我添加了 ProductData 的自定义属性。然后在commercewebs
mysql - SQL : how do i find customer orders with customers?
我有两个表:1. 客户2. customer_order 客户表包含客户数据(duh)，customer_order 包含所有订单。我可以在 customer.id=customer_order.id
iOS : Customizing TableViewCell - Initializing Custom Cell
在我的 TableView 中，我有一个 NSMutableArray *currList 的数据源 - 它包含对象 Agent 的对象。我创建了自定义的 TableCell 并正确设置了所有内容。我
c# - 是否应该使用自引用通用继承，如 Customer : Entity
是否建议使用自引用泛型继承？ public abstract class Entity { public Guid Id {get; set;} public int Version
customization - 将 custom.ini 与 Grafana 结合使用
我正在尝试为我的 Grafana 安装使用自定义文件 ( custom.ini )。不幸的是，这不起作用。我做了什么: 安装了一台装有 CentOS 7 的虚拟机添加了 Grafana Yum R
java - 自定义类: Custom type cannot be converted to other custom type
我被分配了两个给定类的作业，一个是抽象父类 Lot.java，另一个是测试类 TestLots.java。我不应该编辑其中任何一个。任务是创建Lot的两个子类，使TestLots中的错误不再是错误。
bots - 底部压力 : Custom Content Type and Custom Rendering
我是 Botpress 的新手。我刚刚安装了 Botpress 的最新版本“botpress-ce-v11_0_1-win-x64”。我浏览了文档，发现了一些关于内容类型、内容元素和内容渲染的解释
python - Qt 设计器 : Custom code for custom actions
我一直在四处寻找，但我还没有找到任何东西，除了 Qt3 的旧文档和 qt 设计器的 3.x 版。我会举个例子，并不是因为我的项目是 GPL 而不能提供代码，而是为了简单起见。示例:您正在为您的应用
c# - 流利验证 : set custom message on custom validation
场景我有一个自定义规则来验证订单的运费: public class OrderValidator : BaseValidator { private string CustomInfo {
android - 改造 2 : Custom annotations for custom interceptor
我有用于身份验证的自定义拦截器: @Named("authInterceptor") @Provides @Singleton fun providesAuthIntercep
ruby-on-rails - ruby rails : Custom getter or custom helper
如果有人没有添加照片，我想显示默认头像图像。我假设我需要在模型或助手中执行自定义 getter。如果我做 getter，它会看起来像这样吗: def avatar_url "default_ur
google-custom-search - Google Custom Search API 中的保留字
我正在使用 Google Search API，但遇到了一些麻烦。这个请求(在 Python 中，使用 requests 库)工作正常 res = requests.get("https://www.
custom-keyboard - MSKLC : How to associate a country to a custom keyboard layout
我使用 MSKLC 制作了自定义键盘布局。我以为我仔细按照说明操作了chose appropriate values对于LOCALENAME和 LOCALID参数。但是，在通过按 Win+Spac
java - 调用另一个通用方法的通用方法 - util 方法返回 Class 而不是 Customer
我正在使用 simpleframework解析 XML 字符串并将其转换为对象。 Serializer serializer = new Persister(); try { Customer
c# - MySql查询: get all customers that has their ID in both Customer and Customer_x_Billing table
我正在使用 C# 控制台应用程序从 MySql 数据库获取一些数据，但在正确查询时遇到一些问题现在的情况: SELECT * FROM Customer WHERE EXISTS ( SELECT
objective-c - 滑动 Custome UITableViewCell/Custom UITableViewController 时内存泄漏
我在我的 iPhone 4S 上运行我的应用程序，我正在使用自定义表格 View Controller 和自定义表格 View 单元格，当我将表格 View 向上滑动到空白区域并同样向下滑动到空白区域
javascript - 谷歌标签管理器 : How to use "Custom Javascript" in a "Custom HTML Tag?"
我有一个自定义的 JavaScript 变量，它正在检查 eventAction 是什么，这样我就可以知道是否触发一些转换像素。自定义 Javascript 称为“FacebookConversion

首页

博学

6Ren·AI

商城

Reading and writing sections of a custom binary file format in Python(在Python中读取和写入自定义二进制文件格式的部分)