python - 将 .readinto(b) 方法添加到 tarfile 的 ExFileObject 中？-6ren

python - 将 .readinto(b) 方法添加到 tarfile 的 ExFileObject 中？

转载作者：太空宇宙更新时间：2023-11-03 18:49:02

26

4

因此，我尝试迭代 tar 中的多个文件，然后将该数据加载到一些 ctype structures's I've defined 中。。这对于非 tar 文件工作得很好，但后来我发现 ExFileObject tarfile 的 extractfile(member) 方法返回的内容不支持 .readinto(b) 方法。

所以现在这就是我正在做的事情:

import os
import tarfile
import io
from ctypes import c_uint, c_char, c_ubyte, c_ushort, BigEndianStructure

class MyStructure(BigEndianStructure):
    _pack_ = True
    _fields_ = [
                ("id", c_uint), # 4 bytes
                ("namefield", c_char * 32), # 32 bytes
                ("timestamp", c_ubyte * 4), # 4 bytes
                ("payload_length", c_ushort), # 2 bytes
                ]  


def process_tar(tar_files):
    """
    untar and return file objects to be parsed
    """
    for filepath in tar_files:
        f = os.path.abspath(filepath)
        with tarfile.open(f, 'r:*') as tar_f:
            #tar_f.fileobject = io.BufferedReader
            for tarinfo_member in tar_f.getmembers():
                if tarinfo_member.isfile():
                    yield tar_f.extractfile(tarinfo_member)

f = "somefiles.tar.gz"                
for tar_member_fileobj in process_tar([f]):
    mystruct = MyStructure()
    tar_member_fileobj.readinto(mystruct)

得到这个:

---------------------------------------------------------------------------  AttributeError                            Traceback (most recent call last) <ipython-input-4-257ee4b46c31> in <module>()
     29 for tar_member_fileobj in process_tar([f]):
     30     mystruct = MyStructure()
---> 31     tar_member_fileobj.readinto(mystruct)

AttributeError: 'ExFileObject' object has no attribute 'readinto'

有没有办法可以将此方法添加到 ExFileObject 中？或者，是否有另一种方法可以轻松地将我的数据加载到我定义的 ctypes 结构中？我注意到在 tarfile 对象中，您似乎可以将 fileobject 设置为用于返回的 tarinfo 文件，但仅交换 io.BufferedReader 似乎不起作用。

(我尝试将 ExFileObject 读入 StringIO，但它似乎也没有正确实现 readinto()...我想我可以 extractall() 到内存中的文件空间并将文件作为标准文件对象重新打开，但我想避免这种情况，因为这样我就需要额外的配置)

最佳答案

ExFileObject 无法使用 readinto 方法，但您仍然可以通过使用 read 读取文件头并复制使用 memmove 将数据放入结构中:

memmove(byref(mystruct),
        tar_member_fileobj.read(sizeof(mystruct)),
        sizeof(mystruct)
)

例如:

import os
import tarfile
import io
from ctypes import *

class MyStructure(BigEndianStructure):
    _pack_ = True
    _fields_ = [
                ("id", c_uint), # 4 bytes
                ("namefield", c_char * 32), # 32 bytes
                ("timestamp", c_ubyte * 4), # 4 bytes
                ("payload_length", c_ushort), # 2 bytes
                ]  


def process_tar(tar_files):
    """
    untar and return file objects to be parsed
    """
    for filepath in tar_files:
        f = os.path.abspath(filepath)
        with tarfile.open(f, 'r:*') as tar_f:
            #tar_f.fileobject = io.BufferedReader
            for tarinfo_member in tar_f.getmembers():
                if tarinfo_member.isfile():
                    yield tar_f.extractfile(tarinfo_member)



# test tar.gz file
open('somefiles.tar.gz', 'wb').write(
'\x1f\x8b\x08\x08\xad\x5c\x34\x52\x02\x00\x66\x6f\x6f\x2e\x74\
\x61\x72\x00\xed\xca\x49\x0a\x83\x40\x14\x04\xd0\x7f\x85\xdc\
\xa0\x43\x92\x6d\xf8\xdd\xa6\xed\x6d\xe6\xc9\x79\x1e\x76\x82\
\x0a\x82\x4b\xef\x8f\xf6\x42\xf0\x02\x6e\xe4\xbf\x4d\x41\x55\
\xb5\x5d\xdf\x9c\xeb\x6a\xa8\x60\x3d\x1c\x51\x29\xc5\x80\x69\
\xb8\xc8\x99\x14\x8c\x0b\xc1\x25\x4a\x53\x5c\x4c\x5d\x28\x9c\
\xfe\x08\x64\x6d\xbb\xc9\xed\xfe\x78\xbe\xde\x9f\xef\xef\x6f\
\xd9\x8e\xeb\xf9\x41\x18\xc5\x49\x9a\xe5\x45\xb9\xbf\x1e\x8e\
\x27\xd0\xbb\x61\x00\x21\x84\x90\x0d\x19\x01\xd4\xe8\x88\xcf\
\x00\x08\x00\x00'
)

f = "somefiles.tar.gz"                
for tar_member_fileobj in process_tar([f]):
    mystruct = MyStructure()

    memmove(byref(mystruct),
            tar_member_fileobj.read(sizeof(mystruct)),
            sizeof(mystruct)
    )

    print hex(mystruct.id)
    print mystruct.namefield
    print ''.join(map(chr, mystruct.timestamp))
    print hex(mystruct.payload_length)

字符串 \x1f\x8b\x08\x08\xad ... 是一个 tar.gz 文件，包含:

\x11\x11\x11\x11ABCDEFGHIJKLMNOPQRSTUVWXYZ!@#$%\x00ABCD33

将该数据复制到 mystruct 后，它应该打印:

0x11111111L
ABCDEFGHIJKLMNOPQRSTUVWXYZ!@#$%
ABCD
0x3333

关于python - 将 .readinto(b) 方法添加到 tarfile 的 ExFileObject 中？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/18791003/

26

4

0

文章推荐： python - 使用听写还是上课？哪个更快？

文章推荐： assembly - 在 Emacs Lisp 中使用 asm-mode 识别 C++ 风格的注释

文章推荐： emacs - 使 Windows 路径在 Emacs Lisp 中有用

文章推荐： python - 用于返回像素邻域的 Ndimage 或 skimage 函数

没有完整路径的python tarfile
我制作了一个如下所示的小脚本来读取文件组并将它们压缩，一切正常，接受压缩文件在解压缩时包含文件的完整路径。有没有没有目录结构的方法？ compressor = tarfile.open(PATH_TO
Python tarfile 压缩内存中的一个对象
我正在尝试使用 tarfile 在内存中添加一个文件，然后将其写回磁盘，但我遇到的问题是在我的最终输出中，当我提取新创建的 tar.gz 文件时，我得到一个空文件。我的代码哪里做错了？ import
Python:tarfile 流
我想从压缩包中读取一些文件并将其保存到新的压缩包中。这是我写的代码。 archive = 'dum/2164/archive.tar' # Read input data. input_tar = t
Python tarfile 大小
我可以用这种方式计算 tarfile 中文件的大小: import tarfile tf = tarfile.open(name='my.tgz', mode='r') reduce(lambda x
python - Tarfile 在第一个常规文件后停止
我有 .tar.bz2 文件，里面有很多小的 json 文件。一个存档可能有大约数千个，而且 json 很小(低于 10kB，通常也低于 1 KB)。因此，压缩后的单个存档不会超过 100kB。根据
Python tarfile 不解压
我有以下文件: # ls -lha total 2.4M drwxr-xr-x. 2 root root 4.0K Nov 26 19:47 . drwxrwxr-x. 5 bshus
python tarfile 将tar写入管道
我想创建一个 tar 文件并将其通过管道传输到 http 上传。但是，似乎 python tarfile 模块执行搜索，这使得无法通过管道传输到下一个进程。这是代码 tar = tarfile.o
Python tarfile 进度
在将文件添加到 python 中的 tar 存档时，是否有任何库可以显示进度，或者可以扩展 tarfile 模块的功能来执行此操作？在理想情况下，我想展示 tar 创建的总体进度以及关于何时完成的预
python - 将空目录添加到 tarfile
如何在 python 中将空目录添加到 tarfile，而不先在磁盘上创建它？在我的本地文件系统中创建一个空目录，并将其添加到 tar 文件中很容易，但会产生不必要的开销。直接在 tar 文件中创
Python tarfile 和排除
这是 Python 文档的摘录: If exclude is given it must be a function that takes one filename argument and retu
python - 如何从生成器读取 tarfile？
Create a zip file from a generator in Python?描述了将一堆文件写入 .zip 到磁盘的解决方案。我在相反的方向也有类似的问题。我得到了一个发电机: str
Python tarfile 进度输出？
我正在使用以下代码提取一个 tar 文件: import tarfile tar = tarfile.open("sample.tar.gz") tar.extractall() tar.close(
python - 语句 : with and tarfile
我尝试使用 with 语句和 tarfile 模块... with tarfile.open('/dir/dir/dir.tar.gz', 'w:gz') as fl: fl.add('/di
python - 无法将函数并行映射到 tarfile 成员
我有一个包含 bz2 压缩文件的 tarfile。我想将函数 clean_file 应用于每个 bz2 文件，并整理结果。在系列中，这很容易用一个循环: import pandas as pd imp
python - 添加到 tarfile 的文件返回为空文件
我正在尝试将文件添加到 python 中的 gzipped tarfile import tarfile # create test file with open("testfile.txt", "w
python - 使用单个文件名过滤 tarfile.add
使用 tarfile.add 添加目录时，是否可以访问 filter lambda 中的各个文件名？我正在使用 tarfile 模块来创建项目目录的存档。其中一些文件我不再需要，我想忽略: mypr
python - tarfile.open() 未提取到正确的目录路径
我正在尝试将 tar.gz 文件中的所有内容提取到同一目录中。以下代码可以提取所有文件，但文件存储在工作目录中，而不是我输入的名称路径中。 import tarfile zip_rw_data = r
Python tarfile 模块在提取过程中覆盖现有文件 - 如何禁用它？
是否有办法防止 tarfile.extractall ( API ) 覆盖现有文件？我所说的“防止”是指理想情况下在即将发生覆盖时引发异常。当前的行为是静默覆盖文件。最佳答案您可以查看 tar
python - 在 tarfile 中区分来自不同驱动器的文件
我正在尝试使用 tarfile 库归档和压缩分布在多个驱动器上的多个目录。问题是即使两个文件存储在不同的驱动器中，tarfile 也会合并路径。例如: import tarfile with tarf
Python:为 tarfile 分配压缩级别
我的问题是对这个 one 的跟进.我想知道如何修改以下代码以便分配压缩级别: import os import tarfile home = '//global//scratch//chamar//p

首页

博学

6Ren·AI

商城

python - 将 .readinto(b) 方法添加到 tarfile 的 ExFileObject 中？