java - 使用 apache common compress/org.tukaani.xz 在 java 中解码 LZMA 压缩 zip 文件的问题-6ren

java - 使用 apache common compress/org.tukaani.xz 在 java 中解码 LZMA 压缩 zip 文件的问题

转载作者：行者123 更新时间：2023-12-01 06:10:35

获取 org.tukaani.xz.UnsupportedOptionsException:未压缩的大小太大 尝试解码 LZMA 压缩 xls 文件时出错。而非 LZMA 文件解压/解码没有任何问题。这两种情况都在压缩相同的 xls 文件。

我正在使用 Apache commons compress 和 org.tukaani.xz。

引用示例代码

package com.concept.utilities.zip;

import java.io.File;
import java.io.IOException;
import java.io.InputStream;

import org.apache.commons.compress.archivers.zip.ZipArchiveEntry;
import org.apache.commons.compress.archivers.zip.ZipFile;
import org.apache.commons.compress.compressors.lzma.LZMACompressorInputStream;

public class ApacheComm {

    public void extractLZMAZip(File zipFile, String compressFileName, String destFolder) {

        ZipFile zip = null;
        try {

            zip = new ZipFile(zipFile);
            ZipArchiveEntry zipArchiveEntry = zip.getEntry(compressFileName);
            if (null != zipArchiveEntry) {
                String name = zipArchiveEntry.getName();

                // InputStream is = zip.getInputStream(zipArchiveEntry);
                InputStream israw = zip.getRawInputStream(zipArchiveEntry);

                LZMACompressorInputStream lzma = new LZMACompressorInputStream(israw);
            }

        } catch (IOException e) {
            e.printStackTrace();
        } finally {
            if (null != zip)
                ZipFile.closeQuietly(zip);
        }
    }

    public static void main(String[] args) throws IOException {

        ApacheComm c = new ApacheComm();
        try {
            c.extractLZMAZip(new File("H:\\archives\\rollLZMA.zip"), "roll.xls", "H:\\archives\\");
        } catch (Exception e) {
            e.printStackTrace();
        }

    }

}

错误

org.tukaani.xz.UnsupportedOptionsException: Uncompressed size is too big
    at org.tukaani.xz.LZMAInputStream.initialize(Unknown Source)
    at org.tukaani.xz.LZMAInputStream.<init>(Unknown Source)
    at org.apache.commons.compress.compressors.lzma.LZMACompressorInputStream.<init>(LZMACompressorInputStream.java:50)
    at com.concept.utilities.zip.ApacheComm.extractLZMAZip(ApacheComm.java:209)
    at com.concept.utilities.zip.ApacheComm.main(ApacheComm.java:224)

我错过了什么吗？有没有其他方法可以解码 使用压缩方法的 zip 文件 = LZMA

最佳答案

您的代码不起作用的原因是 Zip LZMA 压缩数据段与普通压缩 LZMA 文件相比具有不同的 header 。
您可以在 https://pkware.cachefly.net/webdocs/casestudies/APPNOTE.TXT 阅读规范。 (4.4.4 通用位标志，5.8 LZMA - 方法 14)，但引用重要部分:

5.8.5 [...] The LZMA Compressed Data Segment will consist of an LZMA Properties Header followed by the LZMA Compressed Data as shown:
[LZMA properties header for file 1]
[LZMA compressed data for file 1]
[...]

5.8.8 Storage fields for the property information within the LZMA Properties Header are as follows:
LZMA Version Information 2 bytes
LZMA Properties Size 2 bytes
LZMA Properties Data variable, defined by "LZMA Properties Size"
5.8.8.1 LZMA Version Information - this field identifies which version of the LZMA SDK was used to compress a file. The first byte will store the major version number of the LZMA SDK and the second byte will store the minor number.

5.8.8.2 LZMA Properties Size - this field defines the size of the remaining property data. Typically this size SHOULD be determined by the version of the SDK. This size field is included as a convenience and to help avoid any ambiguity arising in the future due to changes in this compression algorithm.

5.8.8.3 LZMA Property Data - this variable sized field records the required values for the decompressor as defined by the LZMA SDK. The data stored in this field SHOULD be obtained using the WriteCoderProperties() in the version of the SDK defined by the "LZMA Version Information" field.

代码示例:

import org.apache.commons.compress.archivers.zip.ZipArchiveEntry;
import org.apache.commons.compress.archivers.zip.ZipFile;
import org.apache.commons.compress.archivers.zip.ZipMethod;
import org.apache.commons.io.IOUtils;
import org.tukaani.xz.LZMAInputStream;

import java.io.IOException;
import java.io.InputStream;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

public class ApacheComm
{
    public InputStream getInputstreamForEntry(ZipFile zipFile, ZipArchiveEntry ze) throws IOException
    {
        if (zipFile.canReadEntryData(ze))
        {
            return zipFile.getInputStream(ze);
        } else if (ze.getMethod() == ZipMethod.LZMA.getCode()) {
            InputStream inputStream = zipFile.getRawInputStream(ze);
            ByteBuffer buffer = ByteBuffer.wrap(IOUtils.readFully(inputStream, 9))
                    .order(ByteOrder.LITTLE_ENDIAN);

            // Lzma sdk version used to compress this data
            int majorVersion = buffer.get();
            int minorVersion = buffer.get();

            // Byte count of the following data represent as an unsigned short.
            // Should be = 5 (propByte + dictSize) in all versions
            int size = buffer.getShort() & 0xffff;
            if (size != 5)
                throw new UnsupportedOperationException();

            byte propByte = buffer.get();

            // Dictionary size is an unsigned 32-bit little endian integer.
            int dictSize = buffer.getInt();

            long uncompressedSize;
            if ((ze.getRawFlag() & (1 << 1)) != 0)
            {
                // If the entry uses EOS marker, use -1 to indicate
                uncompressedSize = -1;
            } else {
                uncompressedSize = ze.getSize();
            }

            return new LZMAInputStream(inputStream, uncompressedSize, propByte, dictSize);
        } else {
            throw new UnsupportedOperationException();
        }
    }
}

关于java - 使用 apache common compress/org.tukaani.xz 在 java 中解码 LZMA 压缩 zip 文件的问题，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/45213793/

文章推荐： python - pyssh 0.3 的新副本中存在语法错误？ (PYTHON)

文章推荐： java - 如何制作显示 x 轴和 y 轴值的弹出窗口

文章推荐： java - 禁用 sphinx4 日志消息

文章推荐： python - Django 管理员自动保存

android - 找不到类 'org.tukaani.xz.LZMAInputStream'
我使用库 apache commons compress 1.9 和 x.z-1.4 来提取 7zip 文件。我在2个过程中使用了它。首先，我通过WIFI下载7zip文件，下载完成后，我解压它，它成功
java - 使用 apache common compress/org.tukaani.xz 在 java 中解码 LZMA 压缩 zip 文件的问题
获取 org.tukaani.xz.UnsupportedOptionsException:未压缩的大小太大尝试解码 LZMA 压缩 xls 文件时出错。而非 LZMA 文件解压/解码没有任何问题。
java - 使用 apache compress/org.tukaani.xz 在 java 中解压/解密受密码保护 (AES 256) 7z 文件的问题
尝试解密受密码保护的 (AES 256) 7z 文件时出现org.tukaani.xz.CorruptedInputException:压缩数据已损坏错误。而没有密码保护的 7z 文件解压没有任何问题

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

java - 使用 apache common compress/org.tukaani.xz 在 java 中解码 LZMA 压缩 zip 文件的问题