
c# - Why does C#'s MemoryStream reserve so much memory?

Reposted. Author: IT王子. Updated: 2023-10-28 23:28:55

Our software decompresses certain byte data through a GZipStream, which reads the data from a MemoryStream. The data is decompressed in 4 KB blocks and written to another MemoryStream.

We have noticed that the memory allocated by the process is far higher than the actual decompressed data.

Example: a compressed byte array of 2,425,536 bytes is decompressed to 23,050,718 bytes. The memory profiler we use shows that the method MemoryStream.set_Capacity(Int32 value) allocated 67,104,936 bytes. That is a factor of 2.9 between reserved and actually written memory.

Note: MemoryStream.set_Capacity is called from MemoryStream.EnsureCapacity, which in turn is called from MemoryStream.Write in our function.

Why does MemoryStream reserve so much capacity, even though it only appends blocks of 4 KB?

Here is the code snippet that decompresses the data:

private byte[] Decompress(byte[] data)
{
    using (MemoryStream compressedStream = new MemoryStream(data))
    using (GZipStream zipStream = new GZipStream(compressedStream, CompressionMode.Decompress))
    using (MemoryStream resultStream = new MemoryStream())
    {
        byte[] buffer = new byte[4096];
        int iCount = 0;

        while ((iCount = zipStream.Read(buffer, 0, buffer.Length)) > 0)
        {
            resultStream.Write(buffer, 0, iCount);
        }
        return resultStream.ToArray();
    }
}

Note: in case it is relevant, this is the system configuration:

  • Windows XP 32-bit,
  • .NET 3.5
  • Compiled with Visual Studio 2008

Best Answer

Because this is the algorithm for how it expands its capacity:

public override void Write(byte[] buffer, int offset, int count) {

    //... Removed error checking for brevity

    int i = _position + count;
    // Check for overflow
    if (i < 0)
        throw new IOException(Environment.GetResourceString("IO.IO_StreamTooLong"));

    if (i > _length) {
        bool mustZero = _position > _length;
        if (i > _capacity) {
            bool allocatedNewArray = EnsureCapacity(i);
            if (allocatedNewArray)
                mustZero = false;
        }
        if (mustZero)
            Array.Clear(_buffer, _length, i - _length);
        _length = i;
    }

    //...
}

private bool EnsureCapacity(int value) {
    // Check for overflow
    if (value < 0)
        throw new IOException(Environment.GetResourceString("IO.IO_StreamTooLong"));
    if (value > _capacity) {
        int newCapacity = value;
        if (newCapacity < 256)
            newCapacity = 256;
        if (newCapacity < _capacity * 2)
            newCapacity = _capacity * 2;
        Capacity = newCapacity;
        return true;
    }
    return false;
}

public virtual int Capacity
{
    //...

    set {
        //...

        // MemoryStream has this invariant: _origin > 0 => !expandable (see ctors)
        if (_expandable && value != _capacity) {
            if (value > 0) {
                byte[] newBuffer = new byte[value];
                if (_length > 0) Buffer.InternalBlockCopy(_buffer, 0, newBuffer, 0, _length);
                _buffer = newBuffer;
            }
            else {
                _buffer = null;
            }
            _capacity = value;
        }
    }
}

So every time the capacity limit is hit, the capacity is doubled. The reason for this is that the Buffer.InternalBlockCopy operation is slow for large arrays, so if it had to resize frequently on every Write call, performance would drop significantly.
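The doubling policy also explains the 67 MB number the profiler reports: each resize allocates a brand-new array, and the profiler attributes every one of those allocations to set_Capacity. Growing from 256 bytes by doubling until the capacity covers the 23,050,718 decompressed bytes from the question allocates 256 + 512 + … + 33,554,432 ≈ 67 MB in total, even though the final buffer is only 32 MB. A minimal sketch of that arithmetic (the byte counts are taken from the question above):

```csharp
using System;

class CapacityGrowthDemo
{
    // Simulate MemoryStream.EnsureCapacity's doubling policy and sum
    // every intermediate buffer allocation, mirroring what a memory
    // profiler attributes to MemoryStream.set_Capacity.
    static void Main()
    {
        long finalLength = 23_050_718; // decompressed size from the question
        long capacity = 0;
        long totalAllocated = 0;

        while (capacity < finalLength)
        {
            long next = Math.Max(256, capacity * 2);
            totalAllocated += next;   // each resize allocates a new array
            capacity = next;
        }

        Console.WriteLine($"Final capacity:  {capacity:N0}");        // 33,554,432
        Console.WriteLine($"Total allocated: {totalAllocated:N0}");  // 67,108,608
    }
}
```

The simulated total, 67,108,608 bytes, is within a few kilobytes of the 67,104,936 bytes the profiler measured, so the "2.9× overhead" is simply the sum of all the doubling steps, not a single oversized reservation.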

There are a couple of things you can do to improve performance: set the initial capacity to at least the size of the compressed array, and grow the capacity by a factor smaller than 2.0 to reduce the amount of memory you are using.

const double ResizeFactor = 1.25;

private byte[] Decompress(byte[] data)
{
    using (MemoryStream compressedStream = new MemoryStream(data))
    using (GZipStream zipStream = new GZipStream(compressedStream, CompressionMode.Decompress))
    using (MemoryStream resultStream = new MemoryStream((int)(data.Length * ResizeFactor))) // Initial size = compressed size + 25%.
    {
        byte[] buffer = new byte[4096];
        int iCount = 0;

        while ((iCount = zipStream.Read(buffer, 0, buffer.Length)) > 0)
        {
            if (resultStream.Capacity < resultStream.Length + iCount)
                resultStream.Capacity = (int)(resultStream.Capacity * ResizeFactor); // Resize to 125% instead of 200%.

            resultStream.Write(buffer, 0, iCount);
        }
        return resultStream.ToArray();
    }
}

If you want, you can use an even fancier algorithm, such as resizing based on the current compression ratio:

const double MinResizeFactor = 1.05;

private byte[] Decompress(byte[] data)
{
    using (MemoryStream compressedStream = new MemoryStream(data))
    using (GZipStream zipStream = new GZipStream(compressedStream, CompressionMode.Decompress))
    using (MemoryStream resultStream = new MemoryStream((int)(data.Length * MinResizeFactor))) // Initial size = compressed size + the minimum resize factor.
    {
        byte[] buffer = new byte[4096];
        int iCount = 0;

        while ((iCount = zipStream.Read(buffer, 0, buffer.Length)) > 0)
        {
            if (resultStream.Capacity < resultStream.Length + iCount)
            {
                // The +1 prevents divide-by-zero errors; it may not be necessary in practice.
                double sizeRatio = ((double)resultStream.Position + iCount) / (compressedStream.Position + 1);

                // Resize to the current capacity times the minimum resize factor, or the
                // compressed stream length times the compression ratio + the minimum resize
                // factor, whichever is larger.
                resultStream.Capacity = (int)Math.Max(resultStream.Capacity * MinResizeFactor,
                                                      (sizeRatio + (MinResizeFactor - 1)) * compressedStream.Length);
            }

            resultStream.Write(buffer, 0, iCount);
        }
        return resultStream.ToArray();
    }
}
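There is also a way to avoid resizing altogether. The gzip format (RFC 1952) stores the original uncompressed length modulo 2^32 in the last four bytes of the stream, little-endian (the ISIZE field). When the input is a complete, single-member gzip buffer smaller than 2 GB, you can read that trailer and size the result stream exactly. This is a sketch under those assumptions, not something the answer above proposed; the helper names are illustrative:

```csharp
using System;
using System.IO;
using System.IO.Compression;

static class GzipSizeHelper
{
    // Reads the ISIZE trailer (RFC 1952): the uncompressed length modulo
    // 2^32, stored little-endian in the last 4 bytes of a gzip stream.
    // Valid for complete single-member buffers under 2 GB.
    public static int GetUncompressedSize(byte[] data)
    {
        return BitConverter.ToInt32(data, data.Length - 4);
    }

    public static byte[] Decompress(byte[] data)
    {
        int size = GetUncompressedSize(data);
        using (MemoryStream compressedStream = new MemoryStream(data))
        using (GZipStream zipStream = new GZipStream(compressedStream, CompressionMode.Decompress))
        using (MemoryStream resultStream = new MemoryStream(size)) // exact capacity: no resizing, no copying
        {
            byte[] buffer = new byte[4096];
            int count;
            while ((count = zipStream.Read(buffer, 0, buffer.Length)) > 0)
                resultStream.Write(buffer, 0, count);
            return resultStream.ToArray();
        }
    }
}
```

With the exact size known up front, set_Capacity is called once by the constructor and the total allocation equals the decompressed size, instead of roughly 2.9 times it.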

Regarding "c# - Why does C#'s MemoryStream reserve so much memory?", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/24636259/
