c++ - std::ofstream - 没有超过 1023 的缓冲字符串(即时刷新)-6ren

c++ - std::ofstream - 没有超过 1023 的缓冲字符串(即时刷新)

转载作者：塔克拉玛干更新时间：2023-11-03 07:38:28

25

4

当我使用 pubsetbuf(...) 更改 ofstream 缓冲区的大小时，一切正常，除非我将 ofstream 设为单个字符串比 1023 长(在下面的代码中)。这是正确的行为还是我做错了什么？

int main(){
    std::vector<char> rawBuf;
    std::ofstream stream;

    rawBuf.resize(20000);
    stream.rdbuf()->pubsetbuf(&rawBuf[0], 20000);

    stream.open("file.txt", std::ios_base::app);

    std::string data(1499, 'b');

    for(int i = 0; i < 10; i++)
    {   
        stream << data.substr(0, 1024) << "\n"; //1023-length string works great
        sleep(1);
    }
    stream.flush();
    stream.close();

    return 0;
}

当有 1024 长度的字符串时 strace ./program 显示如下:

writev(3, [{iov_base=NULL, iov_len=0}, {iov_base="bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb"..., iov_len=1024}], 2) = 1024
nanosleep({tv_sec=1, tv_nsec=0}, 0x7ffcf3889ac0) = 0
writev(3, [{iov_base="\n", iov_len=1}, {iov_base="bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb"..., iov_len=1024}], 2) = 1025
nanosleep({tv_sec=1, tv_nsec=0}, 0x7ffcf3889ac0) = 0
... and so on 10x

当有 1023 长度的字符串时，一切似乎都正常:

nanosleep({tv_sec=1, tv_nsec=0}, 0x7fff8e13a980) = 0
nanosleep({tv_sec=1, tv_nsec=0}, 0x7fff8e13a980) = 0
... 10x

然后:

write(3, "bbbbbbbbbbbbbbbbbbbbbbbbbbbbbbbb"..., 10240) = 10240

为什么这里是单写而之前不是？

编辑:

gcc version 7.3.0 (Ubuntu 7.3.0-16ubuntu3)

最佳答案

根据 [filebuf.virtuals]/12 :

basic_streambuf* setbuf(char_type* s, streamsize n) override;
Effects: If setbuf(0, 0) is called on a stream before any I/O has occurred on that stream, the stream becomes unbuffered. Otherwise the results are implementation-defined. “Unbuffered” means that pbase() and pptr() always return null and output to the file should appear as soon as possible.

“实现定义”包括“工作正常”和“只有一次写入”等。事实上，这就是 libstdc++ 7.3.0 says 的内容:

First, are you sure that you understand buffering? Particularly the fact that C++ may not, in fact, have anything to do with it?

The rules for buffering can be a little odd, but they aren't any different from those of C. (Maybe that's why they can be a bit odd.) Many people think that writing a newline to an output stream automatically flushes the output buffer. This is true only when the output stream is, in fact, a terminal and not a file or some other device -- and that may not even be true since C++ says nothing about files nor terminals. All of that is system-dependent. (The "newline-buffer-flushing only occurring on terminals" thing is mostly true on Unix systems, though.)

Some people also believe that sending endl down an output stream only writes a newline. This is incorrect; after a newline is written, the buffer is also flushed. Perhaps this is the effect you want when writing to a screen -- get the text out as soon as possible, etc -- but the buffering is largely wasted when doing this to a file:
output << "a line of text" << endl;
output << some_data_variable << endl;
output << "another line of text" << endl; 
The proper thing to do in this case to just write the data out and let the libraries and the system worry about the buffering. If you need a newline, just write a newline:
output << "a line of text\n"
 << some_data_variable << '\n'
 << "another line of text\n"; 
I have also joined the output statements into a single statement. You could make the code prettier by moving the single newline to the start of the quoted text on the last line, for example.

If you do need to flush the buffer above, you can send an endl if you also need a newline, or just flush the buffer yourself:
output << ...... << flush;    // can use std::flush manipulator
output.flush();               // or call a member fn 
On the other hand, there are times when writing to a file should be like writing to standard error; no buffering should be done because the data needs to appear quickly (a prime example is a log file for security-related information). The way to do this is just to turn off the buffering before any I/O operations at all have been done (note that opening counts as an I/O operation):
std::ofstream    os;
std::ifstream    is;
int   i;

os.rdbuf()->pubsetbuf(0,0);
is.rdbuf()->pubsetbuf(0,0);

os.open("/foo/bar/baz");
is.open("/qux/quux/quuux");
...
os << "this data is written immediately\n";
is >> i;   // and this will probably cause a disk read 
Since all aspects of buffering are handled by a streambuf-derived member, it is necessary to get at that member with rdbuf(). Then the public version of setbuf can be called. The arguments are the same as those for the Standard C I/O Library function (a buffer area followed by its size).

A great deal of this is implementation-dependent. For example, streambuf does not specify any actions for its own setbuf()-ish functions; the classes derived from streambuf each define behavior that "makes sense" for that class: an argument of (0,0) turns off buffering for filebuf but does nothing at all for its siblings stringbuf and strstreambuf, and specifying anything other than (0,0) has varying effects. User-defined classes derived from streambuf can do whatever they want. (For filebuf and arguments for (p,s) other than zeros, libstdc++ does what you'd expect: the first s bytes of p are used as a buffer, which you must allocate and deallocate.)

A last reminder: there are usually more buffers involved than just those at the language/library level. Kernel buffers, disk buffers, and the like will also have an effect. Inspecting and changing those are system-dependent.

关于c++ - std::ofstream - 没有超过 1023 的缓冲字符串(即时刷新)，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/56849720/

25

4

0

文章推荐： c++ - 我可以声明一个模板只接受具有同质签名的函数吗？

文章推荐： c++ - 如何在两个 boost::intrusive::slist 对象之间传输节点

文章推荐： c++ - 是否有符合标准的方法来确定非静态成员的对齐方式？

c++ - 为什么 `std::common_type_t` 等于 `std::ostream` 而不是 `std::ostream &` ？
我正在开发一个小型图书馆，我需要做的一件事是让访问者访问一些数据并返回结果。在一些较旧的 C++ 代码中，访问者需要声明一个 typedef return_type .例如，boost::stati
c++ - std::map 麻烦
我正在尝试使用std:map类型的键和值制作std::any Visual Studio 2017 std::map m("lastname", "Ivanov"); std::cout (m["la
C++ std::map> 。如何循环设定值？
我已经在 C++ 的 map 中声明了一个集合为 std::map> .如何循环访问或打印设定值？最佳答案如果你知道如何迭代 std::map或 std::set单独地，您应该可以毫无问题地组合迭
C++ 循环 std::vector>
如何循环？我已经试过了: //----- code std::vector >::iterator it; for ( it = users.begin(); it != users.end();
c++ - std::unique_lock 还是 std::lock_guard？
我有两个用例。 A.我想同步访问两个线程的队列。 B.我想同步两个线程对队列的访问并使用条件变量，因为其中一个线程将等待另一个线程将内容存储到队列中。对于用例 A，我看到了使用 std::lock_
c++ - std::trivially_copyable_v 和 std::is_pod_v 之间有什么区别(std::is_standard_layout && std::is_trivial_v)
我正在查看这两种类型特征的文档，但不确定有什么区别。我不是语言律师，但据我所知，它们都适用于“memcpy-able”类型。它们可以互换使用吗？最佳答案不，这些术语不能互换使用。这两个术语都表示
c++ - 为什么我可以有一个 std::vector 而不是 std::vector？
我有以下测试代码，其中有一个参数 fS，它是 ofstream 的容器: #include #include #include #include int
c++ - std::unordered_map
这是这个问题的延续 c++ function ptr in unorderer_map, compile time error 我试图使用 std::function 而不是函数指针，并且只有当函数是

c++ - 将 std::any_of、std::all_of、std::none_of 等与 std::map 一起使用
std::unordered_map str_bool_map = { {"a", true}, {"b", false}, {"c", true} }; 我们可以在此映射上使
c++ - 使用 std::find 检查 std::vector> 中的项目
我有以下对象 std::vector> vectorList; 然后我添加到这个使用 std::vector vec_tmp; vec_tmp.push_back(strDRG); vec_tmp.p
c++ - 为什么 std::initializer_list 不支持 std::get<>、std::tuple_size 和 std::tuple_element
为什么 std::initializer_list不支持std::get<> , std::tuple_size和 std::tuple_element ？在constexpr中用得很多现在的表达式，
c++ - std::tuple 和 std::tuple 是否被 std::vector 视为同一类型？
我有一个像这样定义的变量 auto drum = std::make_tuple ( std::make_tuple ( 0.3f , Ex
c++ :将 std::map 转换为 std::map
假设我有一个私有(private)std::map在我的类(class)里std::map 。我怎样才能将其转换为std::map返回给用户？我想要下面的原型(prototype) const std
c++ :将 std::map 转换为 std::map
假设我有一个私有(private)std::map在我的类(class)里std::map 。我怎样才能将其转换为std::map返回给用户？我想要下面的原型(prototype) const std
c++ - 在带有 std::ref 的 std::thread 中使用地址清理调用 std::invoke(std::forward(...)) 时的奇怪行为
问题我正在尝试将 lambda 闭包传递给 std::thread，它使用任意封闭参数调用任意封闭函数。 template std::thread timed_thread(Function&& f
c++ - 具有模板模板参数的模板定义，可以专门化为类，例如，std::vector 或 std::map
我想创建一个模板类，可以容纳容器和容器的任意组合。例如，std::vector或 std::map ，例如。我尝试了很多组合，但我必须承认模板的复杂性让我不知所措。我编译的关闭是这样的: templ
c++ - 将 std::vector> 分配给另一个 std::vector>
我有一个 std::vector>我将其分配给相同类型的第二个 vector 。我收到这个编译器错误: /opt/gcc-8.2.0/include/c++/8.2.0/bits/stl_algob
c++ - 将 std::vector> 移动到 std::vector>
有时候，我们有一个工厂可以生成一个 std::unique_ptr vector ，后来我们想在类/线程/你命名的之间共享这些指针。因此，最好改用 std::shared_ptr 。当然有一种方法可以
c++ - 为什么 std::sort 假定 std::vector< std::vector> 默认为 std::vector，从而产生错误的结果？
这个问题在这里已经有了答案: Sorting a vector of custom objects (14 个答案) 关闭 6 年前。我创建了一个 vector vector ，我想根据我定义的参
c++ - 将 std::vector> 转换为 std::vector>
我有三个类(class)成员: public: std::vector > getObjects(); std::vector > getObjects() const; privat

首页

博学

6Ren·AI

商城

c++ - std::ofstream - 没有超过 1023 的缓冲字符串(即时刷新)