C99 标准 - fprintf - s 精确转换-6ren

C99 标准 - fprintf - s 精确转换

转载作者：行者123 更新时间：2023-12-04 09:35:55

24

4

假设只有 C99 Standard paper 和 printf 库函数需要根据此标准实现以使用 UTF-16 编码，您能否阐明指定精度的 s 转换的预期行为？

s 转换的 C99 标准 (7.19.6.1) 说:

If no l length modifier is present, the argument shall be a pointer to the initial element of an array of character type. Characters from the array are written up to (but not including) the terminating null character. If the precision is specified, no more than that many bytes are written. If the precision is not specified or is greater than the size of the array, the array shall contain a null character.

If an l length modifier is present, the argument shall be a pointer to the initial element of an array of wchar_t type. Wide characters from the array are converted to multibyte characters (each as if by a call to the wcrtomb function, with the conversion state described by an mbstate_t object initialized to zero before the first wide character is converted) up to and including a terminating null wide character. The resulting multibyte characters are written up to (but not including) the terminating null character (byte). If no precision is specified, the array shall contain a null wide character. If a precision is specified, no more than that many bytes are written (including shift sequences, if any), and the array shall contain a null wide character if, to equal the multibyte character sequence length given by the precision, the function would need to access a wide character one past the end of the array. In no case is a partial multibyte character written.

我不太理解这一段，特别是“如果指定了精度，则写入的字节数不多于”这句话。

例如，我们以 UTF-16 字符串“TEST”(字节序列:0x54、0x00、0x45、0x00、0x53、0x00、0x54、0x00)为例。

在以下情况下，期望写入输出缓冲区的内容:

如果精度为3
如果精度为9(比字符串长度多一个字节)
如果精度为12(比字符串长度多几个字节)

然后还有“数组中的宽字符被转换为多字节字符”。这是否意味着应该先将 UTF-16 转换为 UTF-8？如果我希望仅使用 UTF-16，这会很奇怪。

最佳答案

^{将评论转换为稍微扩展的答案。}

CHAR_BIT 的值是多少？在您的实现中？

如果 CHAR_BIT == 8 , 你不能用 %s 处理 UTF-16 ;你会用 %ls然后你会通过 wchar_t *作为相应的参数。然后您必须阅读规范的第二段。
如果 CHAR_BIT == 16 ，那么数据中的八位字节数不能为奇数。然后你需要知道如何wchar_t涉及 char (它们大小相同吗？它们是否具有相同的符号？)并解释这两段以产生统一的效果——除非你决定使用 wchar_t表示 UTF-32。

关键是如果 CHAR_BIT == 8 则 UTF-16 不能作为 C 字符串处理因为有太多有用的字符被编码为一个字节为零，但那些零字节标记了一个以 null 结尾的字符串的结尾。要处理 UTF-16，可以是普通的 char类型必须是 16 位(或更大)类型(所以 CHAR_BIT > 8 )，或者您必须使用 wchar_t (和 sizeof(wchar_t) > sizeof(char) )。

请注意，规范要求将宽字符转换为合适的多字节表示形式。

如果你想要本地输出宽字符，你必须使用fwprintf()和来自 <wchar.h> 的相关功能，首先在 C99 中定义。那里的规范与 fprintf() 的规范有很多共同之处。，但存在(不出所料)重要差异。

7.29.2.1 The fwprintf function

…

s
If no l length modifier is present, the argument shall be a pointer to the initial element of a character array containing a multibyte character sequence beginning in the initial shift state. Characters from the array are converted as if by repeated calls to the mbrtowc function, with the conversion state described by an mbstate_t object initialized to zero before the first multibyte character is converted, and written up to (but not including) the terminating null wide character. If the precision is specified, no more than that many wide characters are written. If the precision is not specified or is greater than the size of the converted array, the converted array shall contain a null wide character.

If an l length modifier is present, the argument shall be a pointer to the initial element of an array of wchar_t type. Wide characters from the array are written up to (but not including) a terminating null wide character. If the precision is specified, no more than that many wide characters are written. If the precision is not specified or is greater than the size of the array, the array shall contain a null wide character.

关于C99 标准 - fprintf - s 精确转换，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/39686771/

24

4

0

文章推荐： r - 如何使用R将数字提取为数字？

文章推荐： r - 未使用的参数(键 = "iris.hex")

文章推荐： scala - 将字符串列表转换为Map [String，List]

c - C 标准 I/O 的限制以及为什么我们不能将 C 标准 I/O 与套接字一起使用
我最近在读 CSAPP。在 10.9 节中，它说标准 I/O 不应该与 socket 一起使用，原因如下: (1) The restrictions of standard I/O Restricti
(标准？)问题的编码设计
似乎是一个足够标准的问题，可以保证解决方案中的标准设计: 假设我想在文件中写入 x+2(或更少)个字符串。 x 字符串构成一个部分的内容，这两个字符串构成该部分的页眉和页脚。要注意的是，如果内容中没有
标准&有效的项目开发流程
代码版本管理在项目中，代码的版本管理非常重要。每个需求版本的代码开发在版本控制里都应该经过以下几个步骤。在master分支中拉取该需求版本的两个分支，一个feature分支，
查找不同值计数的 Hibernate 标准
我有以下sql查询，我需要获取相应的hibernate条件查询 SELECT COUNT(DISTINCT employee_id) FROM erp_hr_payment WHERE payment
Python MixIn 标准
所以我正在编写一些代码，并且最近遇到了实现一些 mixin 的需要。我的问题是，设计混音的正确方法是什么？我将使用下面的示例代码来说明我的确切查询。 class Projectile(Movable,
linux - 标准 |用包含双引号的环境变量替换值
我的环境变量包含如下双引号: $echo $CONNECT_SASL_JAAS_CONFIG org.apache.kafka.common.security.plain.PlainLoginModu
设置可选参数默认值的 phpdoc 标准？
示例: /** * This function will determine whether or not one string starts with another string. * @pa
不区分大小写的 Grails 标准
有没有办法在 Grails 中做一个不区分大小写的 in 子句？我有这个: "in"("name", filters.tags) 我希望它忽略大小写。我想我可以做一个 sqlRestriction
c++ - Boost库成为C++标准
我搜索了很长时间，以查找将哪些boost库添加到std库中，但是我只找到了一个新库的完整列表(如此处:http://open-std.org/jtc1/sc22/wg21/docs/library_t
vba - 在自动筛选中组合多个排除 (<>) 标准
我已经通过使用这个肮脏的黑客解决了我的问题: ' Filter managerial functions ActiveSheet.Range("$A$1:$BW$2211").Auto
Java:Applet 标准
因此，我很难理解我需要遵循的标准，以便我的 Java 程序能够嵌入 HTML。我是否只需将我的主类扩展到 Applet 类，或者我还需要做更多的事情吗？另外，在我见过的每个 Applet 示例中，它都
hibernate 标准。限制和最大日期
我对在 Hibernate 中使用限制有疑问。我必须创建条件，设置一些限制，然后选择日期字段最大值的记录: Criteria query = session.createCriteria(Stora
休眠+标准。如何在没有一个引用实体的情况下返回结果
我有标准: ICriteria criteria = Session.CreateCriteria() .SetFetchMode("Entity1", FetchMo
用于选择具有空子集合的所有实体的 nHibernate 标准
我很难编写条件来选择所有子集合或孙集合为空的实体。我可以将这些作为单独的条件来执行，但我无法将其组合成一个条件。类结构: public class Component { p
多对多实体属性的 Hibernate 标准
@Entity class A { @ManyToMany private List list; ... } @Entity class B { ... } 我想使用条件(不是 sql 查询)从 A
java hibernate 标准
我的数据库中有以下表结构: Table A: Table B: Table C: _______________
java - 标准(不适用)
请帮助我: 我有下一张 table : 单位 ID 姓名用户 ID 姓名利率单位 ID 用户 ID 我不明白如何从 SQL 创建正确的条件结构: 代码: SELECT * FROM Unit W
URL 结构最佳实践/标准
我正在构建一个包含项目的网站，每个项目都有一个页面，例如: website.com/book/123 website.com/film/456 website.com/game/789 每个项目都可以
NHibernate 标准 - 如何过滤属性组合
我需要使用两个属性的组合来过滤结果列表。一个简单的 SQL 语句如下所示: SELECT TOP 10 * FROM Person WHERE FirstName + ' ' + LastName L
java - 标准:搜索两个不同的实体类
我有一个“ super 实体”SuperEntity 和三个扩展父类(super class)的实体 ChildEntity1、...、ChildEntity3。搜索数据库中的所有实体很容易，即我们

首页

博学

6Ren·AI

商城

C99 标准 - fprintf - s 精确转换