c++ - rint 不存在于 Visual Studio 2010 math.h 中，相当于 CUDA rint-6ren

c++ - rint 不存在于 Visual Studio 2010 math.h 中，相当于 CUDA rint

转载作者：太空狗更新时间：2023-10-29 19:53:20

27

4

我正在将 CUDA 代码移植到 C++ 并使用 Visual Studio 2010。CUDA 代码使用 rint 函数，它似乎不存在于 Visual Studio 2010 math.h 中，所以看来还得自己实现了。

根据这个link , CUDA rint 函数

将 x 舍入为浮点格式中最接近的整数值，中间情况舍入为零。

我想我可以使用转换为 int 的方法来丢弃小数部分，有效地向零舍入，所以我最终得到了以下函数

inline double rint(double x)
{
    int temp; temp = (x >= 0. ? (int)(x + 0.5) : (int)(x - 0.5));
    return (double)temp;
}

它有两种不同的转换，一种是int，一种是double。

我有三个问题:

对于“小”数字，上述函数是否完全等同于 CUDA rint？对于不能表示为 int 的“大”数字，它会失败吗？
是否有计算效率更高的方法(而不是使用两次转换)来定义 rint？

非常感谢您。

最佳答案

CUDA 文档中引用的 rint() 描述不正确。使用浮点结果舍入为整数映射 IEEE-754 (2008) 指定的舍入模式如下:

trunc()   // round towards zero
floor()   // round down (towards negative infinity)
ceil()    // round up (towards positive infinity)
rint()    // round to nearest or even (i.e. ties are rounded to even)
round()   // round to nearest, ties away from zero

通常，这些函数按照 C99 标准中的描述工作。对于 rint()，标准指定函数根据当前舍入模式(默认为最接近或偶数舍入)进行舍入。由于 CUDA 不支持动态舍入模式，所有定义为使用当前舍入模式的函数都使用舍入模式“舍入到最近或偶数”。这里有一些例子显示了 round() 和 rint() 之间的区别:

argument  rint()  round()
1.5       2.0     2.0
2.5       2.0     3.0
3.5       4.0     4.0
4.5       4.0     5.0

round() 可以很容易地按照您发布的代码行进行模拟，我不知道 rint() 的简单模拟。请注意，您不希望使用到整数的中间转换，因为“int”支持的数值范围比可以用“double”精确表示的整数更窄。而是酌情使用 trunc()、ceil()、floor()。

由于 rint() 是当前 C 和 C++ 标准的一部分，我对 MSVC 不包含此函数感到有点惊讶；我建议检查 MSDN 以查看是否提供替代品。如果您的平台支持 SSE4，则可以使用 smmintrin.h 中定义的 SSE 内在函数 _mm_round_sd()、_mm_round_pd()，并将舍入模式设置为 _MM_FROUND_TO_NEAREST_INT ，实现CUDA的rint()的功能。

虽然(根据我的经验)SSE 内在函数可以跨 Windows、Linux 和 Mac OS X 移植，但您可能希望避免使用特定于硬件的代码。在这种情况下，您可以尝试以下代码(经过简单测试):

double my_rint(double a)
{
    const double two_to_52 = 4.5035996273704960e+15;
    double fa = fabs(a);
    double r = two_to_52 + fa;
    if (fa >= two_to_52) {
        r = a;
    } else {
        r = r - two_to_52;
        r = _copysign(r, a);
    }
    return r;
}

请注意，MSVC 2010 似乎也缺少标准的 copysign() 函数，因此我不得不替换为 _copysign()。上面的代码假定当前的舍入模式是舍入到最接近的偶数(默认情况下)。通过添加 2**52 它确保舍入发生在整数单位位。请注意，这还假定执行纯 double 计算。在对中间结果使用更高精度的平台上，可能需要将“fa”和“r”声明为易变的。

关于c++ - rint 不存在于 Visual Studio 2010 math.h 中，相当于 CUDA rint，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/14919512/

27

4

0

文章推荐： c++ - 如何使用 OpenCV 2.4.3 imRotate？

文章推荐： c# - 如何使用 C# 有效地合并巨大的文件

文章推荐： c# - C# : why is it implicitly by reference? 传入数组参数

c++ - rint 不存在于 Visual Studio 2010 math.h 中，相当于 CUDA rint
我正在将 CUDA 代码移植到 C++ 并使用 Visual Studio 2010。CUDA 代码使用 rint 函数，它似乎不存在于 Visual Studio 2010 math.h 中，所以看
c++ - rint 不存在于 Visual Studio 2010 math.h 中，相当于 CUDA rint
我正在将 CUDA 代码移植到 C++ 并使用 Visual Studio 2010。CUDA 代码使用 rint 函数，它似乎不存在于 Visual Studio 2010 math.h 中，所以看
python - numpy `rint` 奇怪的行为
这个问题是关于numpy.rint的，根据定义四舍五入到最接近的整数。但是，以下代码会产生不一致的结果。 In [1]: import numpy as np for i in ran
python - numpy.rint 没有按预期工作
我正在尝试找出导致此结果的原因: import numpy result1 = numpy.rint(1.5) result2 = numpy.rint(6.5) print result 输出: r
python - 霍夫圆检测AttributeError : 'NoneType' object has no attribute 'rint'
我尝试在 open cv2 中使用 Houghcircle 检测这个圆，但出现错误。下面是我的代码 1 chh = cv2.HoughCircles(crr, cv2.HOUGH_GRADIENT,
java - 舍入数字 : Java's 'Math.rint' in Python
Python 中是否存在与 Java 的 Math.rint 类似的东西？如果不是，我怎样才能达到相同的结果？最佳答案这里是 Python 2 的 rint 的完全相似的工作方式: def rin
python - numpy.rint 可以返回一个 Int32 吗？
我在做 ret = np.rint(y * 4) return ret 我希望它返回 Int32。我尝试添加 dtype='Int32'，但错误提示:TypeError: No loop matchi
java - Math.ceil 和 Math.rint 方法返回相同的值
Math.rint() 和Math.ceil() 方法返回相同的值，谁能解释一下为什么？这些方法有什么区别？何时使用 Math.rint() 以及在何处使用 Math.ceil()？ public
python - numpy.rint 和 numpy.round 之间的区别
numpy.rint 和 numpy.round/numpy.around 有什么区别？它们似乎执行相同的功能: >>> a array([-1.7, -1.5, -0.2, 0.2, 1.5,
python - 使用 numpy.round 时出现 `AttributeError: rint`
我有一个如下所示的 numpy 数组: [[41.743617 -87.626839] [41.936943 -87.669838] [41.962665 -87.65571899999999]]
c - 使用 CMake 创建 VS 项目后的 rint() 问题
我在编译代码时遇到问题 - 特别是 METIS - Serial Graph Partitioning and Fill-reducing Matrix Ordering . 我已经成功地通过两种方式
java - Java 中 Math.rint() 和 Math.round() 的区别
Math.rint() 和 Math.round() 有什么区别？最佳答案 Math.rint() 和 Math.round() 有几个不同之处，但对 Java 应用程序的业务逻辑影响最大的是方式它
python - 与 astype(int) 相比，numpy around/rint 慢
所以如果我有类似 x=np.random.rand(60000)*400-200 的东西. iPython 的 %timeit说: x.astype(int)耗时 0.14 毫秒 np.rint(x)
python - 为什么在 tensorflow 中同时存在 tf.round 和 tf.rint 函数？
Tensorflow 有两个函数可以将数字舍入到最接近的整数 tf.round和 tf.rint . 据我所知，这些函数在做同样的事情，但它们在不同的类型集上运行，这使得它们没有一个比另一个更通用(例
c++ - Visual Studio 2012 中的 round()、rint()(等等)C++ 函数在哪里？
这些页面: http://www.cplusplus.com/reference/cmath/round/ http://www.cplusplus.com/reference/cmath/rint/
generics - CGFloat:当 Float 时调用 rintf()，当 Double 时调用 rint()
当架构是 32 位时，CGFloat 是 Float。在这种情况下，我想调用 rintf() 当体系结构为 64 位时，CGFloat 为 Double。在这种情况下，我想调用 rint() 我目
python - 使用 numpy.rint() 舍入到最接近的 int 与 0.5 不一致
numpy's round int似乎与它处理 xxx.5 的方式不一致 In [2]: np.rint(1.5) Out[2]: 2.0 In [3]: np.rint(10.5) Out[3]:

首页

博学

6Ren·AI

商城

c++ - rint 不存在于 Visual Studio 2010 math.h 中，相当于 CUDA rint