c++ - openMP 缺乏 yield 递减与更高的线程数-6ren

c++ - openMP 缺乏 yield 递减与更高的线程数

转载作者：太空宇宙更新时间：2023-11-04 13:13:24

28

4

我的代码现在有一个循环，它调用蒙特卡洛函数来计算多个样本的简单积分(y=x，从 0 到 1)，并将总时间和积分值写入文本文件。然后循环增加线程数并继续前进。现在大约有 8 个线程，时间峰值约为 2.6 秒。循环迭代超过 64 个线程，我发现速度没有超过 0.2 秒，甚至有时会加速。

对于循环调用蒙特卡洛方法，增加线程数:

//this loop will iterate the main loop for a number of threads from 1 to 16
    for (int j = 1; j <= 17; j++)
    {
        //tell user how many threads are running monte-carlo currently
        cout << "Program is running " << number_threads << " thread(s) currently." << endl;

        //reset values for new run
        num_of_samples = 1;
        integration_result = 0;

        //this for loop will run throughout number of circulations running through monte-carlo
        //and entering the data into the text folder
        for (int i = 1; i <= iteration_num; i++)
        {
            //call monte carlo function to perform integration and write values to text
            monteCarlo(num_of_samples, starting_x, end_x, number_threads);

            //increase num of samples for next test round
            num_of_samples = 2 * num_of_samples;
        } //end of second for loop

        //iterate num_threads
        if (number_threads == 1)
            number_threads = 2;
        else if (number_threads >= 32)
            number_threads += 8;
        else if (number_threads >= 16)
            number_threads += 4;
        else
            number_threads += 2;
    } //end of for loop

蒙特卡洛的并行部分:

int num_threads;
    double x, u, error_difference, fs = 0, integration_result = 0; //fs is a placeholder to hold added values of f(x)
    vector< vector<double>> dataHolder(number_threads, vector<double>(1)); //this vector will hold temp values of each thread

    //get start time for parallel block of code
    double start_time = omp_get_wtime();

    omp_set_dynamic(0);     // Explicitly disable dynamic teams
    omp_set_num_threads(number_threads); // Use 4 threads for all consecutive parallel regions

#pragma omp parallel default(none) private(x, u) shared(std::cout, end_x, starting_x, num_of_samples, fs, number_threads, num_threads, dataHolder)
    {
        int i, id, nthrds;
        double temp = fs;

        //define thread id and num of threads
        id = omp_get_thread_num();
        nthrds = omp_get_num_threads();

        //initilialize random seed
        srand(id * time(NULL) * 1000);

        //if there is only one thread
        if(id == 0)
            num_threads = nthrds;

        //this for loop will calculate a temp value for fs for each thread
        for (int i = id; i < num_of_samples; i = i + nthrds)
        {
            //assign random number under integration from 0 to 1
            u = fRand(0, 1); //random number between 0 and 1
            x = starting_x + (end_x - starting_x) * u;

            //this line of code is from Monte_Carlo Method by Alex Godunov (February 2007)
            //calculuate y for reciporical value of x and add it to thread's local fs
            temp += function(x);
        }

        //place temp inside vector dataHolder
        dataHolder[id][0] = temp;

        //no thread will go beyond this barrier until task is complete
#pragma omp barrier

        //one thread will do this task
#pragma omp single
        {
            //add summations to calc fs
            for(i = 0, fs = 0.0; i < num_threads; i ++)
                fs += dataHolder[i][0];
        } //implicit barrier here, wait for all tasks to be done
    }//end of parallel block of code

最佳答案

在使用光散射对简单的蒙特卡洛游走实现相同类型的并行化后，我能够相当多地了解 yield 递减的情况。我认为这里没有 yield 递减，因为积分计算非常简单，线程本身几乎没有单独做的事情，因此它们的开销相对较小。如果其他人有任何其他信息可以证明对这个问题有用，请随时发布。否则我会接受这个作为我的答案。

关于c++ - openMP 缺乏 yield 递减与更高的线程数，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/38647322/

28

4

0

文章推荐： css - 响应式网站设计的渐进图像大小过渡

文章推荐： java - 如何在Android应用程序中实现SSO

文章推荐： javascript - FancyBox 加载 HTML 页面

jQuery动画十进制数递增/递减
我想逐步动画化两个十进制数之间的差异。已找到Joss Crowcroft's solution对于效果很好的整数，我做了 example on jsfiddle .代码片段: $({numberVa
video - 在ffmpeg的drawtext中打印结束时间(递减)
我试图弄清楚如何使用 ffmpeg drawtext 函数，该函数在视频上打印叠加层，视频文件结束时还有时间。使用 %{pts\:hms} 从开始 (00:00:00) 开始似乎没问题从视频文件的开
java - 未能成功演示并发递增/递减
本周我开始学习 Java 线程和并发；我需要一些关于我使用 Thread 而不是 Runnable 实现的下一个代码的帮助: 类别 package hilos; public class Hilo e
c# - 更新对象属性 - 递减
我有一个对象列表(发票)。这些对象有一个 qty 属性。我有一个产品返回表格，其中包含要退回的产品的数量。提交返回行时。我用提交的行填充一个 gridview，以便在提交整个返回表格之前可以根据需要
C - 优先级递增/递减？
这个问题已经有答案了: Can you have a triple minus signs in C programming? What does it mean? [duplicate] (5 个回
javascript - 单个语句中的多个递增/递减
var a = 0; (++a)+(a++)+(++a); print(a); 这会打印 3。我假设它只执行单个增量。 var a = 0; (++a)+(a++)+(--a); 这会打印 1. 这里
c - 在同一语句中使用多个递增/递减
我知道 C 中的计算顺序并不严格，所以表达式 --a+++a 的值是未定义的，因为不知道语句的哪一部分先运行。但是，如果我知道在特定情况下计算顺序无关紧要怎么办？例如: 所有修改对应不同的变量(如a
c# - 变量不会在我的代码中递增/递减
if (getchar == '+') { answer = getnum1+getnum2; // if the random operation is add, it will add
c# - 按百分比递增、递减
我会尽量用代码来解释这个问题: double power = 5000; //picked up 5 power ups, now need to increase power by 10% per
python - 递减 for 循环
这个问题在这里已经有了答案: Loop backwards using indices (17 个回答) 关闭5年前. 我想要一个像这样的 for 循环: for counter in range(1
prolog - 递减 Prolog 中的相同变量
像这样: decr(X, X) :- X is X-1. 我想用它来递减父规则中的数字，如果这个数字等于 0，例如，父规则返回 false。最佳答案 Prolog 是声明性的:声明性语言的一
javascript - 递减 setState 中的计数器
我正在制作一个包含三张图片的 slider 。对于此幻灯片的后退按钮，它应该从最后一张图像开始并返回到 1。下面的代码在控制台记录时从 3 减少到 2 但未命中 1，所以 3 2 3 2 3 2 ..
javascript - 递减 While 循环排序 Javascript
这实际上是我关于 javascript 的第一篇文章。想知道以下代码的性能，因为它是一个“递减 while 循环”，在 while 循环中有一个递增变量。 var i = data.d.length;
postgresql pg_trgm.word_similarity_threshold 递减
将 pg_trgm.word_similarity_threshold 设置为 0.2；降低当前 session 的阈值但不为数据库做。我需要降低支持拼写错误的阈值。最佳答案赞the docume
javascript - 两个按钮一个功能(递增、递减)
我希望能够增加和减少一个值(5)，并且我想用一个函数来覆盖它(我知道如何用两个函数来实现)。不幸的是，我无法完成它，也无法找出问题所在。这是我的代码: HTML: -
c++ - 如何自动定义递增/递减/等运算符？
我有一堆 enum 类型，像这样: enum Color {COLOR_RED = 0, COLOR_GREEN = 1, COLOR_BLUE = 2, COLOR_NUM}; enum Direc
c - 递减 NULL 指针会导致未定义的行为吗？
在我的机器上递减一个 NULL 指针仍然得到一个 NULL 指针，我想知道这是否定义明确。 char *p = NULL; --p; 最佳答案是的，行为未定义。 --p 等同于 p = p - 1(
c# - 基于正方形内距离的 yield 递减
简而言之，我正在将地形(2d 高度图)从生成的值平滑回到其原始值。有一个仅使用生成值的 6 单位平坦区域，然后是一个从生成值移回原始值的 3 单位平滑区域(总共 9 个) 在平坦区域中，所有 x、z
java - 在递增、递减、递增和递减数组中查找最大值和最小值的算法
给定一个数组，其中的值要么只增加，要么只减少或先增加再减少，如何找到此类数组的最大值和最小值？最小值只是最终值中的最小值。但是如何找到最大值呢？一种方法是运行时间为 O(n) 的线性方法，是否可
java - 递归方法调用中的后递增/递减 (Java)
假设您有一个递归方法，并且您在递归调用中递增/递减一个值。为什么这会导致堆栈溢出异常，而预自增/自减不会？例如 numberCount(currentNumber++); //Stack overf

首页

博学

6Ren·AI

商城

c++ - openMP 缺乏 yield 递减与更高的线程数