arrays - Goroutines 共享 slice : : trying to understand a data race-6ren

arrays - Goroutines 共享 slice : : trying to understand a data race

转载作者：IT王子更新时间：2023-10-29 02:26:28

24

4

我尝试用 Go 编写一个程序，以在非常大的 DNA 序列文件中查找一些基因。我已经编写了一个 Perl 程序来执行此操作，但我想利用 goroutines 并行执行此搜索；)

因为文件很大，我的想法是一次读取 100 个序列，然后将分析发送到 goroutine，然后再次读取 100 个序列等。

我要感谢本网站的成员，感谢他们对 slice 和 goroutines 的真正有用的解释。

我已经进行了建议的更改，以使用由 goroutines 处理的 slice 的副本。但是 -race 执行仍然在 copy() 函数级别检测到一个数据竞争:

非常感谢您的意见!

    ==================
WARNING: DATA RACE
Read by goroutine 6:
  runtime.slicecopy()
      /usr/lib/go-1.6/src/runtime/slice.go:113 +0x0
  main.main.func1()
      test_chan006.go:71 +0xd8

Previous write by main goroutine:
  main.main()
      test_chan006.go:63 +0x3b7

Goroutine 6 (running) created at:
  main.main()
      test_chan006.go:73 +0x4c9
==================
[>5HSAA098909 BA098909 ...]
Found 1 data race(s)
exit status 66

    line 71 is : copy(bufCopy, buf_Seq)
    line 63 is : buf_Seq = append(buf_Seq, line)
    line 73 is :}(genes, buf_Seq)




    package main

import (
    "bufio"
    "fmt"
    "os"
    "github.com/mathpl/golang-pkg-pcre/src/pkg/pcre"
    "sync"
)

// function read a list of genes and return a slice of gene names
func read_genes(filename string) []string {
    var genes []string // slice of genes names
    // Open the file.
    f, _ := os.Open(filename)
    // Create a new Scanner for the file.
    scanner := bufio.NewScanner(f)
    // Loop over all lines in the file and print them.
    for scanner.Scan() {
          line := scanner.Text()
        genes = append(genes, line)
    }
    return genes
}

// function find the sequences with a gene matching gene[] slice
func search_gene2( genes []string, seqs []string) ([]string) {
  var res []string

  for r := 0 ; r <= len(seqs) - 1; r++ {
    for i := 0 ; i <= len(genes) - 1; i++ {

      match := pcre.MustCompile(genes[i], 0).MatcherString(seqs[r], 0)

      if (match.Matches() == true) {
          res = append( res, seqs[r])           // is the gene matches the gene name is append to res
          break
      }
    }
  }

  return res
}
//###########################################

func main() {
    var slice []string
    var buf_Seq []string
    read_buff := 100    // the number of sequences analysed by one goroutine

    var wg sync.WaitGroup
    queue := make(chan []string, 100)

    filename := "fasta/sequences.tsv"
    f, _ := os.Open(filename)
    scanner := bufio.NewScanner(f)
    n := 0
    genes := read_genes("lists/genes.csv")

    for scanner.Scan() {
            line := scanner.Text()
            n += 1
            buf_Seq = append(buf_Seq, line) // store the sequences into buf_Seq
            if n == read_buff {   // when the read buffer contains 100 sequences one goroutine analyses them

          wg.Add(1)

          go func(genes, buf_Seq []string) {
            defer wg.Done()
                        bufCopy := make([]string, len(buf_Seq))
                        copy(bufCopy, buf_Seq)
            queue <- search_gene2( genes, bufCopy)
            }(genes, buf_Seq)
                        buf_Seq = buf_Seq[:0]   // reset buf_Seq
              n = 0 // reset the sequences counter

        }
    }
    go func() {
            wg.Wait()
            close(queue)
        }()

        for t := range queue {
            slice = append(slice, t...)
        }

        fmt.Println(slice)
}

最佳答案

goroutines 只在 slice headers 的副本上工作，底层数组是相同的。要制作 slice 的副本，您需要使用 copy(或 append 到不同的 slice )。

buf_Seq = append(buf_Seq, line)
bufCopy := make([]string, len(buf_Seq))
copy(bufCopy, buf_Seq)

然后您可以安全地将 bufCopy 传递给 goroutine，或者直接在闭包中使用它。

关于arrays - Goroutines 共享 slice : : trying to understand a data race，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/38923237/

24

4

0

文章推荐： go - 新函数和未初始化变量的区别

文章推荐： java - 将数百万个 xml 文件插入 basex

文章推荐： c# - 无法读取 iTunes XML 提要

multithreading - Goroutines 是协作调度的。这是否意味着不让出执行的 goroutine 会导致 goroutine 一个一个运行？
发件人:http://blog.nindalf.com/how-goroutines-work/ As the goroutines are scheduled cooperatively, a go
multithreading - 戈朗 : to goroutine or not to goroutine?
很多时候在用 Go 开发 http 服务器时，我都会遇到这种困境。假设我想尽快用http statuscode 200响应客户端(然后在后面执行工作)，这就是我通常这样做的原因: 我让我的主要 ht
go - goroutine 调用行之后的行是否比 goroutine 的第一行更早开始？
这是代码: import "fmt" func main() { messages := make(chan string, 1) go func(c chan string) {
go - 创建 goroutine 并设置最大 goroutines
我正在学习 Golang，但遇到了一些困难。我已经研究过 Google，但没有任何进展。我编写了一个代码，通过多台服务器的 ICMP 检查 RTT。它有这样的结构: type Server str
go - 如何在完成一个 goroutine 后完成所有 goroutine
我想运行多个 goroutine，进行一些处理，将结果放入 channel ，当至少有一个 goroutine 完成时，完成所有其他 goroutine 并从 channel 返回结果。所以，我尝试
go - 如何阻止除正在运行的 goroutines 之外的所有 goroutines
我有两个(但以后我会是三个)go 例程来处理来自远程服务器(来自 ampq channel )的传入消息。但是因为它们正在处理相同的数据/状态，所以我想阻止所有其他 go 例程，除了正在运行的例程。
go - 从 goroutine 获取值并取消另一个 goroutine
我有一个案例，我从 2 个不同的位置(ES 和 REDIS)读取数据，我需要从最快的源读取一个值，因此我触发了 2 个 goroutines，一个从 ES 获取数据，其他从REDIS获取。一旦从其中
go - 如何根据 goroutine 的返回值停止 goroutine
像这里一样，我创建了一个 go playground 示例:sGgxEh40ev ，但无法正常工作。 quit := make(chan bool) res := make(chan int) go
go - 关于 goroutine 在 goroutine 中的行为
我是golang的新手，正在研究goroutine。我写了一个简单的代码，故意使用 goroutine 来划分数字。首先，我给出基数并继续除它的数，直到它不能被整除但是，我改变了go split
go - 如何让多个 goroutine 等待另一个 goroutine 的输出
Main { go routine_1(carryout a time consuming task and return output) go routine_2(wait for output f
go - 从调用另一个 goroutine 的 goroutine 返回
我想知道从另一个 goroutine 返回时调用的 goroutine 会发生什么。他们是继续运行还是被终止？这是一个示例代码来说明我的意思: func func() { // Doing s
go - 一个 goroutine 从结构中读取而另一个 goroutine 正在修改它是否安全？
更具体地说，在我的例子中，我有一个网络服务器和一个全局可访问的结构，网络服务器使用它来生成页面。我有另一个 Goroutine，它总是定期用新值更新该结构。这会引起问题吗？我是否需要实现一种机制来确保
go - 从另一个 goroutine 启动一个 goroutine 是什么意思？
来自 this file ，我不明白为什么函数startWorker会这样写: func (p *WorkerPool) dispatch() { for i := 0; i < p.maxW
go - 无法无限期地写入一个 goroutine 并从另一个 goroutine 中读取
我正在学习围棋，但在使用 goroutines 时遇到了问题。这是我的代码 package main import ( "fmt" "sync" "time" ) var co
go - 反正执行 goroutine 时所有 goroutines 都睡着了
我收到以下错误，我不明白为什么: 发送:查询 Herefatal 错误:所有 goroutines 都睡着了 - 死锁! 您可以看到我正在调用我使用 goroutine 创建的函数 routine。我
go - 我如何等待对多个其他 Goroutines 的单个 Goroutine 响应？
大家好，我正在从 Python3 过渡到 Go，所以我正在尝试重写我创建的库以获得更好的性能。我面临一个问题，因为我是 Golang XD 中的新手，我使用有限的 API 下载数百个 json，我想
go - 用 goroutines 调用非 goroutine 函数
我有以下格式的脚本部分: func main() { for i=0;i<1000000;i++ { go test() } } func test() { a := test
go - 在同一个 goroutines 中创建的 goroutines 是否总是按顺序执行？
package main func main() { c:=make(chan int) for i:=0; i<=100;i++ {
go - 在 goroutines 中启动 goroutines 是否可以接受？
我正在学习 Go，我的第一个项目是一个简单的 ping 脚本。本质上，我想 ping 一堆 url，并在每个响应时等待 XXX 秒，然后再次 ping。这是删减的代码: func mai
algorithm - 程序正在等待 goroutines 完成，但 goroutines 被死锁
这个问题在这里已经有了答案: Go all goroutines are asleep deadlock (2 个回答) fatal error: all goroutines are asleep

首页

博学

6Ren·AI

商城

arrays - Goroutines 共享 slice : : trying to understand a data race