Golang在Goroutine之间共享大量数据

I have a need to read structure fields set from another goroutine, afaik doing so directly even when knowing for sure there will be no concurrent access(write finished before read occurred, signaled via chan struct{}) may result in stale data

Will sending a pointer to the structure(created in the 1st goroutine, modified in the 2nd, read by the 3rd) resolve the possible staleness issue, considering I can guarantee no concurrent access?

I would like to avoid copying as structure is big and contains huge Bytes.Buffer filled in the 2nd goroutine, I need to read from the 3rd

There is an option for locking, but seems like an overkill considering I know that there will be no concurrent access

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

dongshuo1257 2016-05-22 10:41

关注

There are many answers to this, and it depends to your data structure and program logic.

see: How to lock/synchronize access to a variable in Go during concurrent goroutines?
and: How to use RWMutex in Golang?

1- using Stateful Goroutines and channels
2- using sync.Mutex
3- using sync/atomic
4- using WaitGroup
5- using program logic(Semaphore)
...

1: Stateful Goroutines and channels:
I simulated very similar sample(imagine you want to read from one SSD and write to another SSD with different speed):
In this sample code one goroutine (named write) does some job prepares data and fills the big struct, and another goroutine (named read) reads data from big struct then do some job, And the manger goroutine, guarantee no concurrent access to same data. And communication between three goroutines done with channels. And in your case you can use pointers for channel data, or global struct like this sample.
output will be like this:
mean= 36.6920166015625 stdev= 6.068973186592054

I hope this helps you to get the idea.
Working sample code:

package main

import (
    "fmt"
    "math"
    "math/rand"
    "runtime"
    "sync"
    "time"
)

type BigStruct struct {
    big     []uint16
    rpos    int
    wpos    int
    full    bool
    empty   bool
    stopped bool
}

func main() {
    wg.Add(1)
    go write()
    go read()
    go manage()
    runtime.Gosched()
    stopCh <- <-time.After(5 * time.Second)
    wg.Wait()
    mean := Mean(hist)
    stdev := stdDev(hist, mean)
    fmt.Println("mean=", mean, "stdev=", stdev)
}

const N = 1024 * 1024 * 1024

var wg sync.WaitGroup
var stopCh chan time.Time = make(chan time.Time)

var hist []int = make([]int, 65536)

var s *BigStruct = &BigStruct{empty: true,
    big: make([]uint16, N), //2GB
}

var rc chan uint16 = make(chan uint16)
var wc chan uint16 = make(chan uint16)

func next(pos int) int {
    pos++
    if pos >= N {
        pos = 0
    }
    return pos
}

func manage() {
    dataReady := false
    var data uint16
    for {
        if !dataReady && !s.empty {
            dataReady = true
            data = s.big[s.rpos]
            s.rpos++
            if s.rpos >= N {
                s.rpos = 0
            }
            s.empty = s.rpos == s.wpos
            s.full = next(s.wpos) == s.rpos
        }
        if dataReady {
            select {
            case rc <- data:
                dataReady = false
            default:
                runtime.Gosched()
            }
        }
        if !s.full {
            select {
            case d := <-wc:
                s.big[s.wpos] = d
                s.wpos++
                if s.wpos >= N {
                    s.wpos = 0
                }
                s.empty = s.rpos == s.wpos
                s.full = next(s.wpos) == s.rpos
            default:
                runtime.Gosched()
            }
        }
        if s.stopped {
            if s.empty {
                wg.Done()
                return
            }
        }

    }
}

func read() {
    for {
        d := <-rc
        hist[d]++
    }
}

func write() {
    for {
        wc <- uint16(rand.Intn(65536))
        select {
        case <-stopCh:
            s.stopped = true
            return
        default:
            runtime.Gosched()
        }
    }
}

func stdDev(data []int, mean float64) float64 {
    sum := 0.0
    for _, d := range data {
        sum += math.Pow(float64(d)-mean, 2)
    }
    variance := sum / float64(len(data)-1)
    return math.Sqrt(variance)
}
func Mean(data []int) float64 {
    sum := 0.0
    for _, d := range data {
        sum += float64(d)
    }
    return sum / float64(len(data))
}

5: another way(faster) for some use cases:
here another way to use shared data structure for read job/write job/ processing job which it was separated in first post, now here doing same 3 jobs without channels and without mutex.

working sample:

package main

import (
    "fmt"
    "math"
    "math/rand"
    "time"
)

type BigStruct struct {
    big     []uint16
    rpos    int
    wpos    int
    full    bool
    empty   bool
    stopped bool
}

func manage() {
    for {
        if !s.empty {
            hist[s.big[s.rpos]]++ //sample read job with any time len
            nextPtr(&s.rpos)
        }
        if !s.full && !s.stopped {
            s.big[s.wpos] = uint16(rand.Intn(65536)) //sample wrire job with any time len
            nextPtr(&s.wpos)
        }
        if s.stopped {
            if s.empty {
                return
            }
        } else {
            s.stopped = time.Since(t0) >= 5*time.Second
        }
    }
}

func main() {
    t0 = time.Now()
    manage()
    mean := Mean(hist)
    stdev := StdDev(hist, mean)
    fmt.Println("mean=", mean, "stdev=", stdev)
    d0 := time.Since(t0)
    fmt.Println(d0) //5.8523347s
}

var t0 time.Time

const N = 100 * 1024 * 1024

var hist []int = make([]int, 65536)

var s *BigStruct = &BigStruct{empty: true,
    big: make([]uint16, N), //2GB
}

func next(pos int) int {
    pos++
    if pos >= N {
        pos = 0
    }
    return pos
}
func nextPtr(pos *int) {
    *pos++
    if *pos >= N {
        *pos = 0
    }

    s.empty = s.rpos == s.wpos
    s.full = next(s.wpos) == s.rpos
}

func StdDev(data []int, mean float64) float64 {
    sum := 0.0
    for _, d := range data {
        sum += math.Pow(float64(d)-mean, 2)
    }
    variance := sum / float64(len(data)-1)
    return math.Sqrt(variance)
}
func Mean(data []int) float64 {
    sum := 0.0
    for _, d := range data {
        sum += float64(d)
    }
    return sum / float64(len(data))
}

报告相同问题？

关注问题

Golang如何在goroutine之间共享变量？
2016-08-29 13:33

回答 3 已采纳 You have new variable on each run of x := i, This code shows difference well, by printing the addr
在golang中优先使用goroutine
2018-12-21 20:09

回答 2 已采纳 I have created threadpools on golang. This should allow easily one to prioritize certain goroutine
在Golang模板之间共享变量
2019-09-24 23:00

回答 1 已采纳 Create a third file with the common definitions: {{define "myVar"}} the-var {{end}} Parse that
Golang协程goroutine
2019-12-12 22:35

gengqianyu的博客进程是程序在一个数据集上的一次运行过程。进程是操作系统进行资源分配的基本单位。每个进程都有自己的独立内存空间，不同进程通过进程间同步信号量来通信。由于进程比较重量，占据独立的内存，所以上下文进程间...
在多个goroutine之间共享的Golang结构中，非共享成员是否需要互斥保护？
2016-01-29 12:10

回答 2 已采纳 If only a single goroutine accesses the struct member, you don't need to have a mutex to control a
Golang多个goroutine通过引用共享相同的变量
2017-02-05 01:15

回答 1 已采纳 Are you sure you need goroutines to perform simple validations? Anyway the code you have written u
Golang Goroutine泄漏
2015-03-22 02:35

回答 2 已采纳 The program stops receiving on the channels when a difference is detected. The walk goroutines r
Golang Goroutine 入门使用
2020-09-08 23:55

Vongolar的博客 goroutine(协程)是golang最重要的特色，大多数语言都有协程或类似的任务调度系统，一般叫做线程池。那为什么golang的协程还是最被使用者津津乐道的呢？因为golang是第一个语言层面支持协程的语言(也许有别...
Golang：Goroutine无限循环
2014-08-01 06:03

回答 1 已采纳 The Go By Example article includes: // Allow other goroutines to proceed. runtime.Gosched()
在golang中的包之间共享常量
2018-04-11 14:10

回答 2 已采纳 I would suggest to declare struct with constant fields and import that struct in any package you w
如何在Golang中实现适当的并行性？ goroutine是否与Go1.5 +并行？
2018-03-04 21:09

回答 1 已采纳 The Go Playground is a single-processor virtual machine. You are running a trivial goroutine. Toy
golang goroutine实现_深入golang之---goroutine并发控制与通信
2021-01-14 07:52

唱游大世界的博客开发go程序的时候，时常需要使用goroutine并发处理任务，有时候这些goroutine是相互独立的，而有的时候，多个goroutine之间常常是需要同步与通信的。另一种情况，主goroutine需要控制它所属的子goroutine，总结起来...
Golang：在多个goroutine中发送关闭通道错误
2018-08-08 07:32

回答 2 已采纳 When you work with channels in Go always the sender should close the channel. Because that signals
golang中Goroutine + Channel 常用模型实践
2019-10-11 13:56

咻咻ing的博客 goroutine不同于thread，threads是操作系统中的对于一个独立运行实例的描述，不同操作...启动thread虽然比process所需的资源要少，但是多个thread之间的上下文切换仍然是需要大量的工作的（寄存器/Program Count/St...
golang goroutine实现_Golang 探索对Goroutine的控制方法
2021-01-14 07:52

懂车老王的博客前言在golang中，只需要在函数调用前加上关键字go即可创建一个并发任务单元，而这个新建的任务会被放入队列中，等待调度器安排。相比系统的MB级别线程栈，goroutine的自定义栈只有2KB，这使得我们能够轻易创建上万个...
没有解决我的问题, 去提问

悬赏问题

¥15 关于#hadoop#的问题
¥15 (标签-Python|关键词-socket)
¥15 keil里为什么main.c定义的函数在it.c调用不了
¥50 切换TabTip键盘的输入法
¥15 可否在不同线程中调用封装数据库操作的类
¥15 微带串馈天线阵列每个阵元宽度计算
¥15 keil的map文件中Image component sizes各项意思
¥20 求个正点原子stm32f407开发版的贪吃蛇游戏
¥15 划分vlan后，链路不通了？
¥20 求各位懂行的人，注册表能不能看到usb使用得具体信息，干了什么，传输了什么数据

码龄粉丝数原力等级 --

Golang在Goroutine之间共享大量数据

2条回答默认最新

码龄粉丝数原力等级 --

悬赏问题

Golang在Goroutine之间共享大量数据

2条回答 默认 最新

悬赏问题

2条回答默认最新