duangan6731 2017-08-20 17:03

Golang data race in a custom concurrent map even when using a mutex

Here is a simple concurrent map that I wrote for learning purposes:

    package concurrent_hashmap

    import (
        "hash/fnv"
        "sync"
    )

    type ConcurrentMap struct {
        buckets []ThreadSafeMap
        bucketCount uint32
    }

    type ThreadSafeMap struct {
        mapLock sync.RWMutex
        hashMap map[string]interface{}
    }

    func NewConcurrentMap(bucketSize uint32) *ConcurrentMap {
        var threadSafeMapInstance ThreadSafeMap
        var bucketOfThreadSafeMap []ThreadSafeMap

        for i := 0; i <= int(bucketSize); i++ {
            threadSafeMapInstance = ThreadSafeMap{sync.RWMutex{}, make(map[string]interface{})}
            bucketOfThreadSafeMap = append(bucketOfThreadSafeMap, threadSafeMapInstance)
        }

        return &ConcurrentMap{bucketOfThreadSafeMap, bucketSize}
    }

    func (cMap *ConcurrentMap) Put(key string, val interface{}) {
        bucketIndex := hash(key) % cMap.bucketCount
        bucket := cMap.buckets[bucketIndex]
        bucket.mapLock.Lock()
        bucket.hashMap[key] = val
        bucket.mapLock.Unlock()
    }

    // Helper
    func hash(s string) uint32 {
        h := fnv.New32a()
        h.Write([]byte(s))
        return h.Sum32()
    }

I am trying to write a simple benchmark, and I find that synchronous access works correctly, but concurrent access fails with

fatal error: concurrent map writes

Here is my benchmark, run with go test -bench=. -race:

package concurrent_hashmap

import (
    "testing"
    "runtime"
    "math/rand"
    "strconv"
    "sync"
)
// Concurrent does not work
func BenchmarkMyFunc(b *testing.B) {
    var wg sync.WaitGroup

    runtime.GOMAXPROCS(runtime.NumCPU())

    my_map := NewConcurrentMap(uint32(4))
    for n := 0; n < b.N; n++ {
        go insert(my_map, wg)
    }
    wg.Wait()
}

func insert(my_map *ConcurrentMap, wg sync.WaitGroup) {
    wg.Add(1)
    var rand_int int
    for element_num := 0; element_num < 1000; element_num++ {
        rand_int = rand.Intn(100)
        my_map.Put(strconv.Itoa(rand_int), rand_int)
    }
    defer wg.Done()
}

// This works
func BenchmarkMyFuncSynchronize(b *testing.B) {
    my_map := NewConcurrentMap(uint32(4))
    for n := 0; n < b.N; n++ {
        my_map.Put(strconv.Itoa(123), 123)
    }
}

The WARNING: DATA RACE report says that bucket.hashMap[key] = val is causing the problem, but I am confused about how that is possible, since I take the lock around that statement whenever a write happens.

I think I am missing something basic. Can someone point out my mistake?

Thanks

Edit 1:

Not sure if this helps, but here is what my mutex looks like if I don't lock anything:

{{0 0} 0 0 0 0}

Here is what it looks like if I lock for writing:

{{1 0} 0 0 -1073741824 0}

I am not sure why my readerCount is such a large negative number.

Edit 2:

I think I found where the issue is, but I am not sure why I have to write it that way.

The issue is

type ThreadSafeMap struct {
    mapLock sync.RWMutex // This is causing problem
    hashMap map[string]interface{}
}

It should be

type ThreadSafeMap struct {
    mapLock *sync.RWMutex
    hashMap map[string]interface{}
}

Another weird thing is that if I put print statements inside the lock in Put

bucket.mapLock.Lock()
fmt.Println("start")
fmt.Println(bucket)
fmt.Println(bucketIndex)
fmt.Println(bucket.mapLock)
fmt.Println(&bucket.mapLock)
bucket.hashMap[key] = val
defer bucket.mapLock.Unlock()

the following output is possible:

start
start
{0x4212861c0 map[123:123]}
{0x4212241c0 map[123:123]}

It's weird because each start line should be followed by four lines of bucket info. Two start lines back to back would indicate that multiple threads are executing the code inside the lock at the same time.

Also, for some reason each bucket.mapLock has a different address even if I make bucketIndex static, which indicates that I am not even accessing the same lock.

Despite the weirdness above, changing the mutex to a pointer solves my problem.

I would love to find out why I need a pointer for the mutex, why the prints seem to indicate that multiple threads are holding the lock, and why each lock has a different address.


douzhang8840 2017-08-24 19:16


The problem is with the statement

bucket := cMap.buckets[bucketIndex]

bucket now contains a copy of the ThreadSafeMap at that index. Because sync.RWMutex is stored as a value, the assignment copies the mutex as well. A map value, however, is a reference to an underlying data structure, so the copy still points to the same map. The code therefore locks a copy of the lock while writing to a single shared map, which causes the data race.
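The copy can be observed in isolation. The following is a minimal sketch, not code from the question: the type bucket and the function copyShares are hypothetical names that mirror the shape of ThreadSafeMap. It checks whether a plain assignment from a slice element shares the lock and whether it shares the map.

```go
package main

import (
	"fmt"
	"sync"
)

// bucket is a hypothetical stand-in for ThreadSafeMap: a mutex stored by
// value next to a map.
type bucket struct {
	mu sync.RWMutex
	m  map[string]int
}

// copyShares reports whether a plain assignment from a slice element
// shares the lock, and whether it shares the map, with that element.
func copyShares() (sameLock, sameMap bool) {
	buckets := []bucket{{m: make(map[string]int)}}

	// Deliberately the same mistake as in Put: this copies the whole
	// struct, including the mutex. go vet's copylocks check reports
	// this kind of assignment.
	b := buckets[0]

	sameLock = &b.mu == &buckets[0].mu

	b.m["k"] = 1                   // write through the copy
	_, sameMap = buckets[0].m["k"] // look it up through the original
	return sameLock, sameMap
}

func main() {
	sameLock, sameMap := copyShares()
	fmt.Println("same lock:", sameLock) // same lock: false
	fmt.Println("same map:", sameMap)   // same map: true
}
```

Each call to Put therefore locks its own private mutex, so nothing stops two goroutines from writing to the shared map at the same time.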

That's why the problem goes away when you change sync.RWMutex to *sync.RWMutex. It's better to store pointers to the struct in the slice, as shown.

package concurrent_hashmap

import (
    "hash/fnv"
    "sync"
)

type ConcurrentMap struct {
    buckets     []*ThreadSafeMap
    bucketCount uint32
}

type ThreadSafeMap struct {
    mapLock sync.RWMutex
    hashMap map[string]interface{}
}

func NewConcurrentMap(bucketSize uint32) *ConcurrentMap {
    var threadSafeMapInstance *ThreadSafeMap
    var bucketOfThreadSafeMap []*ThreadSafeMap

    // Use < rather than <= so that exactly bucketSize buckets are created.
    for i := 0; i < int(bucketSize); i++ {
        threadSafeMapInstance = &ThreadSafeMap{sync.RWMutex{}, make(map[string]interface{})}
        bucketOfThreadSafeMap = append(bucketOfThreadSafeMap, threadSafeMapInstance)
    }

    return &ConcurrentMap{bucketOfThreadSafeMap, bucketSize}
}

func (cMap *ConcurrentMap) Put(key string, val interface{}) {
    bucketIndex := hash(key) % cMap.bucketCount
    bucket := cMap.buckets[bucketIndex]
    bucket.mapLock.Lock()
    bucket.hashMap[key] = val
    bucket.mapLock.Unlock()
}

// Helper
func hash(s string) uint32 {
    h := fnv.New32a()
    h.Write([]byte(s))
    return h.Sum32()
}
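An alternative that keeps the buckets stored by value is to take the address of the slice element instead of copying it. The sketch below is one possible variant under stated assumptions, not the code from the question: the names threadSafeMap, bucketFor, and Get are introduced here for illustration, and the slice is never resized after construction, so the element addresses stay valid.

```go
package main

import (
	"fmt"
	"hash/fnv"
	"sync"
)

// threadSafeMap keeps the mutex by value; it must never be copied after use.
type threadSafeMap struct {
	mu sync.RWMutex
	m  map[string]interface{}
}

// ConcurrentMap shards keys over a fixed slice of buckets.
type ConcurrentMap struct {
	buckets []threadSafeMap
}

// NewConcurrentMap creates n buckets (at least one).
func NewConcurrentMap(n uint32) *ConcurrentMap {
	if n == 0 {
		n = 1 // avoid a modulo by zero in bucketFor
	}
	buckets := make([]threadSafeMap, n)
	for i := range buckets {
		buckets[i].m = make(map[string]interface{}) // initialize in place, no copy
	}
	return &ConcurrentMap{buckets: buckets}
}

// bucketFor returns a pointer to the bucket for key, so callers lock the
// mutex that lives in the slice rather than a copy of it.
func (c *ConcurrentMap) bucketFor(key string) *threadSafeMap {
	h := fnv.New32a()
	h.Write([]byte(key))
	return &c.buckets[h.Sum32()%uint32(len(c.buckets))]
}

// Put stores val under key while holding the bucket's write lock.
func (c *ConcurrentMap) Put(key string, val interface{}) {
	b := c.bucketFor(key)
	b.mu.Lock()
	defer b.mu.Unlock()
	b.m[key] = val
}

// Get looks up key while holding the bucket's read lock.
func (c *ConcurrentMap) Get(key string) (interface{}, bool) {
	b := c.bucketFor(key)
	b.mu.RLock()
	defer b.mu.RUnlock()
	v, ok := b.m[key]
	return v, ok
}

func main() {
	cm := NewConcurrentMap(4)
	var wg sync.WaitGroup
	for g := 0; g < 8; g++ {
		wg.Add(1)
		go func(g int) {
			defer wg.Done()
			for i := 0; i < 1000; i++ {
				cm.Put(fmt.Sprint(i%100), g)
			}
		}(g)
	}
	wg.Wait()
	_, ok := cm.Get("42")
	fmt.Println("found key 42:", ok) // found key 42: true
}
```

Whether to store pointers in the slice or to take element addresses is mostly a matter of taste here; the point in both cases is that every goroutine locks the same mutex.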

It's possible to validate the scenario by modifying the function Put as follows

func (cMap *ConcurrentMap) Put(key string, val interface{}) {
    //fmt.Println("index", key)
    bucketIndex := 1
    bucket := cMap.buckets[bucketIndex]
    fmt.Printf("%p %p\n", &(bucket.mapLock), bucket.hashMap)
}
