Concurrent access to maps with 'range' in Go

The "Go maps in action" entry in the Go blog states:

Maps are not safe for concurrent use: it's not defined what happens when you read and write to them simultaneously. If you need to read from and write to a map from concurrently executing goroutines, the accesses must be mediated by some kind of synchronization mechanism. One common way to protect maps is with sync.RWMutex.

However, one common way to access maps is to iterate over them with the range keyword. It is not clear whether, for the purposes of concurrent access, the whole execution of a range loop counts as a "read", or only the "turnover" phase between iterations. For example, the following code may or may not run afoul of the "no concurrent r/w on maps" rule, depending on the specific semantics / implementation of the range operation:

 var (
     testMap         = make(map[int]int)
     testMapLock     = make(chan bool, 1) // one-slot channel used as a mutex
     testMapSequence = 0
 )

 func init() {
     testMapLock <- true // the lock starts out available
 }

...

 func WriteTestMap(k, v int) {
    <-testMapLock
    testMap[k] = v
    testMapSequence++
    testMapLock<-true
 }

 func IterateMapKeys(iteratorChannel chan int) error {
    <-testMapLock
    defer func() {
       testMapLock <- true
    }()
    mySeq := testMapSequence
    for k := range testMap {
       testMapLock <- true
       iteratorChannel <- k
       <-testMapLock
       if mySeq != testMapSequence {
           close(iteratorChannel)
           return errors.New("concurrent modification")
       }
    }
    return nil
 }

The idea here is that the range "iterator" is open while the second function is waiting for a consumer to take the next value, and the writer is not blocked at that time. However, it is never the case that two reads in a single iterator are on either side of a write: this is a "fail fast" iterator, to borrow a Java term.

Is there anything anywhere in the language specification or other documents that indicates whether this is a legitimate thing to do, however? I could see it going either way, and the above quoted document is not clear on exactly what constitutes a "read". The documentation seems totally quiet on the concurrency aspects of the for/range statement.

(Please note this question is about the concurrency of for/range, but it is not a duplicate of: Golang concurrent map access with range. The use case is completely different, and I am asking about the precise locking requirement with respect to the 'range' keyword here!)

dongleman4760 2016-11-07 00:29
You are using a for statement with a range expression. Quoting from Spec: For statements:

The range expression is evaluated once before beginning the loop, with one exception: if the range expression is an array or a pointer to an array and at most one iteration variable is present, only the range expression's length is evaluated; if that length is constant, by definition the range expression itself will not be evaluated.

We're ranging over a map, so it's not an exception: the range expression is evaluated only once before beginning the loop. The range expression is simply a map variable testMap:

for k, _ := range testMap {}

The map value does not include the key-value pairs, it only points to a data structure that does. Why is this important? Because the map value is only evaluated once, and if later pairs are added to the map, the map value –evaluated once before the loop– will be a map that still points to a data structure that includes those new pairs. This is in contrast to ranging over a slice (which would be evaluated once too), which is also only a header pointing to a backing array holding the elements; but if elements are added to the slice during the iteration, even if that does not result in allocating and copying over to a new backing array, they will not be included in the iteration (because the slice header also contains the length - already evaluated). Appending elements to a slice may result in a new slice value, but adding pairs to a map will not result in a new map value.
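The difference between the two cases can be sketched in a few lines. This is only an illustration; the function names countSliceIterations and lenAfterWriteThroughCopy are made up for this example and do not come from the question.

```go
package main

import "fmt"

// countSliceIterations appends to a slice while ranging over it and
// reports how many iterations the loop actually performed.
func countSliceIterations() int {
	s := []int{1, 2, 3}
	n := 0
	for range s {
		s = append(s, 0) // grows s, but the loop's length was fixed up front
		n++
	}
	return n
}

// lenAfterWriteThroughCopy copies a map value, writes through the copy,
// and reports the length seen through the original variable.
func lenAfterWriteThroughCopy() int {
	m := map[int]int{1: 1}
	alias := m   // copies the map value, not the key-value pairs
	alias[2] = 2 // visible through m as well
	return len(m)
}

func main() {
	fmt.Println(countSliceIterations())     // 3
	fmt.Println(lenAfterWriteThroughCopy()) // 2
}
```

The slice loop runs exactly three times even though the slice keeps growing, while the write through the copied map value is visible through the original variable.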

Now on to iteration:

 for k, v := range testMap {
     t1 := time.Now()
     someFunc()
     t2 := time.Now()
 }

Before we enter the block, that is, before the t1 := time.Now() line, the k and v variables already hold the values of the iteration; they have already been read out from the map (else they couldn't hold the values). Question: do you think the map is read by the for ... range statement between t1 and t2? Under what circumstances could that happen? We have here a single goroutine that is executing someFunc(). For the for statement to access the map during that time, it would either require another goroutine, or it would require suspending someFunc(). Obviously neither of those happens. (The for ... range construct is not a multi-goroutine monster.) No matter how many iterations there are, while someFunc() is executed, the map is not accessed by the for statement.

So to answer one of your questions: the map is not accessed inside the for block when executing an iteration, but it is accessed when the k and v values are set (assigned) for the next iteration. This implies that the following iteration over the map is safe for concurrent access:

 var (
     testMap     = make(map[int]int)
     testMapLock = &sync.RWMutex{}
 )

 func IterateMapKeys(iteratorChannel chan int) error {
     testMapLock.RLock()
     defer testMapLock.RUnlock()
     for k, v := range testMap {
         testMapLock.RUnlock()
         someFunc()
         testMapLock.RLock()
         if someCond {
             return someErr
         }
     }
     return nil
 }

Note that unlocking in IterateMapKeys() should (must) happen as a deferred statement, as in your original code you may return "early" with an error, in which case you didn't unlock, which means the map remained locked! (Here modeled by if someCond {...}).

Also note that this type of locking only makes the individual map accesses safe. It does not prevent a concurrent goroutine from modifying the map (e.g. adding a new pair) between two iterations. The modification (if properly guarded with the write lock) will be safe, and the loop may continue, but there is no guarantee that the for loop will iterate over the new pair:

If map entries that have not yet been reached are removed during iteration, the corresponding iteration values will not be produced. If map entries are created during iteration, that entry may be produced during the iteration or may be skipped. The choice may vary for each entry created and from one iteration to the next.
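The deletion half of that rule is deterministic, so it can be shown with a small sketch. The function name iterationsWhenDeleting is invented for this illustration. The insertion half is deliberately left out, because whether a newly created entry is produced may vary from run to run.

```go
package main

import "fmt"

// iterationsWhenDeleting ranges over a map with ten entries and, on the
// first iteration, deletes every entry other than the current one.
// It returns how many iterations the outer loop performed.
func iterationsWhenDeleting() int {
	m := make(map[int]int)
	for i := 0; i < 10; i++ {
		m[i] = i
	}
	n := 0
	for k := range m {
		for other := range m {
			if other != k {
				delete(m, other) // not yet reached by the outer loop
			}
		}
		n++
	}
	return n
}

func main() {
	fmt.Println(iterationsWhenDeleting()) // 1
}
```

Only the first entry is ever produced: the other nine are removed before the outer loop reaches them, so their iteration values are not produced.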

The write-lock-guarded modification may look like this:

 func WriteTestMap(k, v int) {
     testMapLock.Lock()
     defer testMapLock.Unlock()
     testMap[k] = v
 }

Now if you release the read lock in the block of the for, a concurrent goroutine is free to grab the write lock and make modifications to the map. In your code:

 testMapLock <- true
 iteratorChannel <- k
 <-testMapLock

When sending k on the iteratorChannel, a concurrent goroutine may modify the map. This is not just an "unlucky" scenario. Sending a value on a channel is often a "blocking" operation: if the channel's buffer is full (or the channel is unbuffered), another goroutine must be ready to receive in order for the send operation to proceed. Sending a value on a channel is a good scheduling point for the runtime to run other goroutines even on the same OS thread, not to mention the case where there are multiple OS threads, one of which may already be "waiting" for the write lock in order to carry out a map modification.

To sum up the last part: releasing the read lock inside the for block is like yelling to others: "Come, modify the map now if you dare!" Consequently, in your code, encountering mySeq != testMapSequence is very likely. The following runnable example demonstrates it (it's a variation of your example):

 package main

 import (
     "fmt"
     "math/rand"
     "sync"
 )

 var (
     testMap         = make(map[int]int)
     testMapLock     = &sync.RWMutex{}
     testMapSequence int
 )

 func main() {
     go func() {
         for {
             k := rand.Intn(10000)
             WriteTestMap(k, 1)
         }
     }()

     ic := make(chan int)
     go func() {
         for _ = range ic {
         }
     }()

     for {
         if err := IterateMapKeys(ic); err != nil {
             fmt.Println(err)
         }
     }
 }

 func WriteTestMap(k, v int) {
     testMapLock.Lock()
     defer testMapLock.Unlock()
     testMap[k] = v
     testMapSequence++
 }

 func IterateMapKeys(iteratorChannel chan int) error {
     testMapLock.RLock()
     defer testMapLock.RUnlock()
     mySeq := testMapSequence
     for k, _ := range testMap {
         testMapLock.RUnlock()
         iteratorChannel <- k
         testMapLock.RLock()
         if mySeq != testMapSequence {
             //close(iteratorChannel)
             return fmt.Errorf("concurrent modification %d", testMapSequence)
         }
     }
     return nil
 }

Example output:

 concurrent modification 24
 concurrent modification 41
 concurrent modification 463
 concurrent modification 477
 concurrent modification 482
 concurrent modification 496
 concurrent modification 508
 concurrent modification 521
 concurrent modification 525
 concurrent modification 535
 concurrent modification 541
 concurrent modification 555
 concurrent modification 561
 concurrent modification 565
 concurrent modification 570
 concurrent modification 577
 concurrent modification 591
 concurrent modification 593

We're encountering concurrent modification quite often!

Do you want to avoid this kind of concurrent modification? The solution is quite simple: don't release the read lock inside the for. Also run your app with the -race option to detect race conditions: go run -race testmap.go
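If holding the read lock while a slow consumer drains the channel is a concern, one possible variation (not something from the question, just a sketch) is to copy the keys while holding the read lock for the whole range loop, and only then send them without any lock held. The guardedMap type and its methods below are hypothetical names made up for this illustration.

```go
package main

import (
	"fmt"
	"sync"
)

// guardedMap is a hypothetical wrapper pairing a map with its lock.
type guardedMap struct {
	mu sync.RWMutex
	m  map[int]int
}

func newGuardedMap() *guardedMap {
	return &guardedMap{m: make(map[int]int)}
}

// write stores a pair under the write lock.
func (g *guardedMap) write(k, v int) {
	g.mu.Lock()
	defer g.mu.Unlock()
	g.m[k] = v
}

// snapshotKeys copies the keys while holding the read lock for the
// entire range loop, so no writer can run between two iterations.
func (g *guardedMap) snapshotKeys() []int {
	g.mu.RLock()
	defer g.mu.RUnlock()
	keys := make([]int, 0, len(g.m))
	for k := range g.m {
		keys = append(keys, k)
	}
	return keys
}

// sendKeys delivers a snapshot on out without holding any lock,
// so a slow consumer never blocks writers. It closes out when done.
func (g *guardedMap) sendKeys(out chan<- int) {
	for _, k := range g.snapshotKeys() {
		out <- k
	}
	close(out)
}

func main() {
	g := newGuardedMap()
	for i := 0; i < 5; i++ {
		g.write(i, i*i)
	}
	out := make(chan int)
	go g.sendKeys(out)
	n := 0
	for range out {
		n++
	}
	fmt.Println(n) // 5
}
```

The trade-off is that the consumer sees a snapshot: keys written after the copy is taken are not delivered, and the copy costs memory proportional to the number of keys.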

Final thoughts

The language spec clearly allows you to modify the map in the same goroutine while ranging over it; this is what the previous quote relates to ("If map entries that have not yet been reached are removed during iteration.... If map entries are created during iteration..."). Modifying the map in the same goroutine is allowed and is safe, but how it is handled by the iterator logic is not defined.

If the map is modified in another goroutine and you use proper synchronization, The Go Memory Model guarantees that the goroutine with the for ... range will observe all modifications, and the iterator logic will see them just as if "its own" goroutine had made them, which is allowed as stated before.