上帝曾经的类型的效率测量

I have a piece of code that I want to run only once for initialization. So far I was using sync.Mutex combined with an if-clause to test if it has been run already. Later I came across the Once type and its DO() function in the same sync package.

The implementation is the following https://golang.org/src/sync/once.go:

func (o *Once) Do(f func()) {
    if atomic.LoadUint32(&o.done) == 1 {
        return
    }
    // Slow-path.
    o.m.Lock()
    defer o.m.Unlock()
    if o.done == 0 {
        defer atomic.StoreUint32(&o.done, 1)
        f()
    }
}

Looking at the code, it is basically the same thing I've been using before. A mutex combined with an if-clause. However, the added function calls makes this seem rather inefficient to me. I did some testing and tried varous versions:

func test1() {
    o.Do(func() {
        // Do smth
    })
    wg.Done()
}

func test2() {
    m.Lock()
    if !b {
        func() {
            // Do smth
        }()
    }
    b = true
    m.Unlock()
    wg.Done()
}

func test3() {
    if !b {
        m.Lock()
        if !b {
            func() {
                // Do smth
            }()
            b = true
        }
        m.Unlock()
    }
    wg.Done()
}

I tested all versions by running the following code:

    wg.Add(10000)
    start = time.Now()
    for i := 0; i < 10000; i++ {
        go testX()
    }
    wg.Wait()
    end = time.Now()

    fmt.Printf("elapsed: %v
", end.Sub(start).Nanoseconds())

with the following resutls:

elapsed: 8002700 //test1
elapsed: 5961600 //test2
elapsed: 5646700 //test3

Is it even worth using the Once type? It is convenient but performance is even worse than test2 which always serializes all routines.

Also, why are they using an atomic int for their if-clause? Storing happens inside the lock anyway.

Edit: Go playground link: https://play.golang.org/p/qlMxPYop7kS NOTICE: this doensn't show the results as time is fixed in the playground.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dpjs2005 2018-11-19 14:18
关注
That is not how you're supposed to test code performance. You should use Go's built-in testing framework (testing package and go test command). See Order of the code and performance for details.

Let's create the testable code:

func f() { // Code that must only be run once } var testOnce = &sync.Once{} func DoWithOnce() { testOnce.Do(f) } var ( mu = &sync.Mutex{} b bool ) func DoWithMutex() { mu.Lock() if !b { f() b = true } mu.Unlock() }

Let's write proper testing / benchmarking code using the testing package:

func BenchmarkOnce(b *testing.B) { for i := 0; i < b.N; i++ { DoWithOnce() } } func BenchmarkMutex(b *testing.B) { for i := 0; i < b.N; i++ { DoWithMutex() } }

We can run the benchmark with the following code:

go test -bench .

And here are the benchmarking results:

BenchmarkOnce-4 200000000 6.30 ns/op BenchmarkMutex-4 100000000 20.0 ns/op PASS

As you can see, using sync.Once() was almost 4 times faster than using a sync.Mutex. Why? Because sync.Once() has an "optimized", short path that uses only an atomic load to check if the task has been called before, and if so, no mutex is used. The "slow" path is likely only used once, on first call to Once.Do(). Although if you'd have many concurrent goroutines attempting to call DoWithOnce(), the slow path might be reached multiple times, but on the long run once.Do() will only need to use an atomic load.

Parallel testing (from multiple goroutines)

Yes, the above benchmarking code only uses a single goroutine to test. But using multiple concurrent goroutines will just make the mutex's case worse, as it always have to obtain a mutex to even check if the task is to be called while sync.Once just uses an atomic load.

Nevertheless, let's benchmark it.

Here are the benchmarking code using parallel testing:

func BenchmarkOnceParallel(b *testing.B) { b.RunParallel(func(pb *testing.PB) { for pb.Next() { DoWithOnce() } }) } func BenchmarkMutexParallel(b *testing.B) { b.RunParallel(func(pb *testing.PB) { for pb.Next() { DoWithMutex() } }) }

I have 4 cores on my machine, so I'm gonna use those 4 cores:

go test -bench Parallel -cpu=4

^{(You may omit the -cpu flag in which case it defaults to GOMAXPROCS–the number of cores available.)}

And here are the results:

BenchmarkOnceParallel-4 500000000 3.04 ns/op BenchmarkMutexParallel-4 20000000 93.7 ns/op

When "concurrency increases", the results are starting to become uncomparable in favor of sync.Once (in the above test, it's 30 times faster).

We may further increase the number of goroutines created using testing.B.SetPralleism(), but I got similar result when I set it to 100 (that means 400 goroutines were used to call the benchmarking code).
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

上帝曾经的类型的效率测量
2018-11-19 13:58

回答 1 已采纳 That is not how you're supposed to test code performance. You should use Go's built-in testing fra
你觉得100年后会有多少种编程语言？你最擅长哪种语言？ python rust 开发语言
2022-06-21 16:53

回答 2 已采纳量子计算的原生编程语言又会是什么样子的呢？
这个数学问题用c语言怎么写 c语言
2023-01-26 13:16

回答 2 已采纳 x，y不要用int，你用double。因为z=(x+y)/2 这里右边是转成了int，所以精度丢失，故和你手算出现偏差。
《人类简史:从动物到上帝》读书摘记
2021-03-07 18:24

匿名少侠的博客人类简史：从动物到上帝尤瓦尔·赫拉利 ◆ 推荐序毕竟，能够像他这样从容游走于这么多学科之间的历史学家，是旷世罕见的。读《人类简史》，我们每每会为作者非同寻常的想象力而赞叹。因为一旦人们发现...
目前来看，学习哪门编程语言最有未来前景？(语言-开发语言) 学习方法开发语言蓝桥杯
2022-09-15 23:23

回答 5 已采纳您好，您孩子多大岁数呢？学习编程，兴趣最关键。。然后，要做好长期不断学习的心理准备。第一阶段：12岁前，岁数较小时，要学好数学，空余时间可以学一些少儿编程方面的资料，培养培育孩子的逻辑思维、数据思维能
找不到实时时间显示在哪 html
2022-12-24 13:32

回答 1 已采纳这有个类似的问题, 你可以参考下: https://ask.csdn.net/questions/7570220
学c语言第一天，想写个求因子的程序，错在哪里啊 c语言
2022-09-22 16:53

回答 4 已采纳你这样写得输入数字\n才行，去掉第五行\n，直接输入数字就行了
第一篇：概述、目录、适用范围及术语 --- IAB/MRC《增强现实（AR）广告（效果）测量指南1.0 》
2024-03-24 20:39

数字化营销工兵的博客虽然数字技术的进步的确会提高广告行业的质量和效率。但是我有一种非常强烈的感觉，随着知识图谱，数字孪生等仿真技术的成熟，所有的这些评估工具包括广告效果评估，一旦拿到真实有效的数据后，大语言模型很快就能...
Error: '*' object has no attribute 'savedesc' python 有问必答
2021-06-07 17:52

回答 3 已采纳错误提示，没有找到类的属性，检查savedesc()是不是缩进错误，从贴上来的代码看，savedesc函数变成了kill函数的内函数了，应该与take()等其他函数对齐。
用sql写一张表中有A，B，C三个字段，对比B字段中内容是否包含A，如包含输出同行C字段？ sql 有问必答
2021-06-03 10:14

回答 5 已采纳 with t as ( select 'abc' as id,'abc' as pulldi,'wtf' as name union all select 'abc','abc-01','dpt
eclipse中maveninstall失败 eclipse maven 有问必答
2021-05-20 17:17

回答 3 已采纳可以参考一下这篇文章。是因为你上次下载的时候没有下载全。 https://blog.51cto.com/qiangsh/1743074
这一年，这些书：2022年读书笔记
2022-12-31 17:50

Heartsuit的博客据传，英国唯美主义诗人奥斯卡·王尔德（Oscar Wilde）曾经说过一句话，我非常赞同：世界上的一切都是有关性的，除了性本身。性是关于权力的。（Everything in the world is about sex, except sex. Sex is about ...
字符串流从xml节点提取数据并修改后，再返回原文件中？？？ c++
2020-10-29 16:27

回答 1 已采纳 ``` string strMsg = ""; root->first_node("node")->value(strMsg.c_str()); ```
【多传感器融合】VIO_FUSION
2021-09-27 20:17

大江东去浪淘尽千古风流人物的博客 VIO_FUSION 文章目录 VIO_FUSION 第一章：VIO技术概览具体讨论内容 1. VIO相比VO（单目、双目和RGBD...在vio里不多见，gps/ins里比较常用测量值=真值+bias+noise 这个模式忽略了3轴加速度计和陀螺仪的非正交误差标度...
如何高效学习，斯科特·扬（全文）
2018-05-20 14:19

懒散的鱼与消失的猫的博客比喻就像金·凯瑞拥有了上帝的力量，将月亮（陌生）拉得靠近自己（熟悉），使学习者能更清楚地观察陌生的知识。科学家哈定曾说：“如果科学家一生注意细微的观察，训练自己注意寻求类比，使自己具备有关的知识，那么...
这一年，这些书：2021年读书笔记
2021-12-31 22:41

Heartsuit的博客互联网时代注重的是流动和循环的效率，共享、分享才是大势所趋。每个人都只是一个信息节点。未来无论你做的是什么产品和服务，从本质上来讲，你经营的都是“数据”。你接触数据的大小和处理数据的能力，决定了你的...
半导体器件基础01：关于PN结的那些事（2）
2023-02-04 15:47

牧神园地的博客（参考自：曹天元-上帝掷骰子吗）二，半导体再深入理解上面是对PN型半导体以及PN结的初步理解，是不是觉得半导体的知识也不过如此，一般来说硬件工程师对半导体的认知止于“初步理解”，因为硬件工程师所需的数据...
明翰全日制英国硕士词汇篇V1.3（持续更新）
2021-03-05 09:17

十七号城市的博客下面的所有词汇与例句都是在英国留学期间，学到的、听到的、见到的，都来自英语母语使用者，其中包括：学校、同学、教授、教职人员、以及生活中形形色色的人，这篇文章有助于还没去英国的同学提前掌握一些高频...
《未来简史》的“数据主义”——企业运作就是一套数据算法！
2019-11-27 17:24

nayun123的博客为提升生产效率，泰勒放弃了“经验测量”的方式，开始对搬运矿石这个业务环节进行精确的测量和优化研究，他用秒表计算时间、用尺子计算移动距离，对铲子的大小、每次搬运的重量进行调整改进，他的优化研究甚至具体到...
软件测试基础知识 + 面试理论（超详细）
2021-02-25 10:47

皮皮鱼哟的博客根据我以前的工作和学习经验，我认为做好工作首先要有一个良好的沟通，只有沟通无障碍了，才会有好的协作，才会有更好的效率，再一个就是技术一定要过关，做测试要有足够的耐心，和一个良好的工作习惯，不懂的就要...
没有解决我的问题, 去提问

悬赏问题

¥15 BP神经网络控制倒立摆
¥20 要这个数学建模编程的代码并且能完整允许出来结果完整的过程和数据的结果
¥15 html5+css和javascript有人可以帮吗？图片要怎么插入代码里面啊
¥30 Unity接入微信SDK 无法开启摄像头
¥20 有偿写代码要用特定的软件anaconda 里的jvpyter 用python3写
¥20 cad图纸，chx-3六轴码垛机器人
¥15 移动摄像头专网需要解vlan
¥20 access多表提取相同字段数据并合并
¥20 基于MSP430f5529的MPU6050驱动，求出欧拉角
¥20 Java-Oj-桌布的计算

上帝曾经的类型的效率测量

1条回答 默认 最新

Parallel testing (from multiple goroutines)

悬赏问题

1条回答默认最新