Random performance drops

I'm kind of a newbie in Go and there is something that confused me recently.

I have a piece of code (a simplified version is posted below) and I was trying to measure its performance. I did this in two ways: 1) a benchmark with the testing package, 2) manually logging the time.

Running the benchmark outputs a result

30000 55603 ns/op

which is fine, BUT... When I do the 30k runs of the same function logging the time for each iteration I get an output like this:

test took 0 ns
test took 0 ns
... ~10 records all the same
test took 1000100 ns
test took 0 ns
test took 0 ns
... lots of zeroes again
test took 0 ns
test took 1000000 ns
test took 0 ns
...

Doing the math shows that the average is indeed 55603 ns/op just as the benchmark claims.

Ok, I said, I'm not that good at optimizing performance and not that into all the hardcore compiler stuff, but I guess that might be random garbage collection? So I turned on the gc log, made sure it shows some output, then turned off the gc for good aaand... no garbage collection, but I see the same picture: some iterations take a million times longer(?).

I'm 99% sure that my understanding of all this is wrong somewhere. Maybe someone can point me in the right direction, or maybe someone knows for sure what the hell is going on? :)

P.S. Also, less than a nanosecond (0 ns) is somewhat surprising to me. That seems too fast, but the program does produce the result of the computation, so I don't know what to think anymore. T_T

EDIT 1: Answering Kenny Grant's question: I was using goroutines to implement a sort-of generator of values to get laziness; now I have removed them and simplified the code. The issue is much less frequent now, but it is still reproducible. Playground link: https://play.golang.org/p/UQMgtT4Jrf The interesting thing is that it does not happen on the playground, but still happens on my machine.

EDIT 2: I'm running Go 1.9 on win7 x64

EDIT 3: Thanks to the responses I now know that this code cannot possibly work properly on the playground. I will repost the code snippet here so that we don't lose it. :)

type PrefType string
var types []PrefType = []PrefType{
    "TYPE1", "TYPE2", "TYPE3", "TYPE4", "TYPE5", "TYPE6",
}

func GetKeys(key string) []string {
    var result []string
    for _, t := range types {
        rr := doCalculations(t)
        for _, k := range rr {
            result = append(result, key + "." + k)
        }
    }
    return result
}

func doCalculations(prefType PrefType) []string {
    return []string{ string(prefType) + "something", string(prefType) + "else" }
}

func test() {
    start := time.Now()
    keysPrioritized := GetKeys("spec_key")
    for _, k := range keysPrioritized {
        _ = fmt.Sprint(k)
    }
    fmt.Printf("test took %v ns\n", time.Since(start).Nanoseconds())
}

func main() {
    for i := 0; i < 30000; i++  {
        test()
    }
}

Here is the output on my machine:

EDIT 4: I have tried the same on my laptop with Ubuntu 17.04, and the output is reasonable: no zeroes and no millions. Seems like a Windows-specific issue in the compiler/runtime lib. It would be great if someone could verify this on their machine (Win 7/8/10).


2 answers

douzhi8488 2017-09-02 15:06


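On Windows, the time stamps are not precise enough to measure such a tiny duration: in the output below, two consecutive readings are either identical or about a millisecond apart. Linux has more precise time stamps. By design, Go benchmarks run for at least one second, so the coarseness of the clock barely affects the per-operation average. Go 1.9+ uses the monotonic clock reading (the m= value) to compute the duration.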
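A rough sketch of why a long-running measurement is less sensitive to a coarse clock: time many calls with a single pair of clock readings and divide, which is broadly the idea behind timing b.N iterations in a benchmark. The helper name perOp and the stand-in workload are assumptions made for this illustration; they are not part of the code in the question.

```go
package main

import (
	"fmt"
	"time"
)

// perOp (hypothetical helper) times n consecutive calls of f with one
// pair of clock readings and returns the mean duration of a single call.
// A clock error of about one tick is spread across all n calls.
func perOp(n int, f func()) time.Duration {
	start := time.Now()
	for i := 0; i < n; i++ {
		f()
	}
	return time.Since(start) / time.Duration(n)
}

func main() {
	// Stand-in workload, assumed here only for illustration.
	work := func() { _ = fmt.Sprint("spec_key", ".", "TYPE1something") }
	fmt.Printf("mean per call: %v\n", perOp(30000, work))
}
```

With 30000 calls in one measurement, a clock that only advances in millisecond steps contributes an error of at most a few tens of nanoseconds per call, instead of producing a mix of 0 ns and 1000000 ns readings.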

On Windows:

timedur.go:

package main

import (
    "fmt"
    "os"
    "time"
)

type PrefType string

var types []PrefType = []PrefType{
    "TYPE1", "TYPE2", "TYPE3", "TYPE4", "TYPE5", "TYPE6",
}

func GetKeys(key string) []string {
    var result []string
    for _, t := range types {
        rr := doCalculations(t)
        for _, k := range rr {
            result = append(result, key+"."+k)
        }
    }
    return result
}

func doCalculations(prefType PrefType) []string {
    return []string{string(prefType) + "something", string(prefType) + "else"}
}

func test() {
    start := time.Now()
    keysPrioritized := GetKeys("spec_key")
    for _, k := range keysPrioritized {
        _ = fmt.Sprint(k)
    }
    end := time.Now()
    fmt.Printf("test took %v ns\n", time.Since(start).Nanoseconds())
    fmt.Println(start)
    fmt.Println(end)
    if end.Sub(start) < time.Microsecond {
        os.Exit(1)
    }
}

func main() {
    for i := 0; i < 30000; i++ {
        test()
    }
}

Output:

>go run timedur.go
test took 1026000 ns
2017-09-02 14:21:58.1488675 -0700 PDT m=+0.010003700
2017-09-02 14:21:58.1498935 -0700 PDT m=+0.011029700
test took 0 ns
2017-09-02 14:21:58.1538658 -0700 PDT m=+0.015002000
2017-09-02 14:21:58.1538658 -0700 PDT m=+0.015002000
exit status 1
>

On Linux:

Output:

$ go run timedur.go
test took 113641 ns
2017-09-02 14:52:02.917175333 +0000 UTC m=+0.001041249
2017-09-02 14:52:02.917287569 +0000 UTC m=+0.001153717
test took 23614 ns
2017-09-02 14:52:02.917600301 +0000 UTC m=+0.001466208
2017-09-02 14:52:02.917623585 +0000 UTC m=+0.001489354
test took 22814 ns
2017-09-02 14:52:02.917726364 +0000 UTC m=+0.001592236
2017-09-02 14:52:02.917748805 +0000 UTC m=+0.001614575
test took 21139 ns
2017-09-02 14:52:02.917818409 +0000 UTC m=+0.001684292
2017-09-02 14:52:02.917839184 +0000 UTC m=+0.001704954
test took 21478 ns
2017-09-02 14:52:02.917911899 +0000 UTC m=+0.001777712
2017-09-02 14:52:02.917932944 +0000 UTC m=+0.001798712
test took 31032 ns
<SNIP>

The results are comparable. They were run on the same machine, a dual-boot with Windows 10 and Ubuntu 16.04.
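To check how coarse the clock is on a particular machine, one option is to spin until the reading changes and look at the size of the step. This is a minimal sketch; the function names clockStep and smallestStep are made up for this example, and the number it prints depends on the OS and Go version.

```go
package main

import (
	"fmt"
	"time"
)

// clockStep busy-waits until the clock reading changes and returns the
// size of that change: one observable "tick" of time.Now.
func clockStep() time.Duration {
	start := time.Now()
	for {
		if d := time.Since(start); d > 0 {
			return d
		}
	}
}

// smallestStep repeats the probe and keeps the minimum, since a single
// sample can be inflated by scheduling noise.
func smallestStep(samples int) time.Duration {
	best := clockStep()
	for i := 1; i < samples; i++ {
		if d := clockStep(); d < best {
			best = d
		}
	}
	return best
}

func main() {
	fmt.Printf("smallest observed clock step: %v\n", smallestStep(100))
}
```

Any duration shorter than the reported step cannot be measured reliably with a single pair of time.Now calls on that machine.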

This answer was selected by the asker as the best answer.
