如何对修改输入的函数进行基准测试？

When I'm benchmarking a function that modifies its input, I have to copy the test data for each loop of the benchmark, and pause the timer while I'm doing so. This can mean that if I run go test -bench MyTest -benchtime 1s the test can take 2 full minutes rather than 1 second.

Am I doing something wrong or will I just have to live with this?

More context:

I'm writing a program for reading syslog logs. Part of my logging paradigm is that the first line of a logged message contains readable text, and following lines contain "extra information", like a stack trace. My log reader therefore (among other things) splits the message on the first line break, which is escaped to #012 by rsyslog.

Here is the code for that:

// Splits the main line from extra information
func splitMessageExtra(line *string) string {
    var prev rune

    for i, char := range *line {
        if prev == 0 && char == '#' {
            prev = char
            continue
        }

        if prev == '#' && char == '0' {
            prev = char
            continue
        }

        if prev == '0' && char == '1' {
            prev = char
            continue
        }

        if prev == '1' && char == '2' {
            extra := (*line)[i+1:]
            *line = (*line)[0 : i-3]

            return extra
        }

        prev = 0
    }

    return ""
}

It originally used strings.Split and returned new strings, but cpu profiling showed that it was way too slow.

Here is the benchmark function:

var testMessage = `Feb 10 15:16:20 foo_stats[-] (warning): [foo_stats.postfix, line 166, thread "processor_mta03"]: Skipped line because there is no context:#012Feb 10 15:16:20 mta03 postfix/qmgr[7419]: ABCDEF123: from=<>, size=24431, nrcpt=1 (queue active)`

func BenchmarkSplitMessageExtra(b *testing.B) {
    for i := 0; i < b.N; i++ {
        b.StopTimer()
        msg := string([]byte(testMessage))
        b.StartTimer()

        splitMessageExtra(&msg)
    }
}

Here's a run without pausing the timer:

$ go test -bench SplitMessageExtra -benchtime 1s
BenchmarkSplitMessageExtra-8     3000000           434 ns/op
PASS
ok      github.com/Hubro/logreader  1.730s

And here's a run with the exact benchmark function above:

$ go test -bench SplitMessageExtra -benchtime 1s
BenchmarkSplitMessageExtra-8     5000000           385 ns/op
PASS
ok      github.com/Hubro/logreader  100.563s

Notice it takes AGES to run.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

duanrong3308 2017-02-26 09:20

关注

Your code and benchmark do seem slow. Here's a faster version.

package main

import (
    "strings"
    "testing"
)

// Splits the main line from extra information
func splitMessageExtra(line *string) string {
    const newline = "#012"
    i := strings.Index(*line, newline)
    if i < 0 {
        return ""
    }
    extra := (*line)[i+len(newline):]
    *line = (*line)[0:i]
    return extra
}

var testMessage = `Feb 10 15:16:20 foo_stats[-] (warning): [foo_stats.postfix, line 166, thread "processor_mta03"]: Skipped line because there is no context:#012Feb 10 15:16:20 mta03 postfix/qmgr[7419]: ABCDEF123: from=<>, size=24431, nrcpt=1 (queue active)`

func BenchmarkSplitMessageExtra(b *testing.B) {
    for i := 0; i < b.N; i++ {
        msg := testMessage
        splitMessageExtra(&msg)
    }
}

Output:

$ go test -bench=.
goos: linux
goarch: amd64
pkg: extra
BenchmarkSplitMessageExtra-4    50000000            32.2 ns/op
PASS
ok      extra   1.647s

For comparison, here are the results from your code and benchmark. Your code and benchmark are slower than mine: 968 ns/op and 50.184s versus 32.2 ns/op and 1.647s respectively.

package main

import (
    "testing"
)

// Splits the main line from extra information
func splitMessageExtra(line *string) string {
    var prev rune
    for i, char := range *line {
        if prev == 0 && char == '#' {
            prev = char
            continue
        }
        if prev == '#' && char == '0' {
            prev = char
            continue
        }
        if prev == '0' && char == '1' {
            prev = char
            continue
        }
        if prev == '1' && char == '2' {
            extra := (*line)[i+1:]
            *line = (*line)[0 : i-3]

            return extra
        }
        prev = 0
    }
    return ""
}

var testMessage = `Feb 10 15:16:20 foo_stats[-] (warning): [foo_stats.postfix, line 166, thread "processor_mta03"]: Skipped line because there is no context:#012Feb 10 15:16:20 mta03 postfix/qmgr[7419]: ABCDEF123: from=<>, size=24431, nrcpt=1 (queue active)`

func BenchmarkSplitMessageExtra(b *testing.B) {
    for i := 0; i < b.N; i++ {
        b.StopTimer()
        msg := string([]byte(testMessage))
        b.StartTimer()
        splitMessageExtra(&msg)
    }
}

Output:

$ go test -bench=.
goos: linux
goarch: amd64
pkg: extra
BenchmarkSplitMessageExtra-4     2000000           968 ns/op    
PASS
ok      extra   50.184s

Some of your code is unnecessary; it uses CPU time and triggers allocations. For example, converting to utf-8 bytes to runes,for i, char := range *line {}, and converting string to []byte to string, string([]byte(testMessage)) . Some algorithms could be improved. For example, searching for a newline.

报告相同问题？

关注问题

用Xcode写C程序的main函数参数怎么输入？ xcode
2017-03-10 05:20

回答 1 已采纳可以在运行前设置参数，菜单项为： **Product -> Edit Scheme... -> Run -> Arguments** 对应的快捷键为： **cmd
请问matlab自定义函数输入参数不足该怎么解决呢？ matlab 有问必答
2021-08-11 22:01

回答 2 已采纳将代码改为如下形式即可： a = [1 1 1 0 0 1]; b = [7,5,5]; c1 = [-3 -1]; c2 = [-1 -2]; [x1,g1] = linprog(c
请问这个对链表进行排序的函数有什么问题？ c++ 链表
2015-06-14 07:36

回答 2 已采纳我大概知道了，复制内存的时候指针也被复制过去了。
【go语言】3.3.1 单元测试和基准测试
2023-08-02 23:08

移动安全星球的博客 Go 语言的testing包为编写单元测试和基准测试提供了强大的支持。单元测试用于验证代码的正确性，基准测试用于测量代码的性能。
clearerr函数使用后可以继续对文件进行操作吗？
2016-06-08 06:32

回答 2 已采纳可以成功，不过你的fgets使得文件指针已经后移了1个字节，所以最好fseek向前一个字节 ``` void main() { FILE *pf=fopen("F:\\1.txt","w"
C++中如何控制某函数的运行时间？ c++ c语言开发语言
2019-11-07 20:07

回答 2 已采纳 ``` 如果能确保function()的执行时间小于100ms 可以写 #include time_t clk1 = clock(); .clk2 = clock(); wh
MATLAB运行函数显示出输入参数数目不足，怎么办？ matlab 有问必答
2021-05-09 23:23

回答 3 已采纳你看看你26，27行代码的match是什么，如果是调用函数match，就要传入两个参数，好似11行的代码那样，如果是个变量名，那就是和函数match重命名了，冲突了
Mysql - 基准测试和压力测试
2023-09-30 17:22

yueerba126的博客 ab 是 Apache HTTP 服务器的基准测试工具。它用于测试 HTTP 服务器每秒能够处理多少请求。对于 Web 应用程序服务，此结果可以转化为整个应用程序每秒可满足的请求数量。
如何用构造函数来初始化？？？？
2015-04-23 02:50

回答 3 已采纳 DateTime(int y, int m, int d) : date(y, m, d):time(y,m,d) 第二个冒号改为逗号
如下，构造函数会返回值？？ c++
2015-05-05 05:32

回答 3 已采纳生成一个默认的List赋值给当前对象，从而达到清空当前对象的内容
c++函数构造出现问题? c++
2017-02-01 03:21

回答 2 已采纳 > c = (5 / 9 * (f - 32)); 因为在C++里运算是从左边到右边的，/和\*的运算级别是一样的，所以先算5 / 9 在C++里其计算结果为0，所以 0 \* （f -
Java基准测试工具JMH使用
2022-02-05 17:52

流子的博客 JMH，即Java Microbenchmark Harness，这是专门用于进行代码的微基准测试的一套工具API。 JMH 由 OpenJDK/Oracle 里面那群开发了 Java 编译器的大牛们所开发。何谓 Micro Benchmark 呢？简单地说就是在方法层面上...
atoi函数代码怎么理解？
2015-10-20 12:15

回答 5 已采纳函数实现功能是将一个字符串转换成整数。如：“-12345” 转换成 -12345 转换过程：从左到右依次遍历每一个字符。首先判断是正数还是负数（ispnum做标记，true表示整数，fals
go benchmark 基准测试
2023-03-30 20:33

pakano的博客 go 基准测试
Golang单元测试、Mock测试以及基准测试
2022-06-29 10:41

小菜鸡本菜的博客单元测试主要包括：输入、测试单元、输出、期望以及与期望的校对。测试单元包括函数或者结合了一些函数的模块等。我们通过将输出与期望值进行校对，来验证代码的正确性。通过单元测试，可以一方面保证质量，例如在...
没有解决我的问题, 去提问

悬赏问题

¥15 HFSS 中的 H 场图与 MATLAB 中绘制的 B1 场部分对应不上
¥15 如何在scanpy上做差异基因和通路富集？
¥20 关于#硬件工程#的问题，请各位专家解答！
¥15 关于#matlab#的问题：期望的系统闭环传递函数为G(s)=wn^2/s^2+2¢wn+wn^2阻尼系数¢=0.707，使系统具有较小的超调量
¥15 FLUENT如何实现在堆积颗粒的上表面加载高斯热源
¥30 截图中的mathematics程序转换成matlab
¥15 动力学代码报错，维度不匹配
¥15 Power query添加列问题
¥50 Kubernetes&Fission&Eleasticsearch
¥15 報錯：Person is not mapped，如何解決？

码龄粉丝数原力等级 --

如何对修改输入的函数进行基准测试？

2条回答默认最新

码龄粉丝数原力等级 --

悬赏问题

如何对修改输入的函数进行基准测试？

2条回答 默认 最新

悬赏问题

2条回答默认最新