解释预分配切片的基准

I've been trying to understand slice preallocation with make and why it's a good idea. I noticed a large performance difference between preallocating a slice and appending to it vs just initializing it with 0 length/capacity and then appending to it. I wrote a set of very simple benchmarks:

import "testing"

func BenchmarkNoPreallocate(b *testing.B) {
    for i := 0; i < b.N; i++ {
        // Don't preallocate our initial slice
        init := []int64{}
        init = append(init, 5)
    }
}

func BenchmarkPreallocate(b *testing.B) {
    for i := 0; i < b.N; i++ {
        // Preallocate our initial slice
        init := make([]int64, 0, 1)
        init = append(init, 5)
    }
}

and was a little puzzled with the results:

$ go test -bench=. -benchmem
goos: linux
goarch: amd64
BenchmarkNoPreallocate-4    30000000            41.8 ns/op         8 B/op          1 allocs/op
BenchmarkPreallocate-4      2000000000           0.29 ns/op        0 B/op          0 allocs/op

I have a couple of questions:

Why are there no allocations (it shows 0 allocs/op) in the preallocation benchmark case? Certainly we're preallocating, but the allocation had to have happened at some point.
I imagine this may become clearer after the first question is answered, but how is the preallocation case so much quicker? Am I misinterpetting this benchmark?

Please let me know if anything is unclear. Thank you!

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

duanpiao6679 2017-11-09 02:52

关注

Go has an optimizing compiler. Constants are evaluated at compile time. Variables are evaluated at runtime. Constant values can be used to optimize compiler generated code. For example,

package main

import "testing"

func BenchmarkNoPreallocate(b *testing.B) {
    for i := 0; i < b.N; i++ {
        // Don't preallocate our initial slice
        init := []int64{}
        init = append(init, 5)
    }
}

func BenchmarkPreallocateConst(b *testing.B) {
    const (
        l = 0
        c = 1
    )
    for i := 0; i < b.N; i++ {
        // Preallocate our initial slice
        init := make([]int64, l, c)
        init = append(init, 5)
    }
}

func BenchmarkPreallocateVar(b *testing.B) {
    var (
        l = 0
        c = 1
    )
    for i := 0; i < b.N; i++ {
        // Preallocate our initial slice
        init := make([]int64, l, c)
        init = append(init, 5)
    }
}

Output:

$ go test alloc_test.go -bench=. -benchmem
BenchmarkNoPreallocate-4         50000000    39.3 ns/op     8 B/op    1 allocs/op
BenchmarkPreallocateConst-4    2000000000     0.36 ns/op    0 B/op    0 allocs/op
BenchmarkPreallocateVar-4        50000000    28.2 ns/op     8 B/op    1 allocs/op

Another interesting set of benchmarks:

package main

import "testing"

func BenchmarkNoPreallocate(b *testing.B) {
    const (
        l = 0
        c = 8 * 1024
    )
    for i := 0; i < b.N; i++ {
        // Don't preallocate our initial slice
        init := []int64{}
        for j := 0; j < c; j++ {
            init = append(init, 42)
        }
    }
}

func BenchmarkPreallocateConst(b *testing.B) {
    const (
        l = 0
        c = 8 * 1024
    )
    for i := 0; i < b.N; i++ {
        // Preallocate our initial slice
        init := make([]int64, l, c)
        for j := 0; j < cap(init); j++ {
            init = append(init, 42)
        }
    }
}

func BenchmarkPreallocateVar(b *testing.B) {
    var (
        l = 0
        c = 8 * 1024
    )
    for i := 0; i < b.N; i++ {
        // Preallocate our initial slice
        init := make([]int64, l, c)
        for j := 0; j < cap(init); j++ {
            init = append(init, 42)
        }
    }
}

Output:

$ go test peter_test.go -bench=. -benchmem
BenchmarkNoPreallocate-4       20000   75656 ns/op   287992 B/op   19 allocs/op
BenchmarkPreallocateConst-4   100000   22386 ns/op    65536 B/op    1 allocs/op
BenchmarkPreallocateVar-4     100000   22112 ns/op    65536 B/op    1 allocs/op

本回答被题主选为最佳回答 , 对您是否有帮助呢?

报告相同问题？

关注问题

解释预分配切片的基准
2017-11-09 02:05

回答 1 已采纳 Go has an optimizing compiler. Constants are evaluated at compile time. Variables are evaluated at
c++ 如何给list结构体分配预空间 c++ 有问必答
2021-04-02 20:42

回答 3 已采纳据我所知list没有说预分配内存的操作，都是插入一个元素分配一段内存。vector有，reserve函数可以预分配一段大的内存，而不构造对象，等插入的时候再调用复制构造函数。
为什么在预分配的切片上附加索引比建立索引更快？
2017-11-05 17:25

回答 1 已采纳 On computer one: $ go test same_test.go -run=! -bench=. -benchmem -count=3 goos: linux goarch: am
基于图神经网络的切片级漏洞检测及解释方法
2023-06-19 10:08

renhongxia1的博客但基于深度学习的漏洞检测方法尚未完善, 其中, 函数级别的检测方法存在检测粒度较粗且检测准确率较低的问题, 切片级别的检测方法虽然能够有效减少样本噪声, 但仍存在以下两方面的问题: 一方面, 现有
如何以惯用的方式预分配和填充指针切片？
2013-06-03 21:26

回答 4 已采纳 For your first example, I would do: mySlice := make([]*UselessStruct, 5) for i := range mySlice {
Go何时分配新的支持数组进行切片？
2019-03-07 10:33

回答 2 已采纳 The answer is simple: append() allocates a new backing array (and copies current content over) if
JavaScript预解析 javascript
2022-08-15 19:46

回答 2 已采纳不分配内存空间，相同名不会冲突
Go 学习笔记（11）— 切片定义、切片初始化、数组和切片差异、字符串和切片转换、len()、cap()、空 nil 切片、append()、copy() 函数、删除切片元素
2019-08-25 19:19

wohu007的博客 1. 切片的定义 Go 语言切片是对数组的抽象。Go中提供了一种灵活，功能强悍的内置类型切片(“动态数组”),与数组相比切片的长度是不固定的，可以追加元素，在追加时可能使切片的容量增大。 2. 定义切片声明一个未...
jupyter上导入预训练模型 pytorch 机器学习计算机视觉
2022-07-15 01:14

回答 2 已采纳尝试把模型重命名，使其不含中文字符以及符号
JS预解析全局变量问题 javascript
2023-01-29 17:45

回答 2 已采纳你这样再试一下就行了 var a = 1 function fn(){ a = 2 console.log(a); } fn() console.log(a); // 分解 va
vue预约活动进度条里程碑 vue.js
2022-12-09 11:58

回答 1 已采纳提供思路： 1、用两个div，一个div叠在另一个之上，你图片里的就是一个黄色div（上层），一个灰色div（底层） 2、根据人数计算出比例，控制上层div的宽度百分比，再结合overflow:hid
白话概念解释-总结1
2021-01-27 16:01

weixin_ry5219775的博客 LSTM BiLSTM BILSTM是双向LSTM；将前向的LSTM与后向的LSTM结合成LSTM。视图举例如下：双向LSTM 条件随机场个人总结 1.分成两部分一部分是为前面的标注对现在标注的影响转移同一个系统前对后转移矩阵是固定的...
GAN预训练模型的问题 python 有问必答生成对抗网络
2022-08-06 22:52

回答 2 已采纳这个你只能是继续问作者了,别人不清楚论文的前因没办法回答你
谷歌出品！机器学习常用术语总结
2021-07-24 20:00

kaiyuan_sjtu的博客如需查看完整的解释，请参阅这篇论文[1]。 ROC 曲线下面积 (AUC, Area under the ROC Curve) 一种会考虑所有可能分类阈值的评估指标。 ROC 曲线下面积是，对于随机选择的正类别样本确实为正类别，以及随机选择的负...
QLORA:量化LLMA的有效微调
2023-06-28 06:12

AI浩的博客 QLORA通过冻结的4位量化预训练语言模型将梯度反向传播到Low RankAdapters (LoRA)中。我们最好的模型家族，我们命名为Guanaco，在Vicuna基准上优于之前所有公开发布的模型，达到ChatGPT性能水平的99.3%，而只需要在...
没有解决我的问题, 去提问

悬赏问题

¥15 深度学习根据CNN网络模型，搭建BP模型并训练MNIST数据集
¥15 lammps拉伸应力应变曲线分析
¥15 C++ 头文件/宏冲突问题解决
¥15 用comsol模拟大气湍流通过底部加热（温度不同）的腔体
¥50 安卓adb backup备份子用户应用数据失败
¥20 有人能用聚类分析帮我分析一下文本内容嘛
¥15 请问Lammps做复合材料拉伸模拟，应力应变曲线问题
¥30 python代码，帮调试，帮帮忙吧
¥15 #MATLAB仿真#车辆换道路径规划
¥15 java 操作 elasticsearch 8.1 实现索引的重建

码龄粉丝数原力等级 --

解释预分配切片的基准

1条回答默认最新

码龄粉丝数原力等级 --

悬赏问题

解释预分配切片的基准

1条回答 默认 最新

悬赏问题

1条回答默认最新