I need to calculate SHA-256 checksums for files over 1 GB (reading the file in chunks). Currently I am using Python with this:
import hashlib
import time

start_time = time.time()

def sha256sum(filename="big.txt", block_size=2 ** 13):
    sha = hashlib.sha256()
    with open(filename, 'rb') as f:
        for chunk in iter(lambda: f.read(block_size), b''):
            sha.update(chunk)
    return sha.hexdigest()

input_file = '/tmp/1GB.raw'
print 'checksum is: %s\n' % sha256sum(input_file)
print 'Elapsed time: %s' % str(time.time() - start_time)
I wanted to give Go a try, thinking I could get faster results, but after trying the following code, it runs a couple of seconds slower:
package main

import (
    "crypto/sha256"
    "fmt"
    "io"
    "math"
    "os"
    "time"
)

const fileChunk = 8192

func File(file string) string {
    fh, err := os.Open(file)
    if err != nil {
        panic(err.Error())
    }
    defer fh.Close()
    stat, _ := fh.Stat()
    size := stat.Size()
    chunks := uint64(math.Ceil(float64(size) / float64(fileChunk)))
    h := sha256.New()
    for i := uint64(0); i < chunks; i++ {
        csize := int(math.Min(fileChunk, float64(size-int64(i*fileChunk))))
        buf := make([]byte, csize)
        fh.Read(buf)
        io.WriteString(h, string(buf))
    }
    return fmt.Sprintf("%x", h.Sum(nil))
}

func main() {
    start := time.Now()
    fmt.Printf("checksum is: %s\n", File("/tmp/1GB.raw"))
    elapsed := time.Since(start)
    fmt.Printf("Elapsed time: %s\n", elapsed)
}
Any idea how to improve the Go code, if possible? Maybe use more than one CPU core, say one for reading and another for hashing?
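To make that idea concrete, here is a rough sketch of the split I have in mind, one goroutine reading and the main goroutine hashing. This is my own attempt, not code from any answer; the 1 MiB chunk size and the channel depth of 4 are just guesses to experiment with:

package main

import (
    "crypto/sha256"
    "fmt"
    "io"
    "os"
    "runtime"
)

// reader streams the file to the hasher as chunks over a channel, so disk
// reads and hashing can overlap. A fresh buffer is allocated per chunk so
// the hasher owns each slice it receives.
func reader(fh *os.File, out chan<- []byte) {
    defer close(out)
    for {
        buf := make([]byte, 1<<20) // 1 MiB chunks: a guess to experiment with
        n, err := fh.Read(buf)
        if n > 0 {
            out <- buf[:n]
        }
        if err == io.EOF {
            return
        }
        if err != nil {
            panic(err.Error())
        }
    }
}

func main() {
    runtime.GOMAXPROCS(runtime.NumCPU()) // go1.4 defaults to 1

    fh, err := os.Open("/tmp/1GB.raw")
    if err != nil {
        panic(err.Error())
    }
    defer fh.Close()

    chunks := make(chan []byte, 4) // a small queue lets the reader run ahead
    go reader(fh, chunks)

    h := sha256.New()
    for c := range chunks {
        h.Write(c) // hash.Hash writes never return an error
    }
    fmt.Printf("checksum is: %x\n", h.Sum(nil))
}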
Update
As suggested, I am now using this code:
package main

import (
    "crypto/sha256"
    "encoding/hex"
    "fmt"
    "io"
    "os"
    "time"
)

func main() {
    start := time.Now()
    fh, err := os.Open("/tmp/1GB.raw")
    if err != nil {
        panic(err.Error())
    }
    defer fh.Close()

    h := sha256.New()
    // io.Copy streams the whole file into the hash using its own internal buffer
    _, err = io.Copy(h, fh)
    if err != nil {
        panic(err.Error())
    }
    fmt.Println(hex.EncodeToString(h.Sum(nil)))
    fmt.Printf("Elapsed time: %s\n", time.Since(start))
}
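One knob I can think of: as far as I can tell, io.Copy falls back to a fixed 32 KB internal buffer here, so a hand-rolled read loop would at least make the buffer size tunable. A minimal sketch; the 1 MiB buffer is an arbitrary value to experiment with, not a recommendation:

package main

import (
    "crypto/sha256"
    "encoding/hex"
    "fmt"
    "io"
    "os"
    "time"
)

func main() {
    start := time.Now()
    fh, err := os.Open("/tmp/1GB.raw")
    if err != nil {
        panic(err.Error())
    }
    defer fh.Close()

    h := sha256.New()
    buf := make([]byte, 1<<20) // 1 MiB: an arbitrary size to experiment with
    for {
        n, err := fh.Read(buf)
        if n > 0 {
            h.Write(buf[:n]) // hash.Hash writes never return an error
        }
        if err == io.EOF {
            break
        }
        if err != nil {
            panic(err.Error())
        }
    }
    fmt.Println(hex.EncodeToString(h.Sum(nil)))
    fmt.Printf("Elapsed time: %s\n", time.Since(start))
}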
For testing I am creating the 1GB file with this:
# mkfile 1G /tmp/1GB.raw
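mkfile is OS X-specific; on other systems a small helper like this one (my own, it just writes 1 GiB of zero bytes) should produce an equivalent test file:

package main

import "os"

// Writes 1 GiB of zero bytes to /tmp/1GB.raw, as a stand-in for mkfile on
// systems that lack it. Writing real zeros (rather than Truncate) avoids
// creating a sparse file, which would skew the read benchmark.
func main() {
    f, err := os.Create("/tmp/1GB.raw")
    if err != nil {
        panic(err.Error())
    }
    defer f.Close()
    buf := make([]byte, 1<<20)  // 1 MiB of zeros
    for i := 0; i < 1024; i++ { // 1024 * 1 MiB = 1 GiB
        if _, err := f.Write(buf); err != nil {
            panic(err.Error())
        }
    }
}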
The new version is faster, but not by much. What about using channels? Could using more than one CPU/core help? I was expecting an improvement of at least 20%, but unfortunately I am seeing almost no gain.
time result for Python:
5.867u 0.250s 0:06.15 99.3% 0+0k 0+0io 0pf+0w
time result for Go, after compiling (go build) and executing the binary:
5.687u 0.198s 0:05.93 98.9% 0+0k 0+0io 0pf+0w
Any more ideas?
Test results
Using the version with channels posted in the accepted answer by @icza:
Elapsed time: 5.894779733s
Using the version with no channels:
Elapsed time: 5.823489239s
I thought that using channels would help a little, but it seems not to.
I am running this on a MacBook Pro with OS X Yosemite, using this Go version:
go version go1.4.1 darwin/amd64
Update 2
Setting runtime.GOMAXPROCS to 4:
runtime.GOMAXPROCS(4)
Made things faster:
Elapsed time: 5.741511748s
Update 3
Changing the chunk size to 8192 (like in the Python version) gives the expected result:
...
for b, hasMore := make([]byte, 8192<<10), true; hasMore; {
...
Also, I am now using only runtime.GOMAXPROCS(2).
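Fully assembled, what I am running now looks like this. The loop shape comes from the accepted answer, but the surrounding code is my own reconstruction (8192<<10 is an 8 MiB buffer, the value from my test above):

package main

import (
    "crypto/sha256"
    "encoding/hex"
    "fmt"
    "io"
    "os"
    "runtime"
    "time"
)

func main() {
    runtime.GOMAXPROCS(2)
    start := time.Now()
    fh, err := os.Open("/tmp/1GB.raw")
    if err != nil {
        panic(err.Error())
    }
    defer fh.Close()

    h := sha256.New()
    // 8192<<10 is an 8 MiB buffer; the loop shape follows the accepted answer
    for b, hasMore := make([]byte, 8192<<10), true; hasMore; {
        n, err := fh.Read(b)
        if err != nil {
            if err != io.EOF {
                panic(err.Error())
            }
            hasMore = false // last (possibly short) read
        }
        h.Write(b[:n])
    }
    fmt.Println(hex.EncodeToString(h.Sum(nil)))
    fmt.Printf("Elapsed time: %s\n", time.Since(start))
}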