duanjian4698 2014-07-22 18:54
119 views
Accepted

Go: Why is my hashtable implementation so slow?

So I'm trying to make a super-light, deliberately memory-heavy, yet very fast hashtable for very fast lookups, where I don't care about memory usage and I don't care if it makes a rare mistake.

Basically it just creates a ginormous array (yes, an array, not a slice), hashes a string using a modified FNV-1a hash (modified so the hash always falls within the array bounds), and then stores or looks up the value using the hash as the array index. In theory this should be the fastest possible way to store and retrieve a key=>value pair.

This is my benchmark:

package main

import (
    "fmt"
    "time"
)

const dicsize250 = 2097152000 // tested 115 collisions

type Dictionary250_uint16 struct {
    dictionary [dicsize250]uint16
}

func (d *Dictionary250_uint16) Add(s string, v uint16) {
    i := id(s, dicsize250)
    d.dictionary[i] = v
}

func (d *Dictionary250_uint16) Delete(s string) {
    i := id(s, dicsize250)
    d.dictionary[i] = 0
}

func (d *Dictionary250_uint16) Exists(s string) bool {
    i := id(s, dicsize250)
    return d.dictionary[i] != 0
}

func (d *Dictionary250_uint16) Find(s string) uint16 {
    i := id(s, dicsize250)
    return d.dictionary[i]
}

// This is an FNV-1a hash, modified to limit the result to dicsize.
func id(s string, dicsize uint64) uint64 {
    var hash uint64 = 2166136261
    for _, c := range s {
        hash = (hash ^ uint64(c)) * 16777619
    }
    return hash % dicsize
}

var donothing bool
func main() {
    dic := new(Dictionary250_uint16)
    dic.Add(`test1`, 10)
    dic.Add(`test2`, 20)
    dic.Add(`test3`, 30)
    dic.Add(`test4`, 40)
    dic.Add(`test5`, 50)

    mc := make(map[string]uint16)
    mc[`test1`] = 10
    mc[`test2`] = 10
    mc[`test3`] = 10
    mc[`test4`] = 10
    mc[`test5`] = 10

    var t1 uint
    var t2 uint
    var t3 uint
    donothing = true

    // Dic hit
    t1 = uint(time.Now().UnixNano())
    for i := 0; i < 50000000; i++ {
        if dic.Exists(`test4`) {
            donothing = true
        }
    }
    t3 = uint(time.Now().UnixNano())
    t2 = t3 - t1
    fmt.Println("Dic (hit) took ", t2)

    // Dic miss
    t1 = uint(time.Now().UnixNano())
    for i := 0; i < 50000000; i++ {
        if dic.Exists(`whate`) {
            donothing = true
        }
    }
    t3 = uint(time.Now().UnixNano())
    t2 = t3 - t1
    fmt.Println("Dic (miss) took ", t2)

    // Map hit
    t1 = uint(time.Now().UnixNano())
    for i := 0; i < 50000000; i++ {
        _, ok := mc[`test4`]
        if ok {
            donothing = true
        }
    }
    t3 = uint(time.Now().UnixNano())
    t2 = t3 - t1
    fmt.Println("Map (hit) took ", t2)

    // Map miss
    t1 = uint(time.Now().UnixNano())
    for i := 0; i < 50000000; i++ {
        _, ok := mc[`whate`]
        if ok {
            donothing = true
        }
    }
    t3 = uint(time.Now().UnixNano())
    t2 = t3 - t1
    fmt.Println("Map (miss) took ", t2)

    donothing = false
}

The results I get are:

Dic (hit) took  2,858,604,059
Dic (miss) took  2,457,173,526
Map (hit) took  1,574,306,146
Map (miss) took  2,525,206,080

Basically my hashtable implementation is a lot slower, especially on hits, than just using a map. I don't see how this is possible, since map is a heavy implementation (in comparison) that never has any collisions and does a lot more computation, whereas my implementation is super simple and relies on having a massive array covering all possible indices.

What am I doing wrong?


2 answers

  • dougan6982 2014-07-22 19:59

    For one thing, you're using a very large amount of memory compared to the built-in map, but that's a trade-off you mentioned you wanted to make.
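
    (For a rough sense of scale: the backing array alone holds 2,097,152,000 uint16 entries, i.e. 2,097,152,000 × 2 bytes ≈ 4 GB, before anything else is allocated.)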

    Use the standard library's benchmark utilities. They will give you a solid base to work from, easier access to profiling, and remove a lot of the guesswork. I had a moment to cut and paste some of your code into a benchmark:

    func BenchmarkDictHit(b *testing.B) {
        donothing = true
    
        dic := new(Dictionary250_uint16)
        dic.Add(`test1`, 10)
        dic.Add(`test2`, 20)
        dic.Add(`test3`, 30)
        dic.Add(`test4`, 40)
        dic.Add(`test5`, 50)
    
        // The initial Dict allocation is very expensive!
        b.ResetTimer()
    
        for i := 0; i < b.N; i++ {
            if dic.Exists(`test4`) {
                donothing = true
            }
        }
    }
    
    func BenchmarkDictMiss(b *testing.B) {
        donothing = true
    
        dic := new(Dictionary250_uint16)
        dic.Add(`test1`, 10)
        dic.Add(`test2`, 20)
        dic.Add(`test3`, 30)
        dic.Add(`test4`, 40)
        dic.Add(`test5`, 50)
    
        // The initial Dict allocation is very expensive!
        b.ResetTimer()
    
        for i := 0; i < b.N; i++ {
            if dic.Exists(`test6`) {
                donothing = true
            }
        }
    }
    
    func BenchmarkMapHit(b *testing.B) {
        donothing = true
        mc := make(map[string]uint16)
        mc[`test1`] = 10
        mc[`test2`] = 10
        mc[`test3`] = 10
        mc[`test4`] = 10
        mc[`test5`] = 10
    
        b.ResetTimer()
    
        // Map hit
        for i := 0; i < b.N; i++ {
            _, ok := mc[`test4`]
            if ok {
                donothing = true
            }
        }
    
        donothing = false
    }

    func BenchmarkMapMiss(b *testing.B) {
        donothing = true
        mc := make(map[string]uint16)
        mc[`test1`] = 10
        mc[`test2`] = 10
        mc[`test3`] = 10
        mc[`test4`] = 10
        mc[`test5`] = 10
    
        b.ResetTimer()
    
        for i := 0; i < b.N; i++ {
            _, ok := mc[`test6`]
            if ok {
                donothing = true
            }
        }
        donothing = false
    }
    

    Without the ResetTimer() call, the initial allocation of your backing array dominates the benchmark time, and even when amortized across the runs it skews the results heavily. After the reset, the benchmark times look like this:

    BenchmarkDictHit    50000000            39.6 ns/op         0 B/op          0 allocs/op
    BenchmarkDictMiss   50000000            39.1 ns/op         0 B/op          0 allocs/op
    BenchmarkMapHit 100000000           22.9 ns/op         0 B/op          0 allocs/op
    BenchmarkMapMiss    50000000            36.8 ns/op         0 B/op          0 allocs/op
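
    These numbers come from Go's standard benchmark runner. An invocation along the following lines produces this kind of output (the exact command isn't part of the original answer, so treat it as an assumption; -benchmem adds the B/op and allocs/op columns):

        go test -bench=. -benchmem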
    

    Your id function needs to iterate over a string. With strings, range doesn't iterate over bytes; it decodes runes, which is more expensive. You will want to index the string directly, or possibly use []byte throughout (about the same cost either way). With better string handling (see the sketch after these timings), these are the final timings from my test:

    BenchmarkDictHit    100000000           17.8 ns/op         0 B/op          0 allocs/op
    BenchmarkDictMiss   100000000           17.2 ns/op         0 B/op          0 allocs/op
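
    A byte-indexed id along those lines might look like the sketch below. This is only an illustration of the advice above, not the answer's actual code, and idBytes is a hypothetical name:

    // idBytes is a hypothetical byte-indexed variant of id. Indexing the string
    // directly (s[i]) yields raw bytes, so no UTF-8 rune decoding happens in the
    // loop, unlike `for _, c := range s`.
    func idBytes(s string, dicsize uint64) uint64 {
        var hash uint64 = 2166136261
        for i := 0; i < len(s); i++ {
            hash = (hash ^ uint64(s[i])) * 16777619
        }
        return hash % dicsize
    }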
    
    This answer was accepted as the best answer by the asker.
