dozr13344 2018-06-13 18:42
浏览 15
已采纳

样品无需更换在golang中

What's the best way to sample without replacement from a slice in golang?

a := make([]int, 100)
for i := range a {
    a[i] = i
}

# TODO sample 5 elements from a without replacement.
  • 写回答

2条回答 默认 最新

  • dongnong7524 2018-06-13 18:48
    关注

    If the set size is relatively small overall, or you are sampling a large portion of the set, the simplest method is to shuffle the elements and pick the first n:

    rand.Shuffle(len(a), func(i, j int) { a[i], a[j] = a[j], a[i] })
    fmt.Println(a[:5])
    

    https://play.golang.org/p/lQx44Mn9RQL

    If you don't want to shuffle the entire set, but it's acceptable to alter the order of the set (or copy the entire set), you can "record" the used values more efficiently by removing them from the slice.

    // create a copy of the slice header
    c := a
    samples := make([]int, n)
    
    for i := 0; i < n; i++ {
        r := int(rand.Int63n(int64(len(c))))
        samples[i] = c[r]
    
        // remove the sample from the copy slice
        c[r], c[len(c)-1] = c[len(c)-1], c[r]
        c = c[:len(c)-1]
    }
    

    In the case that the set size is quite large and you are sampling only a small portion, you can sample from the original set without modification by recording the sample index and not repeating it. Obviously as the ratio of the sample size to the set size grows, the number of collisions will grow making this less efficient.

    For example:

    // record indexes here to prevent duplicates
    indexes := make(map[int]bool)
    
    // create n random indexes
    for i := 0; i < n; i++ {
        var r int
        for {
            r = int(rand.Int63n(int64(len(a))))
            if indexes[r] {
                continue
            }
            break
        }
    
        indexes[r] = true
    }
    
    samples := make([]int, 0, n)
    for i := range indexes {
        samples = append(samples, a[i])
    }
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 远程桌面文档内容复制粘贴,格式会变化
  • ¥15 关于#java#的问题:找一份能快速看完mooc视频的代码
  • ¥15 这种微信登录授权 谁可以做啊
  • ¥15 请问我该如何添加自己的数据去运行蚁群算法代码
  • ¥20 用HslCommunication 连接欧姆龙 plc有时会连接失败。报异常为“未知错误”
  • ¥15 网络设备配置与管理这个该怎么弄
  • ¥20 机器学习能否像多层线性模型一样处理嵌套数据
  • ¥20 西门子S7-Graph,S7-300,梯形图
  • ¥50 用易语言http 访问不了网页
  • ¥50 safari浏览器fetch提交数据后数据丢失问题