Poor performance when using goroutines in a parallel quicksort implementation

Note: The "Go-lang parallel segment runs slower than series segment" question dealt with race conditions; this one has a different issue, so IMHO it's not a duplicate.

I'm trying to find an explanation for the following situation: my quicksort takes significantly longer when its recursive calls run in goroutines than when they run sequentially.

Benchmarks follow the code:

package c9sort

import (
    "time"
)

var runInParllel bool

func Quicksort(nums []int, parallel bool) ([]int, int) {
    started := time.Now()
    ch := make(chan int)
    runInParllel = parallel

    go quicksort(nums, ch)

    sorted := make([]int, len(nums))
    i := 0
    for next := range ch {
        sorted[i] = next
        i++
    }
    return sorted, int(time.Since(started).Nanoseconds() / 1000000)
}

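func quicksort(nums []int, ch chan int) {
    // Nothing to sort: close the channel so the receiver's range loop ends
    if len(nums) == 0 {
        close(ch)
        return
    }

    // Choose first number as pivot
    pivot := nums[0]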

    // Prepare secondary slices
    smallerThanPivot := make([]int, 0)
    largerThanPivot := make([]int, 0)

    // Slice except pivot
    nums = nums[1:]

    // Go over slice and sort
    for _, i := range nums {
        switch {
        case i <= pivot:
            smallerThanPivot = append(smallerThanPivot, i)
        case i > pivot:
            largerThanPivot = append(largerThanPivot, i)
        }
    }

    var ch1 chan int
    var ch2 chan int

    // Now do the same for the two slices
    if len(smallerThanPivot) > 1 {
        ch1 = make(chan int, len(smallerThanPivot))
        if runInParllel {
            go quicksort(smallerThanPivot, ch1)
        } else {
            quicksort(smallerThanPivot, ch1)
        }
    }
    if len(largerThanPivot) > 1 {
        ch2 = make(chan int, len(largerThanPivot))
        if runInParllel {
            go quicksort(largerThanPivot, ch2)
        } else {
            quicksort(largerThanPivot, ch2)
        }
    }

    // Wait until the sorting finishes for the smaller slice
    if len(smallerThanPivot) > 1 {
        for i := range ch1 {
            ch <- i
        }
    } else if len(smallerThanPivot) == 1 {
        ch <- smallerThanPivot[0]
    }
    ch <- pivot

    if len(largerThanPivot) > 1 {
        for i := range ch2 {
            ch <- i
        }
    } else if len(largerThanPivot) == 1 {
        ch <- largerThanPivot[0]
    }

    close(ch)
}

Benchmarks for a random permutation of 500,000 integers:

Ran 100 times

Non parallel average - 1866ms

Parallel average - 2437ms

Any explanation would be appreciated. I know goroutines may not be best for this kind of parallelism, but I'm trying to understand the reason.

Thank you in advance.

dougang1967 2015-03-21 05:05
Turns out it was very simple. As I'm on a new machine, the GOMAXPROCS variable wasn't set.

As predicted, the new benchmark favors the parallel implementation. With GOMAXPROCS set to double the number of cores:

Using 16 goroutines
Ran 100 times
Non parallel average - 1980
Parallel average - 1133

With GOMAXPROCS set to the number of cores:

Using 8 goroutines
Ran 100 times
Non parallel average - 2004
Parallel average - 1197

By the way, this is fairly consistent. The average for double the number of cores is always a bit better.

Benchmark for a larger collection (1,000,000 integers):

Using 8 goroutines
Ran 100 times
Non parallel average - 3748
Parallel average - 2265

With GOMAXPROCS set to double the number of cores:

Using 16 goroutines
Ran 100 times
Non parallel average - 3817
Parallel average - 2012
The asker selected this answer as the best answer.
