The key pattern of that pipelines post is that you can view the contents of a channel as a stream of data, and write a set of cooperating goroutines that build up a data-processing stream graph. This can be a way to get some concurrency into a data-oriented application.
In terms of design, you may also find it more helpful to build up blocks that aren't tied to the goroutine structure, and wrap them in channels. This makes it much easier to test the lower-level code, and if you change your mind about running things in a goroutine or not, it's easier to add or remove the wrapper.
So in your example I'd start by refactoring the lowest-level tasks out into their own (synchronous) functions:
func fetch(ms int) int {
	time.Sleep(time.Duration(ms) * time.Millisecond)
	return ms
}

func report(ms int) string {
	return fmt.Sprintf("Hello after %d ms", ms)
}
Since the second half of your example is fairly synchronous, it's easy to adapt to the pipeline pattern. We write a function that consumes all of its input stream and produces a complete output stream, closing it when it's done.
func reportAll(mss <-chan int, out chan<- string) {
	for ms := range mss {
		out <- report(ms)
	}
	close(out)
}
The function that calls the asynchronous code is a little trickier. In the main loop of the function, every time you read a value, you need to launch a goroutine to process it. Then, after you've read everything out of the input channel, you need to wait for all of those goroutines to finish before closing the output channel. You can use a sync.WaitGroup and a small anonymous function here to help.
func fetchAll(mss <-chan int, out chan<- int) {
	var wg sync.WaitGroup
	for ms := range mss {
		wg.Add(1)
		go func(ms int) {
			defer wg.Done()
			out <- fetch(ms)
		}(ms)
	}
	wg.Wait()
	close(out)
}
It's also helpful here (because writes to an unbuffered channel block until a reader is ready) to write another function to seed the input values.
func produceInputs(mss chan<- int) {
	for ms := 1000; ms > 0; ms -= 300 {
		mss <- ms
	}
	close(mss)
}
Now your main function needs to create the channels between these and run the final consumer.
// main is the entry point to the program.
//
//	              mss          fetched       results
//	produceInputs --> fetchAll --> reportAll --> main
func main() {
	mss := make(chan int)
	fetched := make(chan int)
	results := make(chan string)
	go produceInputs(mss)
	go fetchAll(mss, fetched)
	go reportAll(fetched, results)
	for val := range results {
		fmt.Println(val)
	}
}
https://play.golang.org/p/V9Z7ECUVIJL is a complete example.
I've avoided manually passing around sync.WaitGroups here (and tend to avoid that in general: you wouldn't have a WaitGroup unless you're explicitly calling something at the top level of a goroutine, so pushing the WaitGroup management up to the caller makes the code more modular; see my fetchAll function above for an example). How do I know all of my goroutines have finished? We can trace through:
- If I've reached the end of main, then the results channel has been closed.
- The results channel is the output channel of reportAll; if it's closed, then that function reached the end of its execution, and if that happened, then the fetched channel is closed.
- The fetched channel is the output channel of fetchAll; ...
Another way to look at this is that as soon as the pipeline's source (produceInputs) closes its output channel and finishes, that "I'm done" signal flows down the pipeline and causes the downstream steps to close their output channels and finish too.
The blog post mentions a separate explicit close channel. I haven't gone into that here at all. Since it was written, though, the standard library gained the context package, which is now the standard idiom for managing cancellation. You'd need to use a select statement in the body of the main loop, which makes the handling a little more complicated. This might look like:
func reportAllCtx(ctx context.Context, mss <-chan int, out chan<- string) {
	defer close(out)
	for {
		select {
		case <-ctx.Done():
			// A bare "break" here would only leave the select
			// statement, not the for loop, so return instead.
			return
		case ms, ok := <-mss:
			if !ok {
				return // input channel closed; we're done
			}
			out <- report(ms)
		}
	}
}