This is a follow-up to my earlier post:
http://stackoverflow.com/questions/34736825/goroutine-exit-status-2-what-does-it-mean-why-is-it-happening?noredirect=1#comment57238789_34736825
After reading multiple topics and articles, both on and off SO, I'm still having trouble figuring out where the channels should be closed.
This program opens a list of files, creates an output file for each input file (with the same name), visits all the URLs listed in each input file, and extracts every href link from the responses, saving them to the corresponding output file. However, I'm getting an error (the full code is on the playground):
http://play.golang.org/p/8X-1rM3aXC
The linkGetter and getHref functions are mainly for processing. head and tail run as separate goroutines, and worker does the processing.
package main

import (
    "bufio"
    "bytes"
    "fmt"
    "golang.org/x/net/html"
    "io"
    "io/ioutil"
    "log"
    "net/http"
    "os"
    "path/filepath"
    "regexp"
    "sync"
)
type Work struct {
    Link     string
    Filename string
}

type Output struct {
    Href     string
    Filename string
}
func getHref(t html.Token) (href string, ok bool) {
    // Iterate over all of the Token's attributes until we find an "href"
    for _, a := range t.Attr {
        if a.Key == "href" {
            href = a.Val
            ok = true
        }
    }
    return
}
func linkGetter(out chan<- Output, r io.Reader, filename string) {
    z := html.NewTokenizer(r)
    for {
        tt := z.Next()
        switch {
        case tt == html.ErrorToken:
            return
        case tt == html.StartTagToken:
            t := z.Token()
            isAnchor := t.Data == "a"
            if !isAnchor {
                continue
            }
            // Extract the href value, if there is one
            url, ok := getHref(t)
            if !ok {
                continue
            }
            out <- Output{url, filename}
        }
    }
}
func worker(out chan<- Output, in <-chan Work, wg *sync.WaitGroup) {
    defer wg.Done()
    for work := range in {
        resp, err := http.Get(work.Link)
        if err != nil {
            continue
        }
        body, err := ioutil.ReadAll(resp.Body)
        if err != nil {
            continue
        }
        if err = resp.Body.Close(); err != nil {
            fmt.Println(err)
        }
        linkGetter(out, bytes.NewReader(body), work.Filename)
    }
}
func head(c chan<- Work) {
    r, _ := regexp.Compile("(.*)(?:.json)")
    files, _ := filepath.Glob("*.json")
    for _, elem := range files {
        res := r.FindStringSubmatch(elem)
        for k, v := range res {
            if k == 0 {
                outpath, _ := filepath.Abs(fmt.Sprintf("go_tester/%s", v))
                abspath, _ := filepath.Abs(fmt.Sprintf("url_links/%s", v))
                f, _ := os.Open(abspath)
                scanner := bufio.NewScanner(f)
                for scanner.Scan() {
                    c <- Work{outpath, scanner.Text()}
                }
            }
        }
    }
}
func tail(c <-chan Output) {
    currentfile := ""
    var f *os.File
    var err error
    for out := range c {
        if out.Filename != currentfile {
            if err = f.Close(); err != nil { // might cause an error on first run
                fmt.Println(err)
            }
            f, err = os.OpenFile(out.Filename, os.O_APPEND|os.O_WRONLY, 0600)
            if err != nil {
                log.Fatal(err)
            }
            currentfile = out.Filename
        }
        if _, err = f.WriteString(out.Href + "\n"); err != nil {
            fmt.Println(err)
        }
    }
}
const (
    nworkers = 80
)

func main() {
    //fmt.Println("hi")
    in := make(chan Work)
    out := make(chan Output)
    go head(in)
    go tail(out)
    var wg sync.WaitGroup
    for i := 0; i < 85; i++ {
        wg.Add(1)
        go worker(out, in, &wg)
    }
    close(in)
    close(out)
    wg.Wait()
}

What is wrong with the way the channels are closed?
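Is the fix something along these lines? This is just a rough, untested sketch of main (keeping head, tail, and worker exactly as above); the done channel is my own guess at how main could wait for tail to finish, not something from the original code.

func main() {
    in := make(chan Work)
    out := make(chan Output)
    done := make(chan struct{}) // hypothetical: lets main wait for tail to finish

    // Close 'in' only after head has finished sending on it.
    go func() {
        head(in)
        close(in)
    }()

    // Signal 'done' only after tail has drained 'out' completely.
    go func() {
        tail(out)
        close(done)
    }()

    var wg sync.WaitGroup
    for i := 0; i < nworkers; i++ {
        wg.Add(1)
        go worker(out, in, &wg)
    }

    wg.Wait()  // all workers have returned, so nothing sends on 'out' any more
    close(out) // now it should be safe to close 'out', ending tail's range loop
    <-done     // wait for tail to finish writing before main exits
}

The idea (as I understand it) is that only the sender closes a channel, and only after it is done sending. Is that the right way to think about it here, or am I missing something?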