douchui3933 2018-09-03 23:12
浏览 2
已采纳

结果与并发功能不一致?

I am trying to process lines from a file concurrently, but for some reason I appear to be getting inconsistent results. A simplified version of my code is below:

  var wg sync.WaitGroup
  semaphore := make(chan struct{}, 2)
  lengths:= []int{}

  for _, file := range(args[1:]){
    // Open the file and start reading it
    reader, err := os.Open(file)
    if err != nil {
      fmt.Println("Problem reading input file:", file)
      fmt.Println("Error:", err)
      os.Exit(0)
    }
    scanner := bufio.NewScanner(reader)
    // Start streaming lines
    for scanner.Scan() {
      wg.Add(1)
      text := scanner.Text()
      semaphore <- struct{}{}
      go func(line string) {
          length := getInformation(line)
          lengths = append(lengths, length)
          <-semaphore
          wg.Done()
      }(text)
    }
  }
  wg.Wait()
  sort.Ints(lengths)
  fmt.Println("Lengths:", lengths)

The getInformation function is just returning the length of the line. I then take that line and add it to an array. The issue I'm having is that when I run this multiple times against the same file I get different number of items in my array. I had assumed that since I was using a waitGroup that all lines would be processed every time and therefore the contents of lengths would be the same, but this does not appear to be the case. Can anyone see what I am doing wrong here?

  • 写回答

1条回答 默认 最新

  • drus39136 2018-09-04 00:17
    关注

    the lengths = append(lengths, length) is getting executed concurrently. This is not safe and will cause problems like missing entries from slice. You can fix this by wrapping the append calls in a mutex, or have the gorountines publish their results to a channel and have a single place that collects them up into a slice.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 装 pytorch 的时候出了好多问题,遇到这种情况怎么处理?
  • ¥20 IOS游览器某宝手机网页版自动立即购买JavaScript脚本
  • ¥15 手机接入宽带网线,如何释放宽带全部速度
  • ¥30 关于#r语言#的问题:如何对R语言中mfgarch包中构建的garch-midas模型进行样本内长期波动率预测和样本外长期波动率预测
  • ¥15 ETLCloud 处理json多层级问题
  • ¥15 matlab中使用gurobi时报错
  • ¥15 这个主板怎么能扩出一两个sata口
  • ¥15 不是,这到底错哪儿了😭
  • ¥15 2020长安杯与连接网探
  • ¥15 关于#matlab#的问题:在模糊控制器中选出线路信息,在simulink中根据线路信息生成速度时间目标曲线(初速度为20m/s,15秒后减为0的速度时间图像)我想问线路信息是什么