dongweng6241 2017-09-27 13:07

RabbitMQ multi-worker pattern

I'm trying to find a good method to consume asynchronously from an input queue, process the content using several workers and then publish to an output queue. So far I've tried a number of examples, most recently using the code from here and here as inspiration.

My current code doesn't appear to be doing what it should, however: increasing the number of workers doesn't increase performance (msg/s consumed or published), and the number of goroutines remains fairly static whilst running.

main:

func main() {
    maxWorkers := 10

    // channel for jobs
    in := make(chan []byte)
    out := make(chan []byte)

    // start workers
    wg := &sync.WaitGroup{}
    wg.Add(maxWorkers)
    for i := 1; i <= maxWorkers; i++ {
        log.Println(i)
        defer wg.Done()
        go processor(in, out)
    }

    // add jobs
    go collector(in)
    go sender(out)

    // wait for workers to complete
    wg.Wait()
}

The collector is basically the example from the RabbitMQ site with a goroutine that collects messages from the queue and places them on the 'in' channel:

forever := make(chan bool)
go func() {
    for d := range msgs {
        in <- d.Body
        d.Ack(false)
    }
}()
log.Printf("[*] Waiting for messages. To exit press CTRL+C")
<-forever

The processor receives an 'in' and 'out' channel, unmarshals JSON, performs a series of regexes and then places the output into the 'out' channel:

func processor(in chan []byte, out chan []byte) {

    var (
    // list of regexes declared here
    )

    for {
        body := <-in

        jsonIn := &Data{}
        err := json.Unmarshal(body, jsonIn)
        if err != nil {
            log.Fatalln("Failed to decode:", err)
        }

        content := jsonIn.Content

        //process regexes using:
        //jsonIn.a = r1.FindAllString(content, -1)

        jsonOut, _ := json.Marshal(jsonIn)

        out <- jsonOut
    }
}

And finally the sender is simply the code from the RabbitMQ site, setting up a connection, reading from the 'out' channel and then publishing to a RMQ queue:

for {
    jsonOut := <-out

    err = ch.Publish(
        "",     // exchange
        q.Name, // routing key
        false,  // mandatory
        false,  // immediate
        amqp.Publishing{
            DeliveryMode: amqp.Persistent,
            ContentType:  "text/json",
            Body:         []byte(jsonOut),
        })
    failOnError(err, "Failed to publish a message")

}

This is a pattern that I'll be using quite a lot, so I'm spending a lot of time trying to find something that works correctly (and well) - any advice or help would be appreciated (and in case it isn't obvious, I'm new to Go).


1 answer

  • dreamer1231 2017-09-27 13:47

    There are a couple of things that jump out:

    Done within main function

    wg.Add(maxWorkers)
    for i := 1; i <= maxWorkers; i++ {
        log.Println(i)
        defer wg.Done()
        go processor(in, out)
    }
    

    The defer here only runs when main returns, so the counter is never decremented while the program is running and wg.Wait() ends up blocking forever; it's not actually indicating when processing is complete. I don't think this affects the performance profile of your program, though.

    To address this you could pass in wg *sync.WaitGroup to your processor so your processor can indicate when it's done.
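    As a minimal sketch of that fix (assuming the processor exits when the `in` channel is closed, so that Done has something meaningful to signal; `runPipeline` is a made-up name for illustration):

    ```go
    package main

    import (
    	"fmt"
    	"sync"
    )

    // processor drains `in` until it is closed, then signals the WaitGroup.
    func processor(wg *sync.WaitGroup, in <-chan []byte, out chan<- []byte) {
    	defer wg.Done()
    	for body := range in {
    		out <- body // the real JSON/regex work would go here
    	}
    }

    // runPipeline pushes msgs through maxWorkers processors and returns
    // everything that reached `out`.
    func runPipeline(maxWorkers int, msgs [][]byte) [][]byte {
    	in := make(chan []byte)
    	out := make(chan []byte, len(msgs))

    	var wg sync.WaitGroup
    	for i := 0; i < maxWorkers; i++ {
    		wg.Add(1)
    		go processor(&wg, in, out)
    	}
    	for _, m := range msgs {
    		in <- m
    	}
    	close(in) // each worker's range loop ends
    	wg.Wait() // now actually waits for the workers
    	close(out)

    	var results [][]byte
    	for m := range out {
    		results = append(results, m)
    	}
    	return results
    }

    func main() {
    	res := runPipeline(4, [][]byte{[]byte("a"), []byte("b"), []byte("c")})
    	fmt.Println(len(res)) // 3
    }
    ```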

    CPU Bound Processing

    Parsing messages and running regexes is a CPU-intensive workload. How many cores does your machine have? How is throughput affected if you run your program on two separate machines? Does it double? What if you double the number of cores? What about running your program with 1 worker vs 2 processor workers? Does that double throughput? Are you maxing out your local RabbitMQ instance? Is it the bottleneck?
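    To answer the core-count question quickly, the Go runtime will tell you (a trivial sketch, not specific to your program):

    ```go
    package main

    import (
    	"fmt"
    	"runtime"
    )

    func main() {
    	// For CPU-bound work like regex matching, worker counts beyond
    	// the core count rarely add throughput.
    	fmt.Println("cores:", runtime.NumCPU())
    	fmt.Println("GOMAXPROCS:", runtime.GOMAXPROCS(0))
    }
    ```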

    Setting up benchmarking and load-testing harnesses should let you run experiments to see where your bottlenecks are :)

    For queue-based services it's pretty easy to set up a test harness that fills RabbitMQ with a set backlog and benchmarks how fast you can process those messages, or to set up a load generator that sends x messages/second to RabbitMQ and observe whether you can keep up.
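    A sketch of the load-generator side, using an in-memory channel in place of RabbitMQ so it runs standalone (the rates and names here are illustrative only):

    ```go
    package main

    import (
    	"fmt"
    	"time"
    )

    // generate sends `total` messages into ch at roughly `rate` per second,
    // then closes the channel so the consumer knows the run is over.
    func generate(ch chan<- []byte, total, rate int) {
    	tick := time.NewTicker(time.Second / time.Duration(rate))
    	defer tick.Stop()
    	for i := 0; i < total; i++ {
    		<-tick.C
    		ch <- []byte(fmt.Sprintf("msg-%d", i))
    	}
    	close(ch)
    }

    func main() {
    	ch := make(chan []byte)
    	go generate(ch, 20, 100) // 20 messages at ~100 msg/s

    	start := time.Now()
    	n := 0
    	for range ch {
    		n++ // the real consumer would do its processing here
    	}
    	fmt.Printf("consumed %d messages in %v\n", n, time.Since(start))
    }
    ```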

    Does RabbitMQ give you good visibility into message-processing throughput? If not, I frequently add a counter to the Go code and then log the overall averaged throughput on an interval to get a rough idea of performance:

    start := time.Now()
    updateInterval := time.Tick(1 * time.Second)
    numIn := 0
    for {
        select {
        case <-updateInterval:
            log.Printf("IN - Count: %d", numIn)
            log.Printf("IN - Throughput: %.0f events/second",
                float64(numIn)/time.Since(start).Seconds())
        case d := <-msgs:
            numIn++
            in <- d.Body
            d.Ack(false)
        }
    }
    
