We have a large number of files that need to be uploaded to a remote blob store after processing.
Currently, the frontend (PHP) creates a Redis list of such files and gives it a unique ID, called the JobID. It then pushes that JobID onto a beanstalk tube, which is consumed by a Go process. That process uses a library called Go workers to handle each JobID in a manner similar to how net/http handles requests: it receives the JobID, retrieves the Redis list, and starts processing the files.
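For reference, the current flow looks roughly like this. It is a simplified sketch, not our actual code: fetchFileList and uploadFile are placeholders for the real Redis lookup and blob-store upload, and the file type is just a stand-in.

```go
// Simplified sketch of the current, sequential flow. fetchFileList and
// uploadFile stand in for the real Redis lookup and blob-store upload.
type file struct {
	Key string
}

func processJob(jobID string, fetchFileList func(string) ([]file, error), uploadFile func(file) error) error {
	files, err := fetchFileList(jobID)
	if err != nil {
		return err
	}
	for _, f := range files {
		// One upload at a time: the goroutine spends most of its time
		// blocked on network I/O.
		if err := uploadFile(f); err != nil {
			return err
		}
	}
	return nil
}
```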
However, currently only one file is processed at a time. Since the operation is I/O bound rather than CPU bound, intuition suggests it would be beneficial to use a goroutine per file.
However, we also want to retry uploads on failure and track the number of items processed per job. We cannot start an unbounded number of goroutines, because a single job can contain roughly 10k files, and hundreds of such jobs can arrive per second during peak times. What would be the correct approach for this?
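To make the question concrete, this is the general shape I am considering: a fixed-size pool of workers per job, a small retry loop per file, and an atomic counter for progress. It is only a sketch under assumptions, reusing the placeholder file type and uploadFile from above; maxWorkers and maxRetries are made-up parameters, and the open question is whether a bound like this should be per job or shared across all jobs.

```go
import (
	"log"
	"sync"
	"sync/atomic"
)

// Sketch of a bounded worker pool per job: maxWorkers goroutines drain a
// channel of files, retry each upload up to maxRetries times, and count
// successful uploads so progress per job can be reported.
func processJobConcurrently(files []file, maxWorkers, maxRetries int, uploadFile func(file) error) int64 {
	var processed int64
	jobs := make(chan file)
	var wg sync.WaitGroup

	for i := 0; i < maxWorkers; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for f := range jobs {
				var err error
				for attempt := 0; attempt < maxRetries; attempt++ {
					if err = uploadFile(f); err == nil {
						atomic.AddInt64(&processed, 1)
						break
					}
				}
				if err != nil {
					log.Printf("giving up on %s: %v", f.Key, err)
				}
			}
		}()
	}

	for _, f := range files {
		jobs <- f // blocks once all workers are busy, keeping concurrency bounded
	}
	close(jobs)
	wg.Wait()
	return processed
}
```

Is something along these lines reasonable, or is there a better-established pattern (or different queueing setup) for this kind of fan-out with retries and per-job accounting?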
NB: We can change the technology stack a bit if needed (such as swapping out beanstalkd for something else).