Is there a better way to limit requests at the "door"?

Right now I'm testing an extremely simple semaphore in one of my production regions in AWS. On deployment, latency jumped from 150ms to 300ms. I expected some added latency, but it would be great if it could be reduced. This is a bit new to me, so I'm experimenting. I've set the semaphore to allow 10000 connections, which is the same as the maximum number of connections Redis is configured for. Is the code below optimal? If not, can someone help me optimize it or point out anything I'm doing wrong? I want to keep this as a piece of middleware so that I can simply call it like this on the server: n.UseHandler(wrappers.DoorMan(wrappers.DefaultHeaders(myRouter), 10000)).

package wrappers

import "net/http"

// DoorMan limits the number of requests served concurrently by h to n.
func DoorMan(h http.Handler, n int) http.Handler {
    sema := make(chan struct{}, n)

    return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
        sema <- struct{}{}
        defer func() { <-sema }()

        h.ServeHTTP(w, r)
    })
}

douque9815 2017-09-08 16:00
The solution you outline has some issues. But first, let's take a small step back; there are two questions in this, one of them implied:

How do you rate limit inbound connections efficiently?

How do you prevent overloading a backend service with outbound connections?

What it sounds like you want to do is actually the second, to prevent too many requests from hitting Redis. I'll start by addressing the first one and then make some comments on the second.

Rate limiting inbound connections

If you really do want to rate limit inbound connections "at the door", you should normally never do that by waiting inside the handler. With your proposed solution, the service will keep accepting requests, which will queue up at the sema <- struct{}{} statement. If the load persists, it will eventually take down your service, either by running out of sockets, memory, or some other resource. Also note that if your request rate is approaching saturation of the semaphore, you would see an increase in latency caused by goroutines waiting at the semaphore before handling the request.

A better way to do it is to always respond as quickly as possible (especially when under heavy load). This can be done by sending a 503 Service Unavailable back to the client, or a smart load balancer, telling it to back off.

In your case, it could for example look like something along these lines:

select {
case sema <- struct{}{}:
    defer func() { <-sema }()
    h.ServeHTTP(w, r)
default:
    http.Error(w, "Overloaded", http.StatusServiceUnavailable)
}

Rate limiting outbound connections to a backend service

If the reason for the rate limit is to avoid overloading a backend service, what you typically want to do is rather to react to that service being overloaded and apply back pressure through the request chain.

In practical terms, this could mean something as simple as putting the same kind of semaphore logic as above in a wrapper protecting all calls to the backend, and return an error through your call chain of a request if the semaphore overflows.

Additionally, if the backend sends status codes like 503 (or equivalent), you should typically propagate that signal back to your own caller in the same way, or resort to some other fallback behaviour for handling the incoming request.

You might also want to consider combining this with a circuit breaker, cutting off attempts to call the backend service quickly if it seems to be unresponsive or down.

Rate limiting by capping the number of concurrent or queued connections as above is usually a good way to handle overload. When the backend service is overloaded, requests will typically take longer, which will then reduce the effective number of requests per second. However, if, for some reason, you want to have a fixed limit on the number of requests per second, you could do that with a rate.Limiter (from the golang.org/x/time/rate package) instead of a semaphore.

A comment on performance

The cost of sending and receiving trivial objects on a channel should be sub-microsecond. Even on a highly congested channel, synchronising with the channel alone wouldn't add anywhere near 150 ms of latency. So, assuming the work done in the handler is otherwise the same, whatever the source of your latency increase, it is almost certainly associated with goroutines waiting somewhere (e.g. on I/O or to get access to synchronised regions that are blocked by other goroutines).

If you are getting incoming requests at a rate close to what can be handled with your set concurrency limit of 10000, or if you are getting spikes of requests, it is possible you would see such an increase in average latency stemming from goroutines in the wait queue on the channel.

Either way, this should be easily measurable; you could for example trace timestamps at certain points in the handling pathway. I would do this on a sample (e.g. 0.1%) of all requests to avoid having the log output affect the performance.