Using only the Go standard library, why does this two-tier architecture keep leaking TCP connections?

I'm using only the Go standard library here, most importantly net/http.

The application consists of two layers. The first layer is the basic web application. It serves the UI and proxies a bunch of API calls back to the second layer based on username. It is effectively a load balancer with consistent hashing: each user is allocated to one of the second-layer nodes, and any request pertaining to that user must be sent to that particular node.
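A minimal sketch of this kind of username-based routing, for illustration only. It uses a plain FNV hash modulo the node count, which is a simpler stand-in for real consistent hashing (a hash ring would remap fewer users when nodes are added or removed). The function name pickNode and the node addresses are hypothetical and not taken from the application.

```go
package main

import (
	"fmt"
	"hash/fnv"
)

// pickNode maps a username to one of the layer 2 nodes.
// Assumption: simple hash-mod-N, not a true consistent hash ring.
func pickNode(username string, nodes []string) string {
	if len(nodes) == 0 {
		return ""
	}
	h := fnv.New32a()
	h.Write([]byte(username)) // Write on an FNV hash never returns an error
	return nodes[h.Sum32()%uint32(len(nodes))]
}

func main() {
	// Hypothetical layer 2 addresses.
	nodes := []string{"10.0.0.11:8100", "10.0.0.12:8100", "10.0.0.13:8100"}
	for _, user := range []string{"alice", "bob", "alice"} {
		fmt.Printf("%s -> %s\n", user, pickNode(user, nodes))
	}
}
```

The same username always lands on the same node as long as the node list does not change, which is the property the proxy layer relies on.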

Quick details

The API endpoints in the first layer read in a JSON body, check the username, use it to work out which layer 2 node should receive the body, and then send it there. This is done with a global http.Client that has timeouts set on it, as appropriate.

On the server side, each handler does a defer request.Body.Close() after checking that the decoder.Decode(&obj) call that unmarshals the JSON returned no error. If there is a code path where the body is not closed, it isn't one that gets followed very often.

Symptoms

On the node in the second layer (the application server) I get log lines like these, presumably because it is leaking sockets and exhausting its file descriptors:

2019/07/15 16:16:59 http: Accept error: accept tcp [::]:8100: accept4: too many open files; retrying in 1s
2019/07/15 16:17:00 http: Accept error: accept tcp [::]:8100: accept4: too many open files; retrying in 1s

When I run lsof, it prints about 14k lines, of which 11,200 are TCP sockets. Looking through the output, nearly all of these TCP sockets are in the CLOSE_WAIT state, and they are connections between my application server (the second-layer node) and the web server (the first-layer node).

Interestingly, nothing seems to go wrong with the web application server (layer 1) during this timeframe.

Why does this happen?

I've seen lots of explanations, but most either say that you need to configure a custom http.Client with explicit settings instead of using the default one, or tell you to make sure to close the request bodies after reading from them in the layer 2 handlers.

Given all this information, does anyone have any idea what I can do to put this to bed once and for all? Everything I find online attributes this kind of leak to user error, and while I certainly hope that's the case here, I worry that I've already nailed down every quirk of the Go standard library I can find.

Been having trouble nailing down exactly how long it takes for this to happen -- the last time it happened, it was up for 3 days before I started to see this error, and at that point obviously nothing recovers until I kill and restart the process.

Any help would be hugely appreciated!

EDIT: example of client-side code

Here is an example of what I'm doing in the web application (layer 1) to call the layer 2 node:


var webHttpClient = &http.Client{
    Transport: &http.Transport{
        MaxIdleConnsPerHost: MaxIdleConnections,
    },
    Timeout: time.Second * 20,
}
// ...
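uri := fmt.Sprintf("http://%s/%s", tsUri, "pms/all-venue-balances")
req, err := http.NewRequest("POST", uri, bytes.NewBuffer(b))
if err != nil {
    log.Printf("Submit rebal: building request failed: %v\n", err)
    w.WriteHeader(500)
    return
}
resp, err := webHttpClient.Do(req)
if err != nil {
    log.Printf("Submit rebal error 3: %v\n", err)
    w.WriteHeader(500)
    return
}
// Close the response body on every path past this point.
defer resp.Body.Close()

body, err := ioutil.ReadAll(resp.Body)
if err != nil {
    log.Printf("Submit rebal: reading response failed: %v\n", err)
    w.WriteHeader(500)
    return
}
w.WriteHeader(200)
w.Write(body)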
