在封闭的net.Conn上写入，但返回nil错误

Talk is cheap, so here we go the simple code:

package main

import (
    "fmt"
    "time"
    "net"
)

func main() {
    addr := "127.0.0.1:8999"

    // Server
    go func() {
        tcpaddr, err := net.ResolveTCPAddr("tcp4", addr)
        if err != nil {
            panic(err)
        }
        listen, err := net.ListenTCP("tcp", tcpaddr)
        if err != nil {
            panic(err)
        }
        for  {
            if conn, err := listen.Accept(); err != nil {
                panic(err)
            } else if conn != nil {
                go func(conn net.Conn) {
                    buffer := make([]byte, 1024)
                    n, err := conn.Read(buffer)
                    if err != nil {
                        fmt.Println(err)
                    } else {
                        fmt.Println(">", string(buffer[0 : n]))
                    }
                    conn.Close()
                }(conn)
            }
        }
    }()

    time.Sleep(time.Second)

    // Client
    if conn, err := net.Dial("tcp", addr); err == nil {
        for i := 0; i < 2; i++ {
            _, err := conn.Write([]byte("hello"))
            if err != nil {
                fmt.Println(err)
                conn.Close()
                break
            } else {
                fmt.Println("ok")
            }
            // sleep 10 seconds and re-send
            time.Sleep(10*time.Second)
        }
    } else {
        panic(err)
    }

}

Ouput:

> hello
ok
ok

The Client writes to the Server twice. After the first read, the Server closes the connection immediately, but the Client sleeps 10 seconds and then re-writes to the Server with the same already closed connection object(conn).

Why can the second write succeed (returned error is nil)?

Can anyone help?

PS:

In order to check if the buffering feature of the system affects the result of the second write, I edited the Client like this, but it still succeeds:

// Client
if conn, err := net.Dial("tcp", addr); err == nil {
    _, err := conn.Write([]byte("hello"))
    if err != nil {
        fmt.Println(err)
        conn.Close()
        return
    } else {
        fmt.Println("ok")
    }
    // sleep 10 seconds and re-send
    time.Sleep(10*time.Second)

    b := make([]byte, 400000)
    for i := range b {
        b[i] = 'x'
    }
    n, err := conn.Write(b)
    if err != nil {
        fmt.Println(err)
        conn.Close()
        return
    } else {
        fmt.Println("ok", n)
    }
    // sleep 10 seconds and re-send
    time.Sleep(10*time.Second)
} else {
    panic(err)
}

And here is the screenshot: attachment

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douting0585 2018-07-13 08:06
关注
There are several problems with your approach.

Sort-of a preface

The first one is that you do not wait for the server goroutine to complete. In Go, once main() exits for whatever reason, all the other goroutines still running, if any, are simply teared down forcibly.

You're trying to "synchronize" things using timers, but this only works in toy situations, and even then it does so only from time to time.

Hence let's fix your code first:

package main import ( "fmt" "log" "net" "time" ) func main() { addr := "127.0.0.1:8999" tcpaddr, err := net.ResolveTCPAddr("tcp4", addr) if err != nil { log.Fatal(err) } listener, err := net.ListenTCP("tcp", tcpaddr) if err != nil { log.Fatal(err) } // Server done := make(chan error) go func(listener net.Listener, done chan<- error) { for { conn, err := listener.Accept() if err != nil { done <- err return } go func(conn net.Conn) { var buffer [1024]byte n, err := conn.Read(buffer[:]) if err != nil { log.Println(err) } else { log.Println(">", string(buffer[0:n])) } if err := conn.Close(); err != nil { log.Println("error closing server conn:", err) } }(conn) } }(listener, done) // Client conn, err := net.Dial("tcp", addr) if err != nil { log.Fatal(err) } for i := 0; i < 2; i++ { _, err := conn.Write([]byte("hello")) if err != nil { log.Println(err) err = conn.Close() if err != nil { log.Println("error closing client conn:", err) } break } fmt.Println("ok") time.Sleep(2 * time.Second) } // Shut the server down and wait for it to report back err = listener.Close() if err != nil { log.Fatal("error closing listener:", err) } err = <-done if err != nil { log.Println("server returned:", err) } }

I've spilled a couple of minor fixes like using log.Fatal (which is log.Print + os.Exit(1)) instead of panicking, removed useless else clauses to adhere to the coding standard of keeping the main flow where it belongs, and lowered the client's timeout. I have also added checking for possible errors Close on sockets may return.

The interesting part is that we now properly shut the server down by closing the listener and then waiting for the server goroutine to report back (unfortunately Go does not return an error of a custom type from net.Listener.Accept in this case so we can't really check that Accept exited because we've closed the listener). Anyway, our goroutines are now properly synchronized, and there is no undefined behaviour, so we can reason about how the code works.

Remaining problems

Some problems still remain.

The more glaring is you making wrong assumption that TCP preserves message boundaries—that is, if you write "hello" to the client end of the socket, the server reads back "hello". This is not true: TCP considers both ends of the connection as producing and consuming opaque streams of bytes. This means, when the client writes "hello", the client's TCP stack is free to deliver "he" and postpone sending "llo", and the server's stack is free to yield "hell" to the read call on the socket and only return "o" (and possibly some other data) in a later read.

So, to make the code "real" you'd need to somehow introduce these message boundaries into the protocol above TCP. In this particular case the simplest approach would be either using "messages" consisting of a fixed-length and agreed-upon endianness prefix indicating the length of the following data and then the string data itself. The server would then use a sequence like

var msg [4100]byte _, err := io.ReadFull(sock, msg[:4]) if err != nil { ... } mlen := int(binary.BigEndian.Uint32(msg[:4])) if mlen < 0 { // handle error } if mlen == 0 { // empty message; goto 1 } _, err = io.ReadFull(sock, msg[5:5+mlen]) if err != nil { ... } s := string(msg[5:5+mlen])

Another approach is to agree on that the messages do not contain newlines and terminate each message with a newline (ASCII LF, , 0x0a). The server side would then use something like a usual bufio.Scanner loop to get full lines from the socket.

The remaining problem with your approach is to not dealing with what Read on a socket returns: note that io.Reader.Read (that's what sockets implement, among other things) is allowed to return an error while having had read some data from the underlying stream. In your toy example this might rightfully be unimportant, but suppose that you're writing a wget-like tool which is able to resume downloading of a file: even if reading from the server returned some data and an error, you have to deal with that returned chunk first and only then handle the error.

Back to the problem at hand

The problem presented in the question, I beleive, happens simply because in your setup you hit some TCP buffering problem due to the tiny length of your messages.

On my box which runs Linux 4.9/amd64 two things reliably "fix" the problem:

Sending messages of 4000 bytes in length: the second call to Write "sees" the problem immediately.

Doing more Write calls.

For the former, try something like

msg := make([]byte, 4000) for i := range msg { msg[i] = 'x' } for { _, err := conn.Write(msg) ...

and for the latter—something like

for { _, err := conn.Write([]byte("hello")) ... fmt.Println("ok") time.Sleep(time.Second / 2) }

(it's sensible to lower the pause between sending stuff in both cases).

It's interesting to note that the former example hits the write: connection reset by peer (ECONNRESET in POSIX) error while the second one hits write: broken pipe (EPIPE in POSIX).

This is because when we're sending in chunks worth 4k bytes, some of the packets generated for the stream manage to become "in flight" before the server's side of the connection manages to propagate the information on its closure to the client, and those packets hit an already closed socket and get rejected with the RST TCP flag set. In the second example an attempt to send another chunk of data sees that the client side already knows that the connection has been teared down and fails the sending without "touching the wire".

TL;DR, the bottom line

Welcome to the wonderful world of networking. ;-)

I'd recommend buying a copy of "TCP/IP Illustrated", read it and experiment. TCP (and IP and other protocols above IP) sometimes works not like people expect them to by applying their "common sense".
解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

Golang net.Conn并行写入
2016-07-25 10:44

回答 2 已采纳 The io.Write says that in case of partial write, err will be != nil Found here on StackOverflow t
在net.Conn上创建一个http响应编写器 http
2017-10-12 01:06

回答 1 已采纳 http.ResponseWriter is an interface. So to create your own, just make any data type that satisfies
conn（net.Conn）并不总是写在套接字上
2017-05-23 14:20

回答 2 已采纳 [Edit & Solved] Several modifications to solve this... Server: func handleRequest(conn net.Conn
通向Golang的捷径【15. 网络, 模板和 web 应用】
2019-12-31 17:04

点点吃得太多了的博客将使用 TCP 协议和并发协程, 开发一个简单的客户端-服务器应用, 一个 (web) 服务器应用需响应多个客户端的并发请求, 在 Go 语言中, 每个客户端请求都将生成一个并发协程, 并对请求进行处理, 同时还需要 net ...
从net.Conn获取io.ByteReader
2014-02-08 06:50

回答 2 已采纳 The problem is that the underlying net.TCPConn returned by net.Dial as net.Conn only implements th
如何在Golang的单元测试中测试net.Conn？
2015-06-06 23:49

回答 3 已采纳 Although it will depend on the implementation details of your particular case, the general approac
net.Conn.Read用于持久TCP套接字的正确方法是什么
2017-12-01 01:18

回答 1 已采纳 TCP doesn't provide any message framing, it's up to you to buffer the stream and parse the message
解析Quorum -- 摩根大通的企业级区块链解决方案
2019-02-13 14:38

hello2mao的博客对于私有交易，会进行加密处理，公链（Quorum chain）上只存储加密后的数据的hash值，而私有交易的数据加密后将存储在链下，通过定制的一个模块（Tessera或者Constellation）在节点间安全的共享。状态数据库被分成...
golang：如何从bfio.Reader释放net.Conn
2016-06-01 20:38

回答 1 已采纳 You can get the buffered data from a *bufio.Reader using the following code: p, _ := br.Peek(br.
AS创建新项目报错java.util.concurrent.ExecutionException: org.apache.http.conn.ConnectTimeoutException android android-studio
2019-11-02 11:12

回答 1 已采纳 https://blog.csdn.net/yyhaohaoxuexi/article/details/79517989
从net.Conn检索URI路径
2018-10-10 15:31

回答 1 已采纳 This is not possible because not every TCP connection is an HTTP connection and the TCP protocol h
Go入门系列（十四） go并发编程之Goroutine与channel（上）
2021-02-16 15:59

张柏沛的博客 = nil{ log.Fatal() } defer conn.Close() // 开一个子协程用于接收服务端的消息 go mustCopy(os.Stdout, conn) // 将服务端发过来的消息发送到客户端的标准输出（屏幕上） // 主协程则向服务端发送消息 ...
尝试安装Go-SQL-Driver时发生错误：未定义：syscall.Conn
2019-06-21 08:22

回答 1 已采纳 Go-MySQL-Driver only supports Go 1.9 or later. You are using Go 1.8, and the syscall.Conn interfac
Golang redis(二)redigo连接详解
2019-07-14 20:20

comprel的博客 go 连接redis主要使用conn.go文件中的连接函数，一般使用Dial， DialURL 当然也有 NewConn 创建于redis的连接，在应用程序使用完毕后必须调用连接的Close() 方法将连接关闭，否则，有可能出现连接池溢出的问题 1....
JAVA 实习面试题大全必看
2020-04-02 09:05

Jason Carl的博客 ②Error指Java程序运行错误，出现Error通常是因为系统的内部错误或资源耗尽，Error不能在运行过程中被动态处理，如果程序运行中出现Error，系统只能记录错误的原因和安全终止。③Exception指Java程序运行异常，即...
Java2022面试题集锦
2022-02-05 12:15

x.h.z的博客 ②Error指Java程序运行错误，出现Error通常是因为系统的内部错误或资源耗尽，Error不能在运行过程中被动态处理，如果程序运行中出现Error，系统只能记录错误的原因和安全终止。③Exception指Java程序运行异常，即...
golang 1.16 发布
2021-02-19 11:01

wide288的博客 Go 1.16简介最新的Go版本1.16版在Go 1.15之后六个月到货。它的大部分更改是在工具链，运行时和库的...围棋1.16上添加与MacOS的支持64位ARM架构（即Apple硅）GOOS=darwin，GOARCH=arm64。像darwin/amd64端口，所述d
goalng1.8 的变化
2021-03-09 17:36

billgates_wanbin的博客它的大部分更改是在工具链，runtime, 和 libraries的实现上。语言规范有两个小的更改。与往常一样，该版本保留了Go 1兼容性的承诺。我们希望几乎所有Go程序都能像以前一样继续编译和运行。该版本增加了对32位MIPS的...
Nginx知识点总结
2022-07-16 13:14

it界的哈士奇的博客 cache-controlexpires强制缓存页面首次打开，直接读取缓存数据，刷新，会向服务器发起请求etaglastmodify协商缓存没发生变化返回304不发送数据。
并发设计模式
2023-02-13 19:26

saas软件销售顾问的博客只用在父进程或者子进程需要写入的时候才会复制地址空间，从而使父子进程拥有各自的地址空间。本质上来讲，父子进程的地址空间以及数据都是要隔离的，使用 Copy-on-Write 更多地体现的是一种延时策略，只有在真正...
没有解决我的问题, 去提问

悬赏问题

¥15 apm2.8飞控罗盘bad health，加速度计校准失败
¥15 求解O-S方程的特征值问题给出边界层布拉休斯平行流的中性曲线
¥15 谁有desed数据集呀
¥20 手写数字识别运行c仿真时，程序报错错误代码sim211-100
¥15 关于#hadoop#的问题
¥15 (标签-Python|关键词-socket)
¥15 keil里为什么main.c定义的函数在it.c调用不了
¥50 切换TabTip键盘的输入法
¥15 可否在不同线程中调用封装数据库操作的类
¥15 微带串馈天线阵列每个阵元宽度计算

在封闭的net.Conn上写入，但返回nil错误

1条回答 默认 最新

Sort-of a preface

Remaining problems

Back to the problem at hand

TL;DR, the bottom line

悬赏问题

1条回答默认最新