dongxie1907 2015-05-25 04:39
浏览 31

bufio.ScanLines与转义的新行

I am trying to adapt bufio.ScanLines so it is aware of escaped new lines \ .

Input:

line1 \
continues on line2

Expected output:

["line1 continues on line2"]

Right now the output of bufio.ScanLines (see example code below) is:

["line1 \\", "continues on line2"]

Example code:

s := bufio.NewScanner(f)
s.Split(bufio.ScanLines)

for s.Scan() {
    fmt.Println(s.Text())
}

What would be the best approach here? Looking for an implementation that still passes the tests in https://golang.org/src/bufio/scan_test.go.

  • 写回答

1条回答 默认 最新

  • dongsuoying9059 2015-05-25 13:20
    关注

    A few obvious approaches come to mind.

    First, take a look at the source for bufio.ScanLines, it's not large and you could easily implement your own bufio.SplitFunc from scratch starting from a copy of that modified to do what you want.

    Second, you could write a bufio.SplitFunc that called bufio.ScanLines in a loop, combining tokens as long as it returns ones that end in your escape character, and then returning the combined token.

    Given the short size and simplicity of the first approach that's what I'd probably do. The second approach would likely end up just as long, be less efficient, and probably require state since you'd need to store the combined-token-so-far when returning (0, nil, nil) to ask for more input.

    Another solution would be to implement a Transformer (from the golang.org/x/text/transform package) that strips relevant escaped characters from the input (e.g. removes "\\ ") and use transform.NewReader to make a filtered reader that you'd then do use however you want (e.g. passing to a bufio.Scanner with the regular ScanLines).

    In any case, you could copy the appropriate tests from scan_test.go as well as adding your own for the escaped newline behaviour. Beware of bufio.MaxScanTokenSize as well.

    评论

报告相同问题?

悬赏问题

  • ¥15 R语言Rstudio突然无法启动
  • ¥15 关于#matlab#的问题:提取2个图像的变量作为另外一个图像像元的移动量,计算新的位置创建新的图像并提取第二个图像的变量到新的图像
  • ¥15 改算法,照着压缩包里边,参考其他代码封装的格式 写到main函数里
  • ¥15 用windows做服务的同志有吗
  • ¥60 求一个简单的网页(标签-安全|关键词-上传)
  • ¥35 lstm时间序列共享单车预测,loss值优化,参数优化算法
  • ¥15 Python中的request,如何使用ssr节点,通过代理requests网页。本人在泰国,需要用大陆ip才能玩网页游戏,合法合规。
  • ¥100 为什么这个恒流源电路不能恒流?
  • ¥15 有偿求跨组件数据流路径图
  • ¥15 写一个方法checkPerson,入参实体类Person,出参布尔值