扫描仪提前终止

I am trying to write a scanner in Go that scans continuation lines and also clean the line up before returning it so that you can return logical lines. So, given the following SplitLine function (Play):

func ScanLogicalLines(data []byte, atEOF bool) (int, []byte, error) {
    if atEOF && len(data) == 0 {
        return 0, nil, nil
    }

    i := bytes.IndexByte(data, '
')
    for i > 0 && data[i-1] == '\\' {
        fmt.Printf("i: %d, data[i] = %q
", i, data[i])
        i = i + bytes.IndexByte(data[i+1:], '
')
    }

    var match []byte = nil
    advance := 0
    switch {
    case i >= 0:
        advance, match = i + 1, data[0:i]
    case atEOF: 
        advance, match = len(data), data
    }
    token := bytes.Replace(match, []byte("\\
"), []byte(""), -1)
    return advance, token, nil
}

func main() {
    simple := `
Just a test.

See what is returned. \
when you have empty lines.

Followed by a newline.
`

    scanner := bufio.NewScanner(strings.NewReader(simple))
    scanner.Split(ScanLogicalLines)
    for scanner.Scan() {
        fmt.Printf("line: %q
", scanner.Text())
    }
}

I expected the code to return something like:

line: "Just a test."
line: ""
line: "See what is returned, when you have empty lines."
line: ""
line: "Followed by a newline."

However, it stops after returning the first line. The second call return 1, "", nil.

Anybody have any ideas, or is it a bug?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongzhunqiu4841 2013-11-12 22:16
关注
I would regard this as a bug because an advance value > 0 is not intended to make a further read call, even when the returned token is nil (bufio.SplitFunc):

If the data does not yet hold a complete token, for instance if it has no newline while scanning lines, SplitFunc can return (0, nil) to signal the Scanner to read more data into the slice and try again with a longer slice starting at the same point in the input.

What happens is this

The input buffer of the bufio.Scanner defaults to 4096 byte. That means that it reads up to this amount at once if it can and then executes the split function. In your case the scanner can read your input all at once as it is well below 4096 byte. This means that the next read it will do results in EOF which is the main problem here.

Step by step

scanner.Scan reads all your data

You get all the text that is there

You look for a token, you find the first newline which is only one newline

You return nil as a token by removing the newline from the match

scanner.Scan assumes: user needs more data

scanner.Scan attempts to read more

EOF happens

scanner.Scan tries to tokenize one last time

You find "Just a test."

scanner.Scan tries to tokenize one last time

You look for a token, you find the third line which is only one newline

You return nil as a token by removing the newline from the match

scanner.Scan sees nil token and set error (EOF)

Execution ends

How to circumvent

Any token that is non-nil will prevent this. As long as you return non-nil tokens the scanner will not check for EOF and continues executing your tokenizer.

The reason why your code returns nil tokens is that bytes.Replace returns nil when there's nothing to be done. append([]byte(nil), nil...) == nil. You could prevent this by returning a slice with a capacity and no elements as this would be non-nil: make([]byte, 0, 1) != nil.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

扫描仪提前终止
2013-11-12 20:28

回答 1 已采纳 I would regard this as a bug because an advance value > 0 is not intended to make a further rea
佳能 扫描仪，文件上传读取 java
2020-12-14 14:05

回答 8 已采纳软件： pip install opencv-python 开源 OpenCV pip install pytesseract Google OCR
新电脑设备里有扫描仪，磁盘有驱动，但是扫描仪找不到驱动，驱动找不到扫描仪 其他多彩生活驱动开发
2021-11-18 16:39

回答 1 已采纳 1、可以在官网找一找更早/更新的驱动2、部分老设备不兼容win10，建议换一个
Epic Games Launcher安装向导提前终止解决办法
2023-07-12 00:08

gggiweeq的博客安装Epic Games出现：由于错误，Epic Games Launcher安装向导提前终止。安装程序未对您的系统进行修改。如需稍后安装，请再次运行安装程序。单击“完成”按钮退出安装向导。
java中scanner扫描器关闭问题 java
2023-03-24 20:36

回答 1 已采纳是你代码出问题了，数组越界了，在16行
为什么代码异常显示扫描仪关闭 java
2023-02-26 13:21

回答 4 已采纳该回答引用ChatGPT 这段代码异常显示扫描器关闭的原因是，sc.close() 方法应该在 for 循环结束后才调用。在当前代码中，sc.close() 在循环内部被调用，因此扫描器会在第一次输
如何监听USB二维码扫描仪扫描到的数据
2016-04-07 08:59

回答 2 已采纳 window的下USB编程,关键词,重叠IO,线程调度writefile or readfile函数监控缓冲区
扫描仪_使用条款
2021-04-23 10:47

Foundao159_6712的博客 1.2本协议适用于扫描仪提供的，供您使用和/或访问的全部视频教程服务、客户端服务、及其他服务。 1.3本协议会不时更新，更新后的协议条款一旦公布即代替原有的协议条款，请您及时查阅。在更新协议
扫描器不能解析为一个类型。 java 有问必答
2021-10-31 14:18

回答 2 已采纳点击这个import
用了各种扫描器都找不到后台网址后端
2021-12-01 14:44

回答 2 已采纳现在扫描器只能扫个大概，需要手动使用bp等工具进行抓包才有可能发现漏洞，web的漏洞现在已经很少了（悬赏的）。
角度扫描仪自动提交 javascript php
2017-02-22 16:18

回答 1 已采纳 Does your scanner send a return after the string? I know some scanners have this ability, you cou
C#实现端口扫描器小程序
2020-11-19 20:49

咩咩叫的闲鱼的博客目录一、创建项目及UI设计二、只用单一进程实现端口扫描器三、用多线程方式实现端口扫描器四、参考编译软件：Visual Studio 2019 编译环境：Windows 10 使用语言：C# 一、创建项目及UI设计 1、打开VS2019，创建新...
如何使用自定义拆分实现扫描仪
2015-10-11 18:34

回答 3 已采纳 The Scanner type has a function called Split which allows you to pass a SplitFunc to determine how
网络安全实验一 Part 2 Windows环境下的扫描器程序
2022-03-15 00:05

red1y的博客提前结束扫描实验步骤一、熟悉QtCreator编程 C语言中文网Qt教程，按序看10~15篇 Qt弹窗，QString，用于和用户交互，发出警告/提示 Qt多线程，本实验要用到 Qt Socket，Qt有封装好的socke
编写端口扫描器程序
2020-11-18 21:54

满足没有的博客 //显示框显示 textBox4.AppendText("端口扫描器 v1.0.0" + Environment.NewLine + Environment.NewLine); //调用端口扫描函数 PortScan(); } else { //若端口号不合理，弹窗报错 MessageBox.Show("输入错误，端口...
没有解决我的问题, 去提问

悬赏问题

¥15 多电路系统共用电源的串扰问题
¥15 slam rangenet++配置
¥15 有没有研究水声通信方面的帮我改俩matlab代码
¥15 对于相关问题的求解与代码
¥15 ubuntu子系统密码忘记
¥15 信号傅里叶变换在matlab上遇到的小问题请求帮助
¥15 保护模式-系统加载-段寄存器
¥15 电脑桌面设定一个区域禁止鼠标操作
¥15 求NPF226060磁芯的详细资料
¥15 使用R语言marginaleffects包进行边际效应图绘制

扫描仪提前终止

1条回答 默认 最新

What happens is this

Step by step

How to circumvent

悬赏问题

1条回答默认最新