如何确定空白fmt.Fscanf消耗的数量？

I am trying to implement a PPM decoder in Go. PPM is an image format that consists of a plaintext header and then some binary image data. The header looks like this (from the spec):

Each PPM image consists of the following:

A "magic number" for identifying the file type. A ppm image's magic number is the two characters "P6".

Whitespace (blanks, TABs, CRs, LFs).

A width, formatted as ASCII characters in decimal.

Whitespace.

A height, again in ASCII decimal.

Whitespace.

The maximum color value (Maxval), again in ASCII decimal. Must be less than 65536 and more than zero.

A single whitespace character (usually a newline).

I try to decode this header with the fmt.Fscanf function. The following call to fmt.Fscanf parses the header (not addressing the caveat explained below):

var magic string
var width, height, maxVal uint

fmt.Fscanf(input,"%2s %d %d %d",&magic,&width,&height,&maxVal)

The documentation of fmt states:

Note: Fscan etc. can read one character (rune) past the input they return, which means that a loop calling a scan routine may skip some of the input. This is usually a problem only when there is no space between input values. If the reader provided to Fscan implements ReadRune, that method will be used to read characters. If the reader also implements UnreadRune, that method will be used to save the character and successive calls will not lose data. To attach ReadRune and UnreadRune methods to a reader without that capability, use bufio.NewReader.

As the very next character after the final whitespace is already the beginning of the image data, I have to be certain about how many whitespace fmt.Fscanf did consume after reading MaxVal. My code must work on whatever reader the was provided by the caller and parts of it must not read past the end of the header, therefore wrapping stuff into a buffered reader is not an option; the buffered reader might read more from the input than I actually want to read.

Some testing suggests that parsing a dummy character at the end solves the issues:

var magic string
var width, height, maxVal uint
var dummy byte

fmt.Fscanf(input,"%2s %d %d %d%c",&magic,&width,&height,&maxVal,&dummy)

Is that guaranteed to work according to the specification?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doujian7132 2013-04-05 19:46
关注
No, I would not consider that safe. While it works now, the documentation states that the function reserves the right to read past the value by one character unless you have an UnreadRune() method.

By wrapping your reader in a bufio.Reader, you can ensure the reader has an UnreadRune() method. You will then need to read the final whitespace yourself.

buf := bufio.NewReader(input) fmt.Fscanf(buf,"%2s %d %d %d",&magic,&width,&height,&maxVal) buf.ReadRune() // remove next rune (the whitespace) from the buffer.

Edit:

As we discussed in the chat, you can assume the dummy char method works and then write a test so you know when it stops working. The test can be something like:

func TestFmtBehavior(t *testing.T) { // use multireader to prevent r from implementing io.RuneScanner r := io.MultiReader(bytes.NewReader([]byte("data "))) n, err := fmt.Fscanf(r, "%s%c", new(string), new(byte)) if n != 2 || err != nil { t.Error("failed scan", n, err) } // the dummy char read 1 extra char past "data". // one byte should still remain if n, err := r.Read(make([]byte, 5)); n != 1 { t.Error("assertion failed", n, err) } }
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

如何确定空白fmt.Fscanf消耗的数量？
2013-04-05 18:44

回答 1 已采纳 No, I would not consider that safe. While it works now, the documentation states that the function
为什么在打印指针时fmt.Println不一致？
2017-09-23 00:46

回答 1 已采纳 The "technical" answer to your question can be found here: https://golang.org/src/fmt/print.go?#L
为什么在使用fmt.Fscanf时出现“输入格式不匹配”的提示？
2019-07-21 20:11

回答 1 已采纳 The error comes from how Fscanf parses space-separated strings. This becomes an issue when reading
Golang学习笔记（一）
2020-05-07 17:00

某热心知名群众的博客 int { x++ return x * x } } func main() { f := squares() fmt.Println(f()) // "1" fmt.Println(f()) // "4" fmt.Println(f()) // "9" fmt.Println(f()) // "16" } //通过这个例子，我们看到变量的生命周期不由它...
为什么我不能使用flag.StringVar传递指向fmt.Println的指针？
2018-08-17 13:04

回答 1 已采纳 Well this is fairly simple, you are trying to dereference a value instead of pointer. var svar st
为什么要使用fmt.Sprint？
2017-07-20 00:11

回答 4 已采纳 In your example there are no real differences as you are Sprintf to simply concaternate strings. T
为什么我不能做fmt.Sprintf（“％d。％d。％d。％d”，a ...）？
2015-10-29 15:17

回答 6 已采纳 A Tour of Go Exercise: Stringers Make the IPAddr type implement fmt.Stringer to print
C语言编程规范 — 头文件、函数
2022-04-18 00:45

yunfan188的博客 ” ——Steve McConnell 一般情况下，代码的可阅读性高于性能，只有确定性能是瓶颈时，才应该主动优化。 2、简洁为美简洁就是易于理解并且易于实现。代码越长越难以看懂，也就越容易在修改时引入错误。写的代码越...
添加表达式“ fmt.Println（）”时发生了什么
2019-08-29 09:56

回答 1 已采纳 golang uses a scheduler to schedule go routines. You can read more about it here https://povilasv.
Go中的`fmt.Println`如何工作？
2016-05-03 14:53

回答 1 已采纳 Println determines whether the value implements the Stringer interface. If it does then it will c
在fmt.Sprintf格式字符串中多次引用相同的参数
2018-10-27 22:09

回答 1 已采纳 Package fmt import "fmt" Explicit argument indexes: In Printf, Sprintf, and Fprint
Linux ftrace 2.1、ftrace的使用
2018-05-23 15:41

pwl999的博客关于Ftrace的使用，最权威的解读就在”Documentation/trace”文件夹下，我们挑选其中最经典的几个文件来进行翻译，加上自己理解的解读。参考原文：ftrace - Function Tracer 1、背景： Ftrace本来设计作为一个...
如何通过go程序未分配的fmt.Println（）内存？
2017-02-13 17:09

回答 2 已采纳 You can convert that address to a slice of bytes, which can then be passed to any Write method. T
C++笔试题目大全
2014-07-31 19:11

a48351217a的博客 1 c++ c++ c++ c++ 笔试题汇总 ① 链表反转单向链表的反转是一个经常被问到的一个面试题，也是一个非常基础的问题。比如一个链表是这样的： 1->2->3->4->5 通过反转后成为 5->4->3->2->1 。...
对于格式化字符串的总结
2016-05-10 09:19

panjieke的博客使用的百分比字符由当前的 NumberFormatInfo 类确定。 [最小占位宽度 ] 如其含义,当宽度超过这个范围后,这个限制无效当宽度少于最小占位宽度,即用上面的填充字符来填充空下的部分使用...
itpt_TCPL 第五章：指针和数组 - 第八章：UNIX系统接口
2016-11-01 10:24

竹影半墙的博客 /* 跳过空白字符 */ ; if (! isdigit (c) && c != EOF && c != ‘+’ && c != ‘-‘) { ungetch(c); return 0 ; } sign = (c == ‘-‘) ? - 1 : 1 ; if (c == ‘+’ || c == ‘-‘) c = getch(); ...
格式化字符串的总结
2015-01-22 10:23

piaopiaopiaopiaopiao的博客使用的百分比字符由当前的 NumberFormatInfo 类确定。 [最小占位宽度 ] 如其含义,当宽度超过这个范围后,这个限制无效当宽度少于最小占位宽度,即用上面的填充字符来填充空下的部分 ...
格式化字符串
2013-08-19 14:11

lovenessless的博客使用的百分比字符由当前的 NumberFormatInfo 类确定。 [最小占位宽度 ] 如其含义,当宽度超过这个范围后,这个限制无效当宽度少于最小占位宽度,即用上面的填充字符来填充空下的部分使用...
没有解决我的问题, 去提问

悬赏问题

¥15 抖音咸鱼付款链接转码支付宝
¥15 ubuntu22.04上安装ursim-3.15.8.106339遇到的问题
¥15 求螺旋焊缝的图像处理
¥15 blast算法（相关搜索：数据库）
¥15 请问有人会紧聚焦相关的matlab知识嘛？
¥15 网络通信安全解决方案
¥50 yalmip+Gurobi
¥20 win10修改放大文本以及缩放与布局后蓝屏无法正常进入桌面
¥15 itunes恢复数据最后一步发生错误
¥15 关于#windows#的问题：2024年5月15日的win11更新后资源管理器没有地址栏了顶部的地址栏和文件搜索都消失了

如何确定空白fmt.Fscanf消耗的数量？

1条回答 默认 最新

悬赏问题

1条回答默认最新