Golang从字符串中的子字符串解析日期

I am writing a log file parser, and have written some test code to parse this in C.

The string to be parsed looks as follows:

s := `10.0.0.1 Jan 11 2014 10:00:00 hello`

In C, parsing this in place is quite easy. First I find the pointer to the date within the string and then just consume as much as possible using strptime(). This is possible as strptime() will return the position in the string after the call.

Eventually I decided to go with Golang instead of C, but while porting the code over I have some issues. As far as I can tell, time.Parse() does not give me any option to parse from within an existing string (though this can be solved with slices) or indication about how much of the original string it have consumed when parsing the date from within the string.

Is there any elegant way in Go I can parse the date/time right out of the string without having to first extract the datetime into an exact slice e.g. by returning the number of characters extracted after parsing?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
duananyu9231 2014-05-25 14:15
关注
Unfortunately, the time.Parse method can't tell you how many characters it parsed, so we will need to investigate other elegant solutions. In your example of parsing log statements, the use of regular expressions, as @rob74 suggested, is a reasonably elegant strategy. The example below ignores errors for brevity:

var r = regexp.MustCompile(`^((?:\d{1,3}\.){3}\d{1,3}) ([a-zA-Z]{3} \d{1,2} \d{4} \d{1,2}:\d{2}:\d{2}) (.*)`) const longForm = "Jan 02 2006 15:04:05" func parseRegex(s string) (ip, msg string, t time.Time) { m := r.FindStringSubmatch(s) t, _ = time.Parse(longForm, m[2]) ip, msg = m[1], m[3] return ip, msg, t }

Benchmarks show the above regular expression to be about two times more efficient than @rob74's example on my machine, parsing about a 100,000 lines per second:

BenchmarkParseRegex 100000 17130 ns/op BenchmarkParseRegexRob74 50000 32788 ns/op

We can, however, keep the solution short and more efficient if we use strings.SplitN instead. For example:

func parseSplit(s string) (ip, msg string, t time.Time) { parts := strings.SplitN(s, " ", 6) t, _ = time.Parse(longForm, strings.Join(parts[1:5], " ")) ip, msg = parts[0], parts[5] return ip, msg, t }

This splits the string on the first 5 spaces and puts the remaining string (the message part) inside the final parts slice element. This is not very elegant, since we rely on the number of spaces in the date format, but we could count the spaces in the date format string programmatically for a more general solution. Let's see how this compares to our regular expression solution:

BenchmarkParseRegex 100000 17130 ns/op BenchmarkParseSplit 500000 3557 ns/op

It compares quite favorably, as it turns out. Using SplitN is about five times faster than using regular expressions, and still results in concise and readable code. It does this at the cost of using slightly more memory for the slice allocation.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

Golang从字符串中的子字符串解析日期
2014-02-17 13:37

回答 2 已采纳 Unfortunately, the time.Parse method can't tell you how many characters it parsed, so we will need
Golang符文字符串或如何转换？
2017-12-29 09:33

回答 1 已采纳 Go accepts hexadecimal rune literals. So you can use your input as a regular string: fmt.Println
如何在golang中向字符串变量添加变量
2018-04-13 22:40

回答 2 已采纳 Why not use fmt.Sprintf? data := 14 response := fmt.Sprintf("Variable string %d content", data)
Golang字符串函数用法
2017-11-22 16:19

nudt_qxx的博客代码示例本关必读字符串介绍几乎任何程序都离不开字符串，字符串是 UTF-8 字符的一个序列（当字符为 ASCII 码时则占用 1 个字节，其它字符根据需要占用 2-4 个字节）。Go语言字符串是一种值类型，且值不可变，即创建...
从字符串golang中删除转义的双引号
2017-12-31 04:22

回答 1 已采纳 First mistake is you have enter in your jsonBytes. remove that near "0.26 you have \ in the fir
在golang中创建二维字符串数组
2018-08-14 19:55

回答 2 已采纳 You have a slice of slices, and the outer slice is nil until it's initialized: matrix := make([][
如何在golang中将字符串解析为url.Values？
2018-03-31 04:39

回答 2 已采纳 You could use url.ParseQuery to convert the raw query to url.Values with unescaping package main
Golang 从 Json 串中快速取出需要的字段
2020-07-05 21:58

qhh0205的博客 Golang 从 Json 串中快速取出需要的字段在 web 编程中很多情况下接口的数据是 json 格式，在我们拿到接口的 json 数据后如何方便地从中提取出需要的字段呢？我们可以自定义一个结构体，然后通过 Golang 的标准库 ...
从golang中的字符或字符串之前的字符串grep子字符串的最佳方法
2016-03-16 05:15

回答 1 已采纳 You can use net.SplitHostPort, like so ip, _, err := net.SplitHostPort(conn.RemoteAddr().String()
在golang中替换字符串中的字符
2019-08-01 12:50

回答 2 已采纳 Strings in Go are immutable, you can't change their content. To change the value of a string varia
GoLang中的模板字符串？
2018-04-02 11:39

回答 1 已采纳 The Solution is sentence := fmt.Sprintf("My Name is %s", name)
go语言字符串换行_Go语言中的字符串处理方法示例详解
2020-12-29 07:42

润姐姐Samantha的博客 1 概述字符串，string，一串固定长度的字符连接起来的字符...反引号：``，用于定义多行字符串，内部会原样解析。示例：// 单行"心有猛虎，细嗅蔷薇"// 多行`大风歌大风起兮云飞扬。威加海内兮归故乡。安得猛士兮守四...
将字符串转换为时间并在golang中解析
2016-11-02 19:31

回答 1 已采纳 You're not correctly providing the layout argument to Parse. You're supposed to be using Mon Jan 2
golang学习【7】字符串和常用数据结构
2024-04-17 22:34

一叶萩Charles的博客 golang字符串和常用数据结构学习介绍
golang实现从串口读取GPS信息
2017-04-25 19:54

逝水-无痕的博客对GPS模块的数据处理本质上还是串口通信程序设计，只是GPS模块的输出遵循固定的格式，通过字符串检索查找即可从模块发送的数据中找出需要的数据，常用的GPS模块大多采用NMEA-0183 协议。NMEA-0183 是美国国家海洋...
没有解决我的问题, 去提问

悬赏问题

¥15 执行 virtuoso 命令后，界面没有，cadence 启动不起来
¥50 comfyui下连接animatediff节点生成视频质量非常差的原因
¥20 有关区间dp的问题求解
¥15 多电路系统共用电源的串扰问题
¥15 slam rangenet++配置
¥15 有没有研究水声通信方面的帮我改俩matlab代码
¥15 ubuntu子系统密码忘记
¥15 保护模式-系统加载-段寄存器
¥15 电脑桌面设定一个区域禁止鼠标操作
¥15 求NPF226060磁芯的详细资料

Golang从字符串中的子字符串解析日期

2条回答 默认 最新

悬赏问题

2条回答默认最新