Golang正则表达式以匹配关键字对之间的多种模式

I have a string which has two keywords: "CURRENT NAME(S)" and "NEW NAME(S)" and each of these keywords are followed by a bunch of words. I want to extract those set of words beyond each of these keywords. To elaborate with a code:

    s := `"CURRENT NAME(S)
 Name1, Name2",,"NEW NAME(S)
NewName1,NewName2"`
    re := regexp.MustCompile(`"CURRENT NAME(S).*",,"NEW NAME(S).*"`)

    segs := re.FindAllString(s, -1)
    fmt.Println("segs:", segs)

    segs2 := re.FindAllStringSubmatch(s, -1)
    fmt.Println("segs2:", segs2)

As you can see, the string 's' has the input. "Name1,Name2" is the current names list and "NewName1, NewName2" is the new names list. I want to extract these two lists. The two lists are separated by a comma. Each of the keywords are beginning with a double quote and their reach ends, when their corresponding double quote ends.

What is the way to use regexp such that the program can print "Name1, Name2" and "NewName1,NewName2" ?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doushai7225 2016-07-25 08:29
关注
The issue with your regex is that the input string contains newline symbols, and . in Go regex does not match a newline. Another issue is that the .* is a greedy pattern and will match as many symbols as it can up to the last second keyword. Also, you need to escape parentheses in the regex pattern to match the ( and ) literal symbols.

The best way to solve the issue is to change .* into a negated character class pattern [^"]* and place it inside a pair of non-escaped ( and ) to form a capturing group (a construct to get submatches from the match).

Here is a Go demo:

package main import ( "fmt" "regexp" ) func main() { s := `"CURRENT NAME(S) Name1, Name2",,"NEW NAME(S) NewName1,NewName2"` re := regexp.MustCompile(`"CURRENT NAME$S$\s*([^"]*)",,"NEW NAME$S$\s*([^"]*)"`) segs2 := re.FindAllStringSubmatch(s,-1) fmt.Printf("segs2: [%s; %s]", segs2[0][1], segs2[0][2]) }

Now, the regex matches:

"CURRENT NAME$S$ - a literal string "CURRENT NAME(S)`

\s* - zero or more whitespaces

([^"]*) - Group 1 capturing 0+ chars other than "

",,"NEW NAME$S$ - a literal string ",,"NEW NAME(S)

\s* - zero or more whitespaces

([^"]*) - Group 2 capturing 0+ chars other than "

" - a literal "
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(2条)

报告相同问题？

关注问题

Golang正则表达式以匹配关键字对之间的多种模式
2016-07-25 06:40

回答 3 已采纳 The issue with your regex is that the input string contains newline symbols, and . in Go regex doe
如何使用Golang正则表达式查找完全匹配的单词？
2018-12-20 15:44

回答 1 已采纳 Use the zero-length word boundry sequence \b: https://play.golang.org/p/-f0KEKb2EbF regexp.MatchS
正则表达式匹配golang中不以www开头的字符串
2018-10-04 13:48

回答 2 已采纳 If you're really bent on creating a negative lookahead manually, you will need to exclude all poss
非零基础自学Golang 第16章 正则表达式 16.1 正则表达式介绍 & 16.2 正则表达式语法
2022-12-22 14:37

Ding Jiaxiong的博客非零基础自学Golang 第16章 正则表达式 16.1 正则表达式介绍 & 16.2 正则表达式语法
Golang正则表达式替换字符串之间
2019-02-20 21:55

回答 1 已采纳 You may use (MYSTRING=).* and replace with ${1}foo. See the online Go regex demo. Here, (MYSTR
Golang正则表达式匹配并替换某个字符串后的第一个匹配项
2019-02-26 23:27

回答 1 已采纳 You may use ReplaceAllStringFunc and use a regex like (?m)^bar:(?: \s{4}.*)+ See the regex demo
Golang正则表达式匹配字符串，直到给定的字符序列
2019-01-09 16:21

回答 3 已采纳 You could capturin first part with -name in a group, then match what is in between and use an opti
Golang —— 正则表达式
2020-12-11 11:07

JIAYU.的博客 Go语言通过regexp标准包为正则表达式提供了官方支持，如果你已经使用过其他编程语言提供的正则相关功能，那么你应该对Go语言版本的不会太陌生，但是它们之间也有一些小的差异，因为Go实现的是RE2标准，除了\C。...
Go中字符串末尾的正则表达式匹配失败
2019-08-26 07:12

回答 2 已采纳 You may use re := regexp.MustCompile(`(?:\[\d{2}])+(.*)`) match := re.FindStringSubmatch(s) if le
Golang正则表达式提取2个定界符之间的文本-包括定界符
2017-05-21 21:49

回答 2 已采纳 The regex is: (?s)PATTERN BEGINS HERE.*?\); where (?s) is a flag to let .* match multiple lines
如何在Golang中使用正则表达式获取url模式？ http
2015-05-27 06:03

回答 3 已采纳 http.HandleFunc() can not be used to register a pattern to match a regular expression. In short, t
Go多个正则表达式查找的区别
2023-06-16 20:30

小龙在山东的博客 Find查找最左侧第一个。...给分组命名为key，然后通过Expand将匹配到的key按照模板的样式输出给result，value同理，另外匹配必须要用。FindAllIndex会返回匹配到的分片（开始和结束索引）。这两个方法不会返回子匹配。
Golang正则表达式报价子匹配
2015-07-28 16:58

回答 2 已采纳 Ok, that's it: http://play.golang.org/p/h2w-9-XFAt Regex: ^token="?([^"]*)"?$ MATCHES [token=lll
【golang】正则表达式 查找和替换字符
2019-01-13 01:53

一筐大白菜啊的博客 1) 正则表达式的描述模式， 1.1。连接操作连接操作就是匹配连接后的结果有 hello和 go 两个单词将它们连接起来，用正则表式为 (hello)(go)，就是连接操作，连接操作必须满足这这几个要求才能匹配成功匹配一...
Golang_18: Go语言 正则表达式
2023-05-26 21:15

谢TS的博客 Go 语言 正则表达式 处理使用内置的 regexp 模块。
没有解决我的问题, 去提问

悬赏问题

¥15 用stata实现聚类的代码
¥15 请问paddlehub能支持移动端开发吗？在Android studio上该如何部署？
¥170 如图所示配置eNSP
¥20 docker里部署springboot项目，访问不到扬声器
¥15 netty整合springboot之后自动重连失效
¥15 悬赏！微信开发者工具报错，求帮改
¥20 wireshark抓不到vlan
¥20 关于#stm32#的问题：需要指导自动酸碱滴定仪的原理图程序代码及仿真
¥20 设计一款异域新娘的视频相亲软件需要哪些技术支持
¥15 stata安慰剂检验作图但是真实值不出现在图上

Golang正则表达式以匹配关键字对之间的多种模式

3条回答 默认 最新

悬赏问题

3条回答默认最新