Golang正则表达式以匹配关键字对之间的多种模式

I have a string which has two keywords: "CURRENT NAME(S)" and "NEW NAME(S)" and each of these keywords are followed by a bunch of words. I want to extract those set of words beyond each of these keywords. To elaborate with a code:

    s := `"CURRENT NAME(S)
 Name1, Name2",,"NEW NAME(S)
NewName1,NewName2"`
    re := regexp.MustCompile(`"CURRENT NAME(S).*",,"NEW NAME(S).*"`)

    segs := re.FindAllString(s, -1)
    fmt.Println("segs:", segs)

    segs2 := re.FindAllStringSubmatch(s, -1)
    fmt.Println("segs2:", segs2)

As you can see, the string 's' has the input. "Name1,Name2" is the current names list and "NewName1, NewName2" is the new names list. I want to extract these two lists. The two lists are separated by a comma. Each of the keywords are beginning with a double quote and their reach ends, when their corresponding double quote ends.

What is the way to use regexp such that the program can print "Name1, Name2" and "NewName1,NewName2" ?

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doushai7225 2016-07-25 08:29
关注
The issue with your regex is that the input string contains newline symbols, and . in Go regex does not match a newline. Another issue is that the .* is a greedy pattern and will match as many symbols as it can up to the last second keyword. Also, you need to escape parentheses in the regex pattern to match the ( and ) literal symbols.

The best way to solve the issue is to change .* into a negated character class pattern [^"]* and place it inside a pair of non-escaped ( and ) to form a capturing group (a construct to get submatches from the match).

Here is a Go demo:

package main import ( "fmt" "regexp" ) func main() { s := `"CURRENT NAME(S) Name1, Name2",,"NEW NAME(S) NewName1,NewName2"` re := regexp.MustCompile(`"CURRENT NAME\(S\)\s*([^"]*)",,"NEW NAME\(S\)\s*([^"]*)"`) segs2 := re.FindAllStringSubmatch(s,-1) fmt.Printf("segs2: [%s; %s]", segs2[0][1], segs2[0][2]) }

Now, the regex matches:

"CURRENT NAME\(S\) - a literal string "CURRENT NAME(S)`

\s* - zero or more whitespaces

([^"]*) - Group 1 capturing 0+ chars other than "

",,"NEW NAME\(S\) - a literal string ",,"NEW NAME(S)

\s* - zero or more whitespaces

([^"]*) - Group 2 capturing 0+ chars other than "

" - a literal "
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(2条)

报告相同问题？

关注问题

非零基础自学Golang 第16章 正则表达式 16.1 正则表达式介绍 & 16.2 正则表达式语法
2022-12-22 14:37

蓝色的烧烤的博客非零基础自学Golang 第16章 正则表达式 16.1 正则表达式介绍 & 16.2 正则表达式语法
Golang —— 正则表达式
2020-12-11 11:07

JIAYU.的博客 Go语言通过regexp标准包为正则表达式提供了官方支持，如果你已经使用过其他编程语言提供的正则相关功能，那么你应该对Go语言版本的不会太陌生，但是它们之间也有一些小的差异，因为Go实现的是RE2标准，除了\C。...
在 Go 语言中使用正则表达式提取所有匹配字符串的方法详解
2025-07-28 07:22

gopher.guo的博客本文介绍在Go语言中使用regexp标准库提取所有正则匹配项的方法。...文章提供了提取邮箱、商品ID等实用示例，并总结常见问题排查方法和常用正则表达式速查表。通过合理使用这些正则函数，可高效处理文本提取需求。
Go多个正则表达式查找的区别
2023-06-16 20:30

小龙在山东的博客 Find查找最左侧第一个。...给分组命名为key，然后通过Expand将匹配到的key按照模板的样式输出给result，value同理，另外匹配必须要用。FindAllIndex会返回匹配到的分片（开始和结束索引）。这两个方法不会返回子匹配。
【golang】正则表达式 查找和替换字符
2019-01-13 01:53

一筐大白菜啊的博客 1) 正则表达式的描述模式， 1.1。连接操作连接操作就是匹配连接后的结果有 hello和 go 两个单词将它们连接起来，用正则表式为 (hello)(go)，就是连接操作，连接操作必须满足这这几个要求才能匹配成功匹配一...
golang 爬虫修炼04 ---利用正则提取数据
2024-06-25 17:20

Jaysen13的博客函数的主要作用是将正则表达式中，奇形怪状的号(如.*[.)转换成 Go 语言能识别的格式，并将其存成结构体格式，方便编译器识别。返回值:返回成功匹配的二维数组[][][][][][][][][] []string。通常传-1，表示匹配所有。...
go语言正则表达式之匹配特定中文字（转码篇）
2018-08-28 14:14

weixin_34219944的博客 str2 := "小是正则表达式要匹配的字符串包含中英文vsdfvsva猪cas飞天demicgwegr//lwe;rgc了" fmt.Println(str2) fmt.Println("请匹配：字符串是否以“小”开头中间按顺序有“d”，有“猪”，以“了“结尾”") ...
Golang_18: Go语言 正则表达式
2023-05-26 21:15

谢TS的博客 Go 语言 正则表达式 处理使用内置的 regexp 模块。
正则表达式引擎算法
2024-10-01 15:33

你一身傲骨怎能输的博客 正则表达式引擎的底层运行原理是将正则表达式转化为一种可以高效执行的自动机模型（NFA或DFA），并通过匹配算法来查找文本中的匹配项。优化技术和前瞻断言等特性进一步提升了引擎的性能和灵活性。理解这些原理有助于...
SQL注入与正则表达式
2021-02-17 22:22

弈-剑的博客 > /* 输出：I love */ 正则表达式 正则表达式，又称为规则表达式（Regular Expression，简写为regex、regexp或RE），常用来检索、替换某些符合某个模式（规则）的文本。 PHP中使用正则规则一定要加代表正则的标识// ...
没有解决我的问题, 去提问

Golang正则表达式以匹配关键字对之间的多种模式

3条回答 默认 最新

3条回答默认最新