How to capture "multiple" repeated groups with a regular expression

I have the following text file that I would like to parse to get the individual fields:

host_group_web = ( )
host_group_lbnorth = ( lba050 lbhou002 lblon003 )

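The fields that I would like to extract are the group name after host_group_ and the node names between the parentheses (web in the first line; lbnorth, lba050, lbhou002 and lblon003 in the second):

host_group_web = ( )
host_group_lbnorth = ( lba050 lbhou002 lblon003 )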

host_group_web has no items between the ( ), so that portion would be ignored.

I've named the first group nodegroup and the items between the ( ) nodes.

I am reading the file line by line, and storing the results for further processing.

In Go, this is the regex snippet I am using:

hostGroupLine := "host_group_lbnorth = ( lba050 lbhou002 lblon003 )"
hostGroupExp := regexp.MustCompile(`host_group_(?P<nodegroup>[[:alnum:]]+)\s*=\s*\(\s*(?P<nodes>[[:alnum:]]+\s*)`)
hostGroupMatch := hostGroupExp.FindStringSubmatch(hostGroupLine)

for i, name := range hostGroupExp.SubexpNames() {
  if i != 0 {
    fmt.Println("GroupName:", name, "GroupMatch:", hostGroupMatch[i])
  }
}

I get the following output, which is missing the rest of the matches for the named group nodes:

GroupName: nodegroup GroupMatch: lbnorth
GroupName: nodes GroupMatch: lba050

The Snippet in Golang Playground

My question is: how do I write a regex in Go that matches the nodegroup and all the nodes that may be in the line, e.g. lba050 lbhou002 lblon003? The number of nodes varies, from zero to arbitrarily many.


1 answer

doutu3352 2016-11-05 19:18

If you want to capture the group name and all possible node names, you should work with a different regex pattern. This one should capture all of them in one go. No need to work with named capture groups but you can if you want to.

hostGroupExp := regexp.MustCompile(`host_group_([[:alnum:]]+)|([[:alnum:]]+) `)

hostGroupLine := "host_group_lbnorth = ( lba050 lbhou002 lblon003 )"
hostGroupMatch := hostGroupExp.FindAllStringSubmatch(hostGroupLine, -1)

fmt.Printf("GroupName: %s\n", hostGroupMatch[0][1])
for i := 1; i < len(hostGroupMatch); i++ {
    fmt.Printf("  Node: %s\n", hostGroupMatch[i][2])
}

See it in action in playground
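
If you would rather keep the named groups from the question, one possible variation is to capture everything between the parentheses in a single group and split it on whitespace afterwards. Go's regexp package reports only the last match of a repeated capture group, so repeating the nodes group cannot return every node by itself. The sketch below is not from the original answer: parseHostGroup is a hypothetical helper name, the pattern assumes node names never contain a closing parenthesis, and SubexpIndex assumes Go 1.15 or later.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// hostGroupExp captures the group name and the raw text between the parentheses.
var hostGroupExp = regexp.MustCompile(`^host_group_(?P<nodegroup>[[:alnum:]]+)\s*=\s*\((?P<nodes>[^)]*)\)`)

// parseHostGroup is a hypothetical helper. It returns the group name, the
// node names (possibly none), and whether the line matched the pattern.
func parseHostGroup(line string) (string, []string, bool) {
	m := hostGroupExp.FindStringSubmatch(line)
	if m == nil {
		return "", nil, false
	}
	group := m[hostGroupExp.SubexpIndex("nodegroup")]
	// strings.Fields splits on runs of whitespace and yields no items for "( )".
	nodes := strings.Fields(m[hostGroupExp.SubexpIndex("nodes")])
	return group, nodes, true
}

func main() {
	lines := []string{
		"host_group_lbnorth = ( lba050 lbhou002 lblon003 )",
		"host_group_web = ( )",
	}
	for _, line := range lines {
		if group, nodes, ok := parseHostGroup(line); ok {
			fmt.Printf("GroupName: %s Nodes: %q\n", group, nodes)
		}
	}
}
```

An empty group such as host_group_web still matches here and simply comes back with zero nodes, so the caller can decide whether to ignore it.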

Alternative:

You can also parse the way awk would: use a regular expression to split each line into tokens and print the tokens you need. This assumes every line has the same layout as the one in your example.

package main

import (
    "fmt"
    "regexp"
)

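// printGroupName expects the tokens of one line split on "_" or " ":
// [host group NAME = ( node... )], so NAME is tokens[2] and the nodes
// run from tokens[5] up to, but not including, the closing ")".
func printGroupName(tokens []string) {
    fmt.Printf("GroupName: %s\n", tokens[2])
    for i := 5; i < len(tokens)-1; i++ {
        fmt.Printf("  Node: %s\n", tokens[i])
    }
}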

func main() {

    // regexp line splitter (either _ or space)
    r := regexp.MustCompile(`_| `)

    // lines to parse
    hostGroupLines := []string{
        "host_group_lbnorth = ( lba050 lbhou002 lblon003 )",
        "host_group_web = ( web44 web125 )",
        "host_group_web = ( web44 )",
        "host_group_lbnorth = ( )",
    }

    // split lines on regexp splitter and print result
    for _, line := range hostGroupLines {
        hostGroupMatch := r.Split(line, -1)
        printGroupName(hostGroupMatch)
    }

}

See it in action in playground

This answer was accepted by the asker as the best answer.
