How to extract a string following a pattern with grep, regex, or Perl

I have a file that looks something like this:

<table name="content_analyzer" primary-key="id">
  <type="global" />
</table>
<table name="content_analyzer2" primary-key="id">
  <type="global" />
</table>
<table name="content_analyzer_items" primary-key="id">
  <type="global" />
</table>

I need to extract anything within the quotes that follow "name=", i.e., content_analyzer, content_analyzer2 and content_analyzer_items.

I am doing this on a Linux box, so a solution using sed, perl, grep or bash is fine.

Reposted from: https://stackoverflow.com/questions/5080988/how-to-extract-string-following-a-pattern-with-grep-regex-or-perl

必承其重 | 欲带皇冠 2011-02-22 17:21
Since name=" has to be matched but is not part of the desired result, some form of zero-width matching or group capturing is required. This is easy to do with any of the following tools:

Perl

With Perl you can use the -n option to loop over the input line by line and print the contents of a capturing group whenever the pattern matches:

perl -ne 'print "$1\n" if /name="(.*?)"/' filename

GNU grep

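If you have an improved version of grep, such as GNU grep, you may have the -P option available. This option enables Perl-compatible regular expressions, which lets you use \K, a shorthand for a lookbehind. \K resets the start of the reported match, so everything matched before it is left out of the output. The (?=") lookahead at the end likewise requires the closing quote without including it in the match.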

grep -Po 'name="\K.*?(?=")' filename

The -o option makes grep print only the matched text instead of the whole line.
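
A small sketch of the command above on part of the question's sample. The temporary file, the variable names, and the support check are assumptions added for this demo; -P is not available in every grep build. The second pattern is an alternative not taken from the original answer: on this input, a negated character class gives the same result as the lazy quantifier plus lookahead.

```shell
# Part of the sample from the question, written to a temp file for this demo.
sample=$(mktemp)
printf '%s\n' \
  '<table name="content_analyzer" primary-key="id">' \
  '  <type="global" />' \
  '</table>' \
  '<table name="content_analyzer2" primary-key="id">' > "$sample"

# Only run the demo if this grep accepts -P.
if printf 'x\n' | grep -qP 'x' 2>/dev/null; then
  # \K drops name=" from the reported match; (?=") requires the closing quote without consuming it.
  lazy=$(grep -Po 'name="\K.*?(?=")' "$sample")
  # Alternative: match everything up to, but not including, the next double quote.
  negated=$(grep -Po 'name="\K[^"]*' "$sample")
  printf '%s\n' "$lazy"
else
  echo 'this grep does not support -P' >&2
fi

rm -f "$sample"
```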

Vim - Text Editor

Another way is to use a text editor directly. With Vim, one of the various ways of accomplishing this would be to delete lines without name= and then extract the content from the resulting lines:

:v/name=/d
:%s/\v.*name\="([^"]+)".*/\1

Standard grep

If you don't have access to these tools for some reason, something similar can be achieved with standard grep. However, without lookaround the output still includes name=" and the closing quote, so it needs some cleanup afterwards:

grep -o 'name="[^"]*"' filename
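
The cleanup step is not spelled out in the answer. One possible sketch, assuming cut and sed are available, is to pipe the grep output through either of them. The temporary file and variable names are illustrative only.

```shell
# Part of the sample from the question, written to a temp file for this demo.
sample=$(mktemp)
printf '%s\n' \
  '<table name="content_analyzer" primary-key="id">' \
  '  <type="global" />' \
  '</table>' \
  '<table name="content_analyzer_items" primary-key="id">' > "$sample"

# The raw grep output still carries name=" and the closing quote.
raw=$(grep -o 'name="[^"]*"' "$sample")

# Option 1: split each line on double quotes and keep the second field.
with_cut=$(printf '%s\n' "$raw" | cut -d'"' -f2)

# Option 2: strip the fixed prefix and the trailing quote with sed.
with_sed=$(printf '%s\n' "$raw" | sed -e 's/^name="//' -e 's/"$//')

printf '%s\n' "$with_cut"

rm -f "$sample"
```

Both options leave only the attribute values, one per line.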

A note about saving results

All of the commands above write their results to stdout. To save them, redirect the output to a file by appending:

> result

to the end of the command.