php PCRE Regex优化

quite new to regexes i'm trying to optimize one, or at least know if there are better ways to do it.

Here is my input string:

$str = 'Some text
spanned on
several lines
txt_to_grab1 fixed_text1 txt_to_grab2
Full line to grab
txt_to_grab3 fixed_text2 txt_to_grab4
Some text after';

I'm trying to grab the lines from "txt_to_grab1" to "txt_to_grab4", but only the words "txt_to_grabX" and the line "Full line to grab".
I want to preserve everything untouched before and after (ie line breaks), but remove line breaks inside the lines i grab (as each line will be a <tr> that'll go into an html table).

Regex patterns/replace i found matching:

$find = "#(?<=
)(.*?) fixed_text1 (.*?)(
.*?
)(.*?) fixed_text2 (.*?)(
)#i";
$replace = '"$1" && "$2" grabbed.$3"$4" && "$5" grabbed.$6';   

$find = "#(.*)(?<=
)(.*?) fixed_text1 (.*?)(
)(.*)(?<=
)(.*?) fixed_text2 (.*?)(
.*)#is";
$replace = '$1"$2" && "$3" grabbed.$4$5"$6" && "$7" grabbed.$8';

Questions :

All questions can be sum up as : are there better/shorter/faster patterns ?

how to make the patterns work with either or ? I read somewhere on stack that (? ) would be a solution, but i dunno how to use them in lookbehinds. For example the following patterns work, but i don't like them (dirty as only are used in lookbehinds, may produce unexpected results):
```
"#(?<=
)(.*?) fixed_text1 (.*?)(?
.*??
)(.*?) fixed_text2 (.*?)(?
)#i"
"#(.*)(?<=
)(.*?) fixed_text1 (.*?)(?
)(.*)(?<=
)(.*?) fixed_text2 (.*?)(?
.*)#is";
```
even better, how to use the "s" modifier to remove all line breaks from the pattern, so being able to use (.*?) but still grabbing what i want ? Word boundaries ?
is the multiline mode (m modifier) useful/helpful here ?

I'd really like the regexes to be explained, if you provide some :)

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doukong5394 2010-11-22 19:17
关注
You don't need lookbehinds for this. Just use the start-of-line anchor at the beginning of your regex and the end-of-line anchor at the end (that's ^ and $ in multiline mode). To match the line separators in the middle you can use (?: |[ ]), a common idiom for the three most common styles of line separator: , , or .

As for the s modifier (a.k.a. "single-line" or "DOT_ALL"), you don't need that either. All it does is allow the dot metacharacter to match line separators as well as all other characters, which doesn't do you any good. You want it to stop matching when it reaches line breaks, so you can exclude them from your captures.

Here's a demo:

$pattern='#^(.*?) fixed_text1 (.*)(?: |[ ])(.*)(?: |[ ])(.*?) fixed_text2 (.*)$#im'; preg_match($pattern, $source, $m); echo "$m[1] && $m[2] grabbed. "; echo "$m[3] "; echo "$m[4] && $m[5] grabbed. ";

output:

txt_to_grab1 && txt_to_grab2 grabbed. Full line to grab txt_to_grab3 && txt_to_grab4 grabbed.

See it in action on ideone.com
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

php PCRE Regex优化 php
2010-11-22 15:11

回答 1 已采纳 You don't need lookbehinds for this. Just use the start-of-line anchor at the beginning of your r
PHP PCRE匹配标点但不是++ php
2016-09-13 19:27

回答 4 已采纳 You have a couple of choices here, one being: (?<!\+)[+#](?!\+) # with lookarounds making sure
子模式中的相同反向引用，php pcre regex [关闭] php
2012-04-23 16:13

回答 1 已采纳 This should work fine with a recent version of PCRE - did you make sure to switch regexbuddy into
php5.2.17安装8.30以上版本pcre库
2022-11-06 07:59

春哥一号的博客 pcre8.30以上版本将pcre_info函数移除，用pcre_fullinfo代替。configure添加--with-pcre-regex=DIR选项。1.下载安装pcre，8.21版本开始pcre支持jit。
php正则表达式在regex101上工作时无效 php
2017-02-22 12:25

回答 1 已采纳 You need to use the u modifier to enable the unicode mode for regular expressions, since that × ch
PHP preg_match（）PCRE逻辑问题？ php
2016-07-20 03:18

回答 1 已采纳 Use /(*UTF8)^(([0-8]\d|\d)°?(\s?([0-5]\d|\d))?)(N|S)?$/ From http://www.pcre.org/pcre.txt: In
PHP PCRE的问题 php
2009-10-14 03:15

回答 5 已采纳 ^\d{1,5}(, *\d{1,5}){0,9}$
regex-parser:用于PCRE正则表达式的AST
2021-04-27 14:35

RegexParser是PCRE regex的解析器。它产生一个代表您的正则表达式的AST。它可以帮助您生成一些与您的正则表达式匹配的输入。安装它可以在： composer install robinbressan/regex-parser 用法要构建AST，您...
PHP PCRE - 匹配“没有” php
2013-12-17 05:14

回答 2 已采纳 You could make it lazy: .*?, that should match nothing every time. Also, you don't have to have
PHP PCRE：匹配大括号内容 php
2012-06-06 12:45

回答 1 已采纳 If you want to match everything from opening brace to close brace: /{[^}]+}/
PHP正则表达式用回调替换多个模式 php
2019-05-23 13:19

回答 2 已采纳 This uses preg_match() and PREG_OFFSET_CAPTURE to return the capture groups and the offset within
PHP pcre backtrack问题
2022-01-06 21:11

inrese的博客这是一个棘手的问题，我找了半天都没找到，它是一个php的正则库，关于它backtrack上限的配置问题，就算运行时用ini初始化修改，但是还是有一个内置上限，很遗憾，我到现在还没解决，欢迎大家在此讨论这个问题。...
Php preg_match_all仅匹配最后一个元素 php
2019-07-19 08:34

回答 2 已采纳 Here is another variant using \G that is bit faster and avoids empty matches: (?:{{([\w-]+(?:\h+[
regex:Regex 是 PHP 库，包含围绕正则表达式库和日常使用的扩展的轻量级包装器
2021-07-13 08:51

Regex是 PHP 库，包含围绕正则表达式库和日常使用的扩展的轻量级包装器。我们尝试解决这些库暴露的错误相关问题，并且它们处理得相当特殊。我们还尝试了统一他们提供的API ，因此大多数用途中，库旨在直接现有的...
php编译时pcre的作用,php – PCRE编译时没有UTF支持
2021-04-20 10:55

Kelly敏的博客 FreeBSD noobie寻求一些帮助,将PCRE和Apache与mod_php集成在一起.我拥有的：> FreeBSD 8.2-RELEASE-p3> Apache / 2.2.22(FreeBSD,从端口构建)> PHP 5.3.10 with Suhosin-Patch(cli)(内置：2012年4月6日02:...
php 安装 zip 扩展报pcre错误
2020-11-06 18:46

凄魅旋律的博客 1. 安装pcre #下载 wget https://netix.dl.sourceforge.net/project/pcre/pcre/8.40/pcre-8.40.tar.gz #解压安装包: tar -zxvf pcre-8.40.tar.gz #进入安装包目录 cd pcre-8.40 #编译安装 ./configure make &amp...
regex-guard:包装器，用于验证正则表达式并处理PCRE编译错误
2021-05-01 08:01

RegexGuard是一个包装程序，可让您验证正则表达式并使API远离无法捕获的PCRE编译警告。作曲家的安装 { " require " : { " regex-guard/regex-guard " : " ~1.0 " } } 通过终端： composer require regex-guard...
php-7.2.26.tar.xz
2020-02-18 18:06

--with-pcre-regex \ --with-jpeg-dir=/usr \ --with-png-dir=/usr \ --with-openssl \ --enable-ftp \ --with-kerberos \ --with-gettext \ --with-xmlrpc \ --with-xsl \ --enable-fpm \ --with-fpm-...
Regexer:解析和修改正则表达式
2021-05-23 14:58

正则表达式用PHP编写的正则表达式解析器。 Regexer允许您将regex作为AST进行操作，有关如何使用此lib的信息，请参见示例目录。
php中pcre裤怎么调_转：php pcre正则表达式完全教程----pcre官方文档
2020-12-21 09:15

weixin_39562089的博客 PCRE简介PCRE扩展的正则表达式会有一个每个线程都可用的全局缓存用来缓存编译后的正则表达式.PCRE在php4.2.0中是默认启用的,可以通过—without-pcre-regex禁用.在php 5.3.0之后,这个扩展不能被禁用.但是仍然可以使用...
没有解决我的问题, 去提问

悬赏问题

¥15 运筹学排序问题中的在线排序
¥15 关于docker部署flink集成hadoop的yarn，请教个问题 flink启动yarn-session.sh连不上hadoop，这个整了好几天一直不行，求帮忙看一下怎么解决
¥30 求一段fortran代码用IVF编译运行的结果
¥15 深度学习根据CNN网络模型，搭建BP模型并训练MNIST数据集
¥15 C++ 头文件/宏冲突问题解决
¥15 用comsol模拟大气湍流通过底部加热（温度不同）的腔体
¥50 安卓adb backup备份子用户应用数据失败
¥20 有人能用聚类分析帮我分析一下文本内容嘛
¥30 python代码，帮调试，帮帮忙吧
¥15 #MATLAB仿真#车辆换道路径规划

php PCRE Regex优化

Questions :

1条回答 默认 最新

悬赏问题

1条回答默认最新