dsa1230000 2015-06-24 17:05
浏览 39
已采纳

过早捕获PHP正则表达式的组

I have HTML stored in a MySQL database that I am migrating to a new WordPress installation from Joomla. I need to remove some caption text at the bottom of each page.

An example of the HTML:

<a href="some/link">link 1</a><p>some really long description</p><a href="another/link">link 2</a>CAPTION TEXT HERE[/caption]

I am using a PHP script to query the database and do the regex matching.

My regex thus far:

/(<\/a>)(.*?)(\[\/caption\])/

I need to remove the 2nd caption group (CAPTION TEXT HERE) entirely, so in essence replacing Groups 1,2 and 3 with Groups 1 and 3. Group 2 can contain any alphanumeric or special character.

The problem I am running into is that capture group 1 is matching the closing anchor tag for link 1 and continuing until the [/caption]

What happens is:

</a><p>some really long description</p><a href="another/link">link 2</a>CAPTION TEXT HERE[/caption]

gets replaced with:

<a href="some/link">link 1</a>[/caption]

when what I really need is:

<a href="some/link">link 1</a><p>some really long description</p><a href="another/link">link 2</a>[/caption]

Thank you in advance!

  • 写回答

1条回答 默认 最新

  • douwen1213 2015-06-24 17:16
    关注

    Male it to not include > in matched text

    (<\/a>)([^>]*?)(\[\/caption\])
    

    Demo

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥20 测距传感器数据手册i2c
  • ¥15 RPA正常跑,cmd输入cookies跑不出来
  • ¥15 求帮我调试一下freefem代码
  • ¥15 matlab代码解决,怎么运行
  • ¥15 R语言Rstudio突然无法启动
  • ¥15 关于#matlab#的问题:提取2个图像的变量作为另外一个图像像元的移动量,计算新的位置创建新的图像并提取第二个图像的变量到新的图像
  • ¥15 改算法,照着压缩包里边,参考其他代码封装的格式 写到main函数里
  • ¥15 用windows做服务的同志有吗
  • ¥60 求一个简单的网页(标签-安全|关键词-上传)
  • ¥35 lstm时间序列共享单车预测,loss值优化,参数优化算法