duanreng3439 2013-01-01 00:03

Capturing a string without its conditional ending

I'm trying to capture the middle part of a URL that may or may not have a conditional ending.

The URL can take one of two forms:

/a/b/(part/needed)
/a/b/(part/needed)/page/#

Here's the regexp I use:

preg_match('@/a/b/(.*)(/page/\d)?@i', '/a/b/some/text/page/1', $matches);

It returns:

0=>"/a/b/some/text/page/1",
1=>"some/text/page/1"

That works, but the capture includes the conditional ending, which I don't want!

How can I keep the conditional ending out of the capture while still matching whether the last segment is present or absent?

1 answer

  • doufu9947 2013-01-01 00:09

    Anchor the expression with ^ and $ and make the first group non-greedy, (.*?), to get the segment you need. On its own, .* is greedy: it consumes everything up to the end of the string, which leaves the optional (/page/\d)? group nothing to match.

    preg_match('@^/a/b/(.*?)(/page/\d)?$@i', '/a/b/some/text/page/1', $matches);
    //-----------^-------^^^-----------^
    print_r($matches);
    Array
    (
        [0] => /a/b/some/text/page/1
        [1] => some/text
        [2] => /page/1
    )
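
To see why both changes are needed, here is a minimal side-by-side sketch (the variable names $greedy, $lazyOnly and $anchored are only illustrative). It suggests that making the group lazy without anchoring is not enough: an unanchored lazy group is satisfied with zero characters.

```php
<?php
$url = '/a/b/some/text/page/1';

// Greedy and unanchored (the pattern from the question): (.*) consumes the
// rest of the string, so the optional group matches nothing.
preg_match('@/a/b/(.*)(/page/\d)?@i', $url, $greedy);

// Lazy but still unanchored: (.*?) is satisfied with zero characters and the
// optional group is skipped, so the capture comes back empty.
preg_match('@/a/b/(.*?)(/page/\d)?@i', $url, $lazyOnly);

// Lazy and anchored: $ forces the match to reach the end of the string, so
// (.*?) grows only until the optional /page/N suffix can take the rest.
preg_match('@^/a/b/(.*?)(/page/\d)?$@i', $url, $anchored);

echo $greedy[1], "\n";      // some/text/page/1
var_dump($lazyOnly[1]);     // string(0) ""
echo $anchored[1], "\n";    // some/text
```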
    

    If you don't need the /page/1, make that a non-capturing group (?:...).

    preg_match('@^/a/b/(.*?)(?:/page/\d)?$@i', '/a/b/some/text/more/page/1', $matches);
    //----------------------^^^
    print_r($matches);
    Array
    (
        [0] => /a/b/some/text/more/page/1
        [1] => some/text/more
    )
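
The same pattern could be wrapped in a small helper that handles both URL forms. This is only a sketch: extractMiddle is a hypothetical name, and \d+ in place of \d is an assumption that page numbers may have more than one digit. With a single \d and the $ anchor, a URL ending in /page/12 would fall through to the lazy group and come back as part of the capture. The nullable return type assumes PHP 7.1 or later.

```php
<?php
// Hypothetical helper, not part of the original answer.
function extractMiddle(string $url): ?string
{
    // \d+ (an assumption about the URL scheme) accepts multi-digit pages.
    // (?:...) keeps the optional suffix out of the captured groups.
    if (preg_match('@^/a/b/(.*?)(?:/page/\d+)?$@i', $url, $m) === 1) {
        return $m[1];
    }
    return null; // the URL does not start with /a/b/
}

var_dump(extractMiddle('/a/b/some/text'));              // string(9) "some/text"
var_dump(extractMiddle('/a/b/some/text/page/1'));       // string(9) "some/text"
var_dump(extractMiddle('/a/b/some/text/more/page/12')); // string(14) "some/text/more"
var_dump(extractMiddle('/x/y/other'));                  // NULL
```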
    

    regular-expressions.info has good information on character repetition with + and *, and the pitfalls of greediness.

    Accepted by the asker as the best answer.
