preg_replace中的正则表达式检测url格式并提取元素

I need to replace certain user-entered URLs with embedded flash objects...and I'm having trouble with a regex that I'm using to match the url...I think mainly because the URLs are SEO-friendly and therefore a bit more difficult to parse

URL structure: http://www.site.com/item/item_title_that_can_include_1('_etc-32CHARACTERALPHANUMERICGUID

I need to both detect a match of an URL in that format and capture the 32CHARACTERALPHANUMERICGUID which is always placed after the - in the url

something like this:

$ret = preg_replace('#http://www\.site\.com/item/([^-])-([a-zA-Z0-9]+)#','<embed>itemid=$2</embed>', $ret);

For some reason, the above does not find a match for an URL in the specified format. I'm new to regexes, so I think I'm missing something fairly obvious.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
drbfxb977777 2010-10-25 01:39
关注
You should check out parse_url().

Examine the results - it was made for parsing URLs. You'll be able to extract the data you require from the tokens returned.

If you are regex crazy, try this...

/^http:\/\/www\.site\.com\/item\/[^-]*\-([a-zA-Z0-9]{32})$/

Your example is almost there, but...

When you do the not character range, i.e. [^-], you still need a quantifier. I placed *, or 0 or more.

You don't seem to use the item title, so we won't bother capturing it.

You should use beginning (^) and end ($) anchors if the string is always exactly like that.

You say the GUID is 32 chars, so we may as well explicitly state that with the {32} quantifier.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

使用preg_replace在正则表达式中使用特殊字符的问题 php
2018-11-08 14:16

回答 2 已采纳 In my case the solution was to use this regex: (Test)(?![^>]*?[^<]*?<\/a>)(?![^>]*
preg_replace中的正则表达式检测url格式并提取元素 php
2010-10-25 01:35

回答 1 已采纳 You should check out parse_url(). Examine the results - it was made for parsing URLs. You'll be a
使用preg_replace的正则表达式 php
2014-05-10 17:44

回答 1 已采纳 Something like this should work: preg_replace('#\*\s+([a-z]+)\s+$([a-z]+)$\s+\*#', '<b id="$
js正则表达式获取后缀名_php – 正则表达式从URL中提取文件扩展名
2020-12-30 15:23

文艺范理工生的博客我正在寻找一个与以下URI中的.js相匹配的正则表达式：/foo/bar/file.js?cache_key=123我正在编写一个函数,试图识别作为参数传入的文件类型.在这种情况下,文件以扩展名.js结尾,并且是一个javascript文件.我正在使用...
需要帮助为preg_replace编写正则表达式 php
2014-04-09 19:23

回答 1 已采纳 I'm a little confused by your post as you are saying that you want to "extract" the parameters, bu
使用preg_match组合正则表达式 php
2017-10-12 02:38

回答 1 已采纳 You can try consolidating everything into the following single regex: 000\.(?:[36]|000\.|100\.1|4
preg_replace和正则表达式帮助 php
2011-09-08 06:15

回答 3 已采纳 preg_replace("/(\?|\&)sort=\w+/", "", $curPageUrl); It will accept '&' or '?'.
php 正则提取图片链接,php 正则表达式提取图片url程序
2021-03-28 08:12

朝辞暮归的博客 //提取图片路径的src的正则表达式preg_match_all("/]+>/isU",$content,$matches);$img = "";if(!empty($matches)) {//注意，上面的正则表达式说明src的值是放在数组的第三个中$img = $matches[2];}else {$img = ...
计算preg_replace正则表达式中的出现次数 php
2014-06-27 13:08

回答 1 已采纳 As per the manual, there is an optional 5th parameter $count, which will be set to the number of r
PHP用preg_match_all正则多个关键字怎么写? php
2017-11-30 05:36

回答 8 已采纳 []改为() ``` $pattaern0='/(你好|中国|国家|新年|娱乐|程序|羁绊|www\\.baidu\\.com|google)+/u'; ```
带有多个正则表达式的PHP preg_replace_callback php
2012-07-17 00:41

回答 1 已采纳 Make it optional "#\{gallery: '(.+?)'(?: dir: '(.+?)')?\}#i" (?:text) creates a non-capturing g
php正则表达式匹配url参数,匹配url参数的正则实例分享
2021-04-02 08:36

weixin_40003767的博客 =]*)/g解释：前后的斜杠/是正则表达式的分隔符,最后的g表示全局匹配,匹配到第一个之后不会停下来,会继续匹配,相当于PHP里的preg_match_all,没有g就相当于preg_match,下面有例子说明.()表示子组.[^]表示字符类取反,...
preg_replace_callback：正则表达式搜索和替换 php
2013-01-25 00:41

回答 1 已采纳 Don't use global here; you're already using a closure, so use the use: function ($m) use ($skip_b
php 正则提取url,php 正则表达式提取网页超级链接url的函数
2021-03-23 20:51

风与sunshine的博客 function match_links($document) {preg_match_all("']+))[^>]*>?(.*?)'isx",$document,$links);while(list($key,$val) = each($links[2])) {if(!empty($val))$match['link'][] = $val;}while(list($key,$val)...
php正则表达式提取url,php 正则表达式提取图片url程序
2021-04-25 11:50

zibuyu9的博客先用正则表达式获取IMG标签，然后把每个IMG标签的SRC抽取出来，并且组合成自己的内容，最后进行替换我想对 html 的图片进行提取.如上地址. 我想全部提取出来但是包含'ico' 的地址忽略. 求正则 , 就是有些图片提取...
java正则表达式中括号_Java正则表达式获取中括号之间的内容
2021-03-01 09:02

范楚杰的博客不包含中括号正则表达式如下：\\[(.*?)]注：.匹配除换行符\n之外的任何单字符；*匹配前面的子表达式零次或多次；?匹配前面的子表达式零次或一次；()标记一个子表达式的开始和结束位置；\[匹配[字符。[是特殊字符需要...
php中用斜杠代替问号,preg_replace去除斜杠 - php
2021-03-24 08:57

weixin_39811101的博客我有这段代码非常适合剥离查询字符串，但是它留下了斜杠preg_replace('/\?.*$/', '', $_SERVER['REQUEST_URI'])如果我的URL是www.mysite.com/myPage?querystring=123，则上面的代码使我留有/myPage。我该如何进行...
php正则表达式查找html内容,如何利用PHP的正则表达式来获取HTML中的内容
2021-04-23 06:07

清清凉凉甜甜的的博客话题：如何利用PHP的正则表达式来获取HTML中的内容回答：preg_match('/(.*?)/',$str,$result);$str就是上面的html里面的内容，$result就是匹配到的字符串，你可以print_r($result)；看看里面就有你要的结果，或者...
php正则表达式详解,PHP正则表达式详解
2021-03-23 18:15

油葫芦阅金经的博客在几乎所有的基于UNIX/LINUX系统的软件工具中找到正则表达式的痕迹，例如：Perl或PHP脚本语言。此外，JavaScript这种客户端的脚本语言也提供了对正则表达式的支持，现在正则表达式已经成为了一个通用的概念和工具，...
php正则表达式 结尾,php正则表达式的基本语法总结
2021-04-07 08:06

funny 灵魂的博客就可以得到是否为email了 正则表达式的其他用法提取字符串 ereg() and eregi() 有一个特性是允许用户通过正则表达式去提取字符串的一部分(具体用法你可以阅读手册). 比如说,我们想从 path/URL 提取文件名 – 下面的...
没有解决我的问题, 去提问

悬赏问题

¥15 在不同的执行界面调用同一个页面
¥20 基于51单片机的数字频率计
¥50 M3T长焦相机如何标定以及正射影像拼接问题
¥15 keepalived的虚拟VIP地址 ping -s 发包测试，只能通过1472字节以下的数据包（相关搜索：静态路由）
¥20 关于#stm32#的问题：STM32串口发送问题，偶校验（even),发送5A 41 FB 20.烧录程序后发现串口助手读到的是5A 41 7B A0
¥15 C++map释放不掉
¥15 Mabatis查询数据
¥15 想知道lingo目标函数中求和公式上标是变量情况如何求解
¥15 关于E22-400T22S的LORA模块的通信问题
¥15 求用二阶有源低通滤波将3khz方波转为正弦波的电路

preg_replace中的正则表达式检测url格式并提取元素

1条回答 默认 最新

悬赏问题

1条回答默认最新