preg_match_all: reading site source across multiple lines and matching

I read my own website's source with file_get_contents to extract specific text. The page lists interviews, and I want to pull each interview's headline and duration so I can use them on another site (as a link to the interview).

The relevant code block is in a table.

<td>
    Interview 1
    <small style="color:gray">
        Persons 2
        Cameras 2
    </small>
</td>
<td>
    1018 min
</td>

As you can see, Interview 1 is the headline and the time is 1018. I tried this on my own but somehow the pattern got a little crazy.

preg_match_all('#<td>\s*(.+?)\s*<small style="color:gray">\s*<\/small>\s*<\/td><td>\s*(.+?)\s*<\/td>#is', $mysite, $match)

I used \s* for the line breaks and spaces and (.+?) to match. What's wrong with my search pattern?

dongshou9343 2016-06-18 17:38
First, you should use a parser for this; regexes on HTML tend to behave unexpectedly. There are two issues with your regex, though.

Issue one:

<small style="color:gray">\s*<\/small>

That element doesn't contain only whitespace: it holds text (Persons 2, Cameras 2), which \s* cannot match.

Issue two:

<\/td><td>

There is a newline between the closing </td> and the following <td>, and the pattern does not allow for it.
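Both issues can be checked in isolation. The following is a minimal sketch, assuming the page source is exactly the snippet from the question; the variable names and the `$results` array are illustrative and not part of the original post. It uses `#` as the delimiter, as the question does, so the slashes need no escaping.

```php
<?php
// Sample markup copied from the question (assumed to be the whole input).
$mysite = <<<HTML
<td>
    Interview 1
    <small style="color:gray">
        Persons 2
        Cameras 2
    </small>
</td>
<td>
    1018 min
</td>
HTML;

// Issue one: \s* only permits whitespace inside <small>, but it holds text.
$smallOriginal = '#<small style="color:gray">\s*</small>#is';
$smallFixed    = '#<small style="color:gray">.+?</small>#is';

// Issue two: the original allows nothing between </td> and the next <td>.
$cellsOriginal = '#</td><td>#i';
$cellsFixed    = '#</td>\s*<td>#i';

// preg_match returns 1 when the pattern matches and 0 when it does not.
$results = [
    'small original' => preg_match($smallOriginal, $mysite),
    'small fixed'    => preg_match($smallFixed, $mysite),
    'cells original' => preg_match($cellsOriginal, $mysite),
    'cells fixed'    => preg_match($cellsFixed, $mysite),
];

foreach ($results as $label => $count) {
    echo $label, ': ', $count, PHP_EOL;
}
```

Each "original" sub-pattern reports 0 and each "fixed" one reports 1, which is why the full pattern from the question finds nothing.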

So:

<td>\s*(.+?)\s*<small style="color:gray">.+?<\/small>\s*<\/td>\s*<td>\s*(.+?)\s*<\/td>

should work for you (for this static example). If the content of the small element is optional, change the + in its .+? to a *. Note also that with a parser these wouldn't have been issues.
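For reference, here is a hedged sketch of both routes on the sample markup: the corrected pattern run through preg_match_all, then the same extraction with PHP's bundled DOMDocument parser. Several things here are assumptions rather than part of the original answer: the `#...#is` delimiters and flags are taken from the question, `\s*` is used between the two cells so that Windows line endings would also match, the snippet is wrapped in `<table><tr>` so the parser sees well-formed table markup, and all variable names are illustrative.

```php
<?php
// Sample markup copied from the question (assumed to be the whole input).
$mysite = <<<HTML
<td>
    Interview 1
    <small style="color:gray">
        Persons 2
        Cameras 2
    </small>
</td>
<td>
    1018 min
</td>
HTML;

// Route 1: the corrected regex. The "s" flag lets "." match newlines.
$pattern = '#<td>\s*(.+?)\s*<small style="color:gray">.+?</small>\s*</td>'
         . '\s*<td>\s*(.+?)\s*</td>#is';

$regexRows = [];
preg_match_all($pattern, $mysite, $matches, PREG_SET_ORDER);
foreach ($matches as $m) {
    // $m[1] is the headline, $m[2] is the time cell such as "1018 min".
    // The integer cast keeps the leading digits and drops " min".
    $regexRows[] = [$m[1], (int) $m[2]];
}

// Route 2: a parser. Whitespace and nested tags need no special handling.
libxml_use_internal_errors(true); // keep parser warnings out of the output
$doc = new DOMDocument();
$doc->loadHTML('<table><tr>' . $mysite . '</tr></table>');
libxml_clear_errors();

$cells = $doc->getElementsByTagName('td');
// The headline is the first text node of the first cell, before <small>.
$headlineDom = trim($cells->item(0)->firstChild->textContent);
$timeDom     = trim($cells->item(1)->textContent);

foreach ($regexRows as [$headline, $minutes]) {
    echo "Regex: $headline, $minutes min", PHP_EOL;
}
echo "DOM:   $headlineDom, $timeDom", PHP_EOL;
```

On a real page with many rows, the parser route would more likely iterate over each `<tr>` and read its cells, which avoids depending on the exact whitespace or the `style` attribute.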
This answer was selected by the asker as the best answer.

Question tags: html, php
Asked: 2016-06-18 17:19
