PHP：正则表达式搜索文件中的模式并将其拾取

I am really confused with regular expressions for PHP.

Anyway, I cant read the whole tutorial thing now because I have a bunch of files in html which I have to find links in there ASAP. I came up with the idea to automate it with a php code which it is the language I know.

so I think I can user this script :

$address = "file.txt"; 
$input = @file_get_contents($address) or die("Could not access file: $address");
$regexp = "??????????"; 
if(preg_match_all("/$regexp/siU", $input, $matches)) { 
    // $matches[2] = array of link addresses 
   // $matches[3] = array of link text - including HTML code 
}

My problem is with $regexp

My required pattern is like this:

href="/content/r807215r37l86637/fulltext.pdf" title="Download PDF

I want to search and get the /content/r807215r37l86637/fulltext.pdf from above lines which I have many of them in the files.

any help?

==================

edit

title attributes are important for me and all of them which I want, are titled

title="Download PDF"

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

5条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doubu1964 2011-02-11 20:25
关注
Once again regexp are bad for parsing html.

Save your sanity and use the built in DOM libraries.

$dom = new DOMDocument(); @$dom->loadHTML($html); $x = new DOMXPath($dom); $data = array(); foreach($x->query("//a[@title='Download PDF']") as $node) { $data[] = $node->getAttribute("href"); }

Edit Updated code based on ircmaxell comment.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(4条)

报告相同问题？

关注问题

mysql 正则regrx_ORACLE 常用正则表达式详解
2021-03-03 21:17

光露的博客 Oracle使用离不开这4个...regexp_like 只能用于条件表达式，和 like 类似，但是使用的正则表达式进行匹配，语法很简单：regexp_substr函数，和 substr 类似，用于拾取合符正则表达式描述的字符子串，语法如下：reg...
(转)正则表达式在ORACLE中的使用
2016-09-02 13:26

weixin_30763455的博客 Oracle使用正则表达式离不开这4个函数： 1。regexp_like 2。regexp_substr 3。regexp_instr 4。regexp_replace 看函数名称大概就能猜到有什么用了。 regexp_like只能用于条件表达式，和 like 类似，但是使用...
oracle中用正则判断,Oracle中正则表达式的使用实例教程
2021-05-02 04:31

weixin_39683692的博客前言正则表达式已经在很多软件中得到广泛的应用，包括*nix(Linux, Unix等)，HP等操作系统，PHP，C#，Java等开发环境。本文主要介绍了关于Oracle中正则表达式的使用方法，下面话不多说了，来一起看看详细的介绍。...
Oracle中的正则表达式
2019-08-06 20:40

weixin_38168081的博客 Oracle使用正则表达式离不开这4个函数： 1。regexp_like 2。regexp_substr 3。regexp_instr 4。regexp_replace 看函数名称大概就能猜到有什么用了。参考：--- ...
oracle中正则表达式的使用例子,Oracle中正则表达式的使用实例教程
2021-05-02 08:44

weixin_39582737的博客前言正则表达式已经在很多软件中得到广泛的应用，包括*nix(Linux,Unix等)，HP等操作系统，PHP，C#，Java等开发环境。本文主要介绍了关于Oracle中正则表达式的使用方法，下面话不多说了，来一起看看详细的介绍。...
php preg_match 符号,PHP：preg_match;无法匹配£符号
2021-05-04 04:05

万俟灵儿的博客我有一些数据要运行正则表达式.作为参考,原始文档以iso-8859-15编码,如果这有任何区别.这是一个使用正则表达式的函数;if(preg_match("{£\d+\.\d+}", $handle)) //{echo 'Found a match';}else{echo 'No match found...
php 向图片添加文本和水印（意想不到的坑）
2025-03-10 10:33

baddl1992的博客类名：Image.php。同时往图片中添加文本和水印（当text和water 同时存在时，需先用text再使用water, 不然有可能字体颜色设置无效）
php 用日期生成随机数_用PHP生成随机图像
2020-08-17 23:12

cungui5726的博客 php 用日期生成随机数One of the principles of creating a popular site is dynamism: making the front page look different each time a user visits. Obviously the most rewarding way to do this is by adding...
php 查询成绩_与专家讨论PHP：成绩单
2020-08-13 22:59

culi3118的博客 php 查询成绩Talk with the Experts this morning was a busy one, with close to a record number of attendees. Hardly surprising really, considering that the subject was PHP. Luckily for me, the experts ...
与PHP对抗招聘者垃圾邮件-概念证明
2020-08-30 18:16

culi3182的博客 Example uses of our app: 在本教程中，我们将开始构建一个自定义电子邮件处理器，该处理器可以读取单个电子邮件，通过一些预定义规则运行它们并对其执行操作。最终结果将与许多供应商提供的即用型产品非常相似，...
没有解决我的问题, 去提问

PHP：正则表达式搜索文件中的模式并将其拾取

edit

5条回答 默认 最新

5条回答默认最新