Echo the contents of <a> tags with class="pret" from an HTML document

I have an HTML document in a PHP variable, $content. I can echo it, but I only need the <a ...> tags with class="pret". Once I have them, I need the non-word part (a code such as d3852) from each tag's href attribute and the number (e.g. 2352.2345) between <a> and </a>.

I have tried several examples from the web, but I get either empty arrays or PHP errors.

A regex example that gives me an empty array (the <a> tag is inside a table):

$pattern = "#<table\s.*?>.*?<a\s.*?class=[\"']pret[\"'].*?>(.*?)</a>.*?</table>#i";
preg_match_all($pattern, $content, $results);
print_r($results[1]);

Another example, which just gives an error:

$a=$content->getElementsByTagName(a);

Reasons for the various errors: invalid HTML and non-UTF-8 characters.

Afterwards I did the same on another website and matched the contents in a single SQL table; the result is a copy of the website with updated data from my country. I no longer have to search the web for individual matching results.

2 answers

dongxing5525 2013-04-20 18:42

Assuming you are trying to parse a valid (or at least valid enough) HTML document, you should use DOM for this:

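// Simple example adapted from the comments in the PHP manual
$xml = new DOMDocument();
$xml->loadHTML($content); // or $xml->loadHTMLFile($url) to read straight from a URL
$links = array();

// Collect the href and the text of every <a> element
foreach ($xml->getElementsByTagName('a') as $link) {
    $links[] = array('url' => $link->getAttribute('href'),
                     'text' => $link->nodeValue);
}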

Note the use of loadHTML rather than load (it is more robust against errors). You may also set DOMDocument::recover (as suggested in a comment by hakre) so that the parser tries to recover from errors.
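To make the point about error tolerance concrete, here is a minimal sketch of the DOM approach on slightly broken markup. The sample $content and its href format are made up for illustration, and libxml_use_internal_errors() is my addition to keep parser warnings out of the output; treat it as one way to do it rather than the only one.

```php
<?php
// Hypothetical sample input; in the question this would be the real $content.
// Note the <td> and <tr> elements are never closed.
$content = '<table><tr><td><a class="pret" href="item.php?id=d3852">2352.2345</a>'
         . '<td><a href="other.php">skip me</a></table>';

// Collect parser warnings internally instead of emitting PHP warnings.
libxml_use_internal_errors(true);

$doc = new DOMDocument();
$doc->recover = true;      // ask libxml to try to recover from malformed markup
$doc->loadHTML($content);  // loadHTML is the lenient HTML parser, unlike load/loadXML
libxml_clear_errors();

$links = array();
foreach ($doc->getElementsByTagName('a') as $link) {
    // Keep only anchors whose class attribute is exactly "pret".
    if ($link->getAttribute('class') !== 'pret') {
        continue;
    }
    $links[] = array(
        'url'  => $link->getAttribute('href'),
        'text' => trim($link->nodeValue),
    );
}

print_r($links);
```

Only the first anchor should end up in $links, since the second one has no class attribute.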

Or you could use XPath:

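$xpath = new DOMXPath($xml); // $xml is the DOMDocument loaded above
// Matches only when the class attribute is exactly "pret"
$elements = $xpath->query("//a[@class='pret']");

// query() returns false if the expression is malformed
if ($elements !== false) {
    foreach ($elements as $element) {
        $links[] = array('url' => $element->getAttribute('href'),
                         'text' => $element->nodeValue);
    }
}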
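If the anchors can carry more than one class, an exact @class comparison will miss them. The sketch below matches "pret" as a whole class token and then pulls out the two values the question asks for. The sample markup, the assumed code format (a letter followed by digits) and the $rows array are hypothetical, so adjust the patterns to your real data.

```php
<?php
// Hypothetical sample input standing in for the real $content.
$content = '<p><a class="pret promo" href="/produs/d3852">2352.2345</a> '
         . '<a class="pretty" href="/x">n/a</a></p>';

libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTML($content);
libxml_clear_errors();

$xpath = new DOMXPath($doc);
// Match "pret" as a whole class token, not as a substring of "pretty".
$query = "//a[contains(concat(' ', normalize-space(@class), ' '), ' pret ')]";

$rows = array();
foreach ($xpath->query($query) as $a) {
    // Assumed formats: a letter followed by digits in href, a decimal number as text.
    $code  = preg_match('/[a-z]\d+/i', $a->getAttribute('href'), $m) ? $m[0] : null;
    $price = preg_match('/\d+(?:\.\d+)?/', $a->nodeValue, $n) ? (float) $n[0] : null;
    $rows[] = array('code' => $code, 'price' => $price);
}

print_r($rows);
```

The second anchor is skipped because "pretty" is a different class token, which a plain contains(@class, 'pret') test would not catch.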

And in the case of invalid HTML you may use a regexp like this:

$a1 = '\s*[^\'"=<>]+\s*=\s*"[^"]*"'; # Attribute with " - space tolerant
$a2 = "\s*[^'\"=<>]+\s*=\s*'[^']*'"; # Attribute with ' - space tolerant
$a3 = '\s*[^\'"=<>]+\s*=\s*[\w\d]*'; # Unquoted values - space tolerant
# [^'"=<>]* # Junk - I'm not inserting this to regexp but you may have to

$a = "(?:$a1|$a2|$a3)*"; # Any number of attributes
$class = 'class=([\'"])pret\\1'; # Using ?: carefully is crucial for \\1 to work
                                 # otherwise you can use ["']
# ~ delimiters; i = case-insensitive, s = dot also matches newlines
$reg = "~<a{$a}\s*{$class}{$a}\s*>(.*?)</a~is";

And then just use preg_match_all. (All regexps are written off the top of my head; you may have to debug them.)
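For completeness, a rough sketch of the preg_match_all step. The pattern pieces are restated so the block runs on its own; the ~...~is delimiters and flags, the third attribute alternative $a3, the sample $content and the extra href lookup are assumptions of this sketch, not something tested against your pages.

```php
<?php
// Attribute patterns: double-quoted, single-quoted and unquoted values.
$a1 = '\s*[^\'"=<>]+\s*=\s*"[^"]*"';
$a2 = "\s*[^'\"=<>]+\s*=\s*'[^']*'";
$a3 = '\s*[^\'"=<>]+\s*=\s*[\w\d]*';
$a  = "(?:$a1|$a2|$a3)*";          // any number of attributes, non-capturing
$class = 'class=([\'"])pret\\1';   // group 1 is the quote character
$reg = "~<a{$a}\s*{$class}{$a}\s*>(.*?)</a~is";

// Hypothetical sample input standing in for the real $content.
$content = "<table><tr><td>\n<a href='p.php?c=d3852' class=\"pret\">2352.2345</a>\n</td></tr></table>";

preg_match_all($reg, $content, $matches, PREG_SET_ORDER);

$results = array();
foreach ($matches as $m) {
    // $m[0] is the matched tag, $m[1] the quote character, $m[2] the link text.
    $href = preg_match('~href\s*=\s*(["\'])(.*?)\1~is', $m[0], $h) ? $h[2] : null;
    $results[] = array('href' => $href, 'text' => trim($m[2]));
}

print_r($results);
```

PREG_SET_ORDER groups the captures per match, which makes it easier to keep each href together with its link text.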
