simple_html_dom无法按预期工作

$html = new \simple_html_dom();
$html -> load_file('h*ttp://xxx.com/article.html');
$res = $html->find('div[id=content]',0)->find('p');

$arr = array();//result set
foreach($res as $v){
    $arr[] = strip_tags($v->plaintext);
}
print_r($arr);//print

I want to scrap content from a webpage,the content is encapsulated in the <div> with ID valued 'content',now,I retrieve every paragraph enclosed with <p>,there are actually another tag <figure> in the div,finally I got results with both <p> And <figure>,<figure> should not be there and what is wrong with me?

DOM structure

div id= content p p figure p figure p p div

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dshyu6866 2014-08-22 11:56
关注
Would this work?

$res = $html->find('#content p');
解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

PHP simple_html_dom无法正确解析Apple维基百科页面 html php
2015-03-22 17:28

回答 1 已采纳 Change MAX_FILE_SIZE constant in simple_html_dom.php to, e.g. define('MAX_FILE_SIZE', 800000);
Wordpress simple_html_dom.php管理页面 php
2018-11-27 23:12

回答 2 已采纳 I was able to solve this by looking at file_get_contents(): stream does not support seeking ／ When
为什么找不到div？（simple_html_dom） html php
2017-12-23 17:50

回答 2 已采纳 So the Solution: For some unkown reason I needed to find the div/tag I was searching for by count
phpQuery和simple_html_dom DOM解析器对比
2017-12-14 22:44

人间四月天美丽春色的博客 phpQuery和simple_html_dom都是非常优秀的DOM解析器。 phpQuery主要使用方法，更多方法查看http://code.google.com/p/phpquery/ 1.加载文档的几种方式 1 2 3 4 5 6 //...
在simple_html_dom中设置超时 php
2016-02-04 13:26

回答 1 已采纳 You can not do that with simple_html_dom() or file_get_contents() or any other 'pure' PHP. For th
使用带有ajax的simple_html_dom [重复] ajax html php
2015-02-11 13:32

回答 1 已采纳 try with this <?php require_once '../library/Simple_HTML_DOM/simple_html_dom.php'; // Create
simple_html_dom.php内存问题 php
2011-11-26 16:44

回答 2 已采纳 $html->clear; if this is your actual code then you may want to change it to function call: $h
phpQuery 和 simple_html_dom对比
2018-04-18 17:17

GoverChan的博客 phpQuery和simple_html_dom都是非常优秀的DOM解析器。phpQuery主要使用方法，更多方法查看http://code.google.com/p/phpquery/1.加载文档的几种方式123456//$html为内容字符串，$contentType为文档类型，如果不指定...
如何使用PHP Simple HTML dom获取此文本？ html php
2015-10-30 23:20

回答 2 已采纳 Maybe this will give you the result you are looking for: foreach($info_html->find('div.info p'
simple_html_dom访问div里面的ul php
2017-03-14 10:19

回答 1 已采纳 You have to select <ul> inside $element by using $dom = $dom->find($element.' ul', 0)-&g
如何在PHP中使用simple_html_dom导入多个URL？ php
2018-06-22 09:14

回答 2 已采纳 I got an answer. <?php if(!empty($_FILES["excel_file"])) { $connect = mysqli_connect("loc
PHP Simple HTML DOM Parser: 简易且高效的HTML解析库
2024-03-26 09:44

gitblog_00004的博客 PHP Simple HTML DOM Parser: 简易且高效的HTML解析库项目地址:https://gitcode.com/sunra/php-simple-html-dom-parser PHP Simple HTML DOM Parser 是一个轻量级的PHP库，专为解析和操作HTML文档而设计。它提供了...
处理来自外部库的错误（simple_html_dom） php
2014-10-11 14:52

回答 2 已采纳 You can use the error_get_last() function to get info about the last error. You might also conside
php xingnengfenxi_PHP 性能分析第三篇: 性能调优实战
2020-12-21 04:54

weixin_39602571的博客而在第二篇中，我们深入研究了 XHGui UI，现在最后一篇，让我们把 XHProf /XHGui 的知识用到工作中！性能调优不用运行的代码才是绝好的代码。其他只是好的代码。所以，性能调优时，最好的选择是首先确保运行尽...
php读写xml文件操作,PHP5操作XML的新特性控制读写 dom
2021-04-10 14:16

weixin_39617405的博客这篇文章的面向对象是所有对PHP5的XML新功能感兴趣的各个水平的PHP开发者。我们假定读者掌握XML的基本知识。然而，如果你已经在你的PHP当中使用了XML，那么这篇文章也会让你受益非浅。介绍在当今的互联网世界，XML...
没有解决我的问题, 去提问

悬赏问题

¥15 python的qt5界面
¥15 无线电能传输系统MATLAB仿真问题
¥50 如何用脚本实现输入法的热键设置
¥20 我想使用一些网络协议或者部分协议也行，主要想实现类似于traceroute的一定步长内的路由拓扑功能
¥30 深度学习，前后端连接
¥15 孟德尔随机化结果不一致
¥15 apm2.8飞控罗盘bad health，加速度计校准失败
¥15 求解O-S方程的特征值问题给出边界层布拉休斯平行流的中性曲线
¥15 谁有desed数据集呀
¥20 手写数字识别运行c仿真时，程序报错错误代码sim211-100

simple_html_dom无法按预期工作

1条回答 默认 最新

悬赏问题

1条回答默认最新