PHP loadHTML错误解决方案会产生xPath查询问题

I had a problem with loading HTML including null bytes, and I applied the bug fix as shown here: PHP DOM loadHTML() method unusual warning

The thing is that now, any query I do on that "fixed" HTML will give no results at all.

This is what I do:

$opts = array('http' => array('header' => 'Accept-Charset: UTF-8, *;q=0'));
$context = stream_context_create($opts);
$html=file_get_contents('http://actualidad.rt.com/ultima_hora',false,$context);
$html=mb_convert_encoding($html, 'UTF-8', mb_detect_encoding($html, 'UTF-8, ISO-8859-1', true));
$html=str_replace("\0", '', $html); //Avoid PHP BUG https://stackoverflow.com/questions/30925533/php-dom-loadhtml-method-unusual-warning
$this->dom->loadHTML($html, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD);
$xpath=new DOMXPath($this->dom);
$COUNTDIVS=$xpath->query('//div');

$COUNTDIVS has zero elements, while the real HTML has a whole bunch of div tags.

And, the code is working fine with websites where the bug doesn't apply.

How could I fix it?

Thanks a lot.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

报告相同问题？

关注问题

Xpath循环问题，用于将简单的HTML表解析为php数组 html php
2019-02-27 07:51

回答 1 已采纳 $strhtml=' <table id="Details" class="DATA_TABLE DATA_TABLE_WO_TOTAL"> <tr> <
PHP和XPath查询 php
2017-04-12 18:17

回答 1 已采纳 There are a few approaches to do this. First of all, you should register the namespace: $xml->
Xpath查询返回部分空值（PHP） php xml
2016-09-28 11:42

回答 1 已采纳 If you do something like: $xml = simplexml_load_string($tmpstr); $smsts = $xml->xpath('//TS');
您如何在PHP中解析和处理HTML / XML？
2019-12-04 10:40

asdfgh0077的博客如何解析HTML / XML并从中提取信息？
PHP $ xpath->查询循环 php
2019-02-28 08:35

回答 1 已采纳 Because you're using single quotes your resulting query string looks exactly like this (with $i an
在PHP中使用XPath替换XML属性 php xml
2019-06-11 17:26

回答 1 已采纳 The answer as Nigel Ren suggested was just to remove these two lines, as they no longer apply: $
简单的xpath查询不起作用 html php
2014-12-09 22:11

回答 1 已采纳 There is nothing wrong with your xpath query as it is correct syntax and the node does exist. The
前端面试题总结
2021-10-31 23:39

煜成'Studio的博客 Ajax readyState表示xhr对象的请求状态，取值范围是0——4，分别表示5个不同的状态。 0：（未初始化）xhr对象已经... 27请基于vue框架，实现一个父组件调用子组件方法的示例方案一：通过ref直接调用子组件的方法； ...
将php变量传递给xpath不起作用 html php
2017-05-19 01:35

回答 1 已采纳 Use double-quotes for the variable $index to be correctly substituted with the corresponding value
在Xpath查询中排除链接 php
2018-12-23 22:25

回答 1 已采纳 You can exclude link text nodes from results with //div[@class="intro"]//text()[not(parent::a)]
PHP SimpleXMLElement xpath php
2018-03-22 19:02

回答 1 已采纳 This gives me an empty array! No it doesn't. Look closely at your output, and you will see th
前端面试一
2019-03-21 11:33

赏花赏景赏时光的博客前端面试题第一阶段 HTML、CSS、HTML5、CSS3 1、XHTML、HTML、XML的异同 XHTML-Extensible Hypertext Markup Language：可扩展超文本标记语言，以 XML 应用的方式定义的 HTML，更严格更纯净的 HTML 版本 ...
使用PHP和xPath从HTML中提取数据 html php
2013-04-12 12:42

回答 2 已采纳 Each Company can be represented by a context-node while having each property represented by an xpa
一份百度前端面试题：
2016-12-24 14:26

nevercurtain的博客在网上看见一份很不错的百度前端面试题，自己可以对前边知识做一个总结，也算是对自己知识的一个补充吧。当然文章是转载的，侵删！！随着各大互联网公司设立了Web前端开发工程师、设计工程师等职位，web前端...
前端页面优化
2016-08-29 11:59

小圣贤君的博客前端是庞大的，包括HTML、CSS、Javascript、Image、Flash等等各种各样的资源。前端优化是复杂的，针对方方面面的资源都有不同的方式。那么，前端优化的目的是什么 1. 从用户角度而言，优化能够让页面加载得...
没有解决我的问题, 去提问

悬赏问题

¥15 随身WiFi网络灯亮但是没有网络，如何解决？
¥15 gdf格式的脑电数据如何处理matlab
¥20 重新写的代码替换了之后运行hbuliderx就这样了
¥100 监控抖音用户作品更新可以微信公众号提醒
¥15 UE5 如何可以不渲染HDRIBackdrop背景
¥70 2048小游戏毕设项目
¥20 mysql架构，按照姓名分表
¥15 MATLAB实现区间[a,b]上的Gauss-Legendre积分
¥15 delphi webbrowser组件网页下拉菜单自动选择问题
¥15 linux驱动，linux应用，多线程

码龄粉丝数原力等级 --

PHP loadHTML错误解决方案会产生xPath查询问题

0条回答默认最新

悬赏问题

PHP loadHTML错误解决方案会产生xPath查询问题

0条回答 默认 最新

悬赏问题

0条回答默认最新