dongyi8795 2016-11-08 05:04

PHP DOM object strips some custom attributes

I am trying to take an entire site into a DOM object like this:

$dom = new DOMDocument('1.0');
$dom->loadHTMLFile('http://thissite.com');

so that I can manipulate it and save a template.

However, some elements (<a> tags, as far as I have noticed) seem to have their custom attributes stripped out, so that:

<a href="/link/to/page/" aria-haspopup="true">Link Name</a>

changes to:

<a href="/link/to/page/">Link Name</a>

Is there any way to stop this happening?

UPDATE: It looks like this was not the issue. I will leave an answer below to explain, which may help others.


1 answer

  • dongqianwei6664 2016-11-08 21:56

    So the issue was not DOMDocument stripping a custom attribute. The attribute was inserted later via JavaScript, and my saved copy of the page was simply getting the wrong link to the JavaScript file.

    I was looking at "inspect element" and not the page source when troubleshooting. If you have this issue, look at the original page source (not the inspector view) and compare it with the DOMDocument output (echo $dom->saveHTML();) to see whether the attribute, or anything else, differs between the two.

    If the two are the same, DOMDocument is not the issue, and you will need to check your JavaScript (for example, whether the script links are relative).
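The check described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the sample markup, the script URLs (`/js/menu.js`, `https://cdn.example.com/lib.js`), and the helper names `loadDom` and `relativeScriptSources` are hypothetical and not from the original site. It parses an HTML string, confirms that `aria-haspopup` survives the round trip through DOMDocument, and lists any `<script>` links that are relative and would therefore break once the template is served from another location.

```php
<?php
// Hypothetical helper: parse an HTML string, silencing libxml parser warnings.
function loadDom(string $html): DOMDocument
{
    $dom = new DOMDocument('1.0');
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);
    return $dom;
}

// Hypothetical helper: collect the src of every <script> whose URL has no
// scheme (http:, https:, ...) and is not protocol-relative (//host/...).
function relativeScriptSources(DOMDocument $dom): array
{
    $relative = [];
    foreach ($dom->getElementsByTagName('script') as $script) {
        $src = $script->getAttribute('src');
        if ($src !== '' && !preg_match('#^([a-z][a-z0-9+.-]*:|//)#i', $src)) {
            $relative[] = $src;
        }
    }
    return $relative;
}

// Assumed sample standing in for the original page source (not the inspector view).
$source = '<html><head><script src="/js/menu.js"></script>'
    . '<script src="https://cdn.example.com/lib.js"></script></head>'
    . '<body><a href="/link/to/page/" aria-haspopup="true">Link Name</a></body></html>';

$dom = loadDom($source);

// Was the attribute kept after parsing?
$link = $dom->getElementsByTagName('a')->item(0);
$attributeKept = $link->hasAttribute('aria-haspopup');

// Which script links would resolve against the wrong base URL?
$relativeScripts = relativeScriptSources($dom);

echo $attributeKept ? "aria-haspopup kept\n" : "aria-haspopup stripped\n";
echo "Relative script links: " . implode(', ', $relativeScripts) . "\n";
```

With this sample the attribute should be reported as kept and `/js/menu.js` as the only relative script link. If that matches what you see for your own page source, the place to look is the script URLs rather than DOMDocument.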

    This answer was selected by the asker as the best answer.
