douxi2011 2015-06-11 11:26
浏览 15

解析HTML源代码以进行一些替换

I am trying to parse an HTML source to make some changes on it. Until now I was using Simple HTML Dom Parser but I am facing some memory leaking although using clear() function as in documentation.

So I decided to try PHP DOM and PHPQUERY but both are giving me the exact same problem. Even just loading dom, it is breaking in the exactly same location. It is removing some tags.

php dom

$dom = new DOMDocument;
$dom->loadHTML($html);
$html = $dom->saveHTML();

phpquery

require('phpQuery.php');
$html = phpQuery::newDocumentHTML($html);

source code page before

<input type="radio" id="logo_show0" name="logo_show" value="1" checked="checked" />

...

$(document).ready(function() {
    var tab = $('<li class=" active"><a href="#general" data-toggle="tab">Plugin</a></li>');
    $('#myTabTabs').append(tab);
});

source code page after

<input type="radio" id="logo_show0" name="logo_show" value="1" checked />

...

$(document).ready(function() {
    var tab = $('<li class=" active"><a href="#general" data-toggle="tab">Plugin');
    $('#myTabTabs').append(tab);
});

Has been removed checked and closing tags </a></li>.

What I am doing wrong?

  • 写回答

0条回答 默认 最新

    报告相同问题?

    悬赏问题

    • ¥15 三因素重复测量数据R语句编写,不存在交互作用
    • ¥15 微信会员卡等级和折扣规则
    • ¥15 微信公众平台自制会员卡可以通过收款码收款码收款进行自动积分吗
    • ¥15 随身WiFi网络灯亮但是没有网络,如何解决?
    • ¥15 gdf格式的脑电数据如何处理matlab
    • ¥20 重新写的代码替换了之后运行hbuliderx就这样了
    • ¥100 监控抖音用户作品更新可以微信公众号提醒
    • ¥15 UE5 如何可以不渲染HDRIBackdrop背景
    • ¥70 2048小游戏毕设项目
    • ¥20 mysql架构,按照姓名分表