drvkf88226 2019-03-15 01:09
浏览 43

loadHTML只返回javascript,它不返回源

I am trying to get HTML content of the page, but when I do that, my file_get_contents only returns piece of JS.

when I use just file_get_contents I can see the content but I am trying to use loadHTML as well. So using the function like below.

function getContent(String $url): \DOMDocument
    {
        $opts = array(
            'http' => array(
            'method'=>"GET",
            'header'=>"Content-Type: text/html; charset=utf-8"
            )
        );

        $context = stream_context_create($opts);
        $result = file_get_contents($url,false,$context);

        $parser = new \DOMDocument();
        $parser->loadHTML($result = mb_convert_encoding($result,'HTML-ENTITIES', 'ASCII, JIS, UTF-8, EUC-JP, SJIS'));

        return $parser;

    }

But this time, I am having DOMDocument::loadHTML(): Tag header invalid in Entity So tried to overcome this problem I used libxml_use_internal_errors(true); this but, then I am just returning JS part of page...

How can I overcome issue?

The links is here: https://lions-mansion.jp/MF161037/

  • 写回答

0条回答 默认 最新

    报告相同问题?

    悬赏问题

    • ¥15 用hfss做微带贴片阵列天线的时候分析设置有问题
    • ¥50 我撰写的python爬虫爬不了 要爬的网址有反爬机制
    • ¥15 Centos / PETSc / PETGEM
    • ¥15 centos7.9 IPv6端口telnet和端口监控问题
    • ¥120 计算机网络的新校区组网设计
    • ¥20 完全没有学习过GAN,看了CSDN的一篇文章,里面有代码但是完全不知道如何操作
    • ¥15 使用ue5插件narrative时如何切换关卡也保存叙事任务记录
    • ¥20 海浪数据 南海地区海况数据,波浪数据
    • ¥20 软件测试决策法疑问求解答
    • ¥15 win11 23H2删除推荐的项目,支持注册表等