drus40229 2015-12-11 04:47
浏览 40
已采纳

PHP XML输出中的字符编码问题

I am using a server-side PHP to call an API and return its content read by a client-side JS XMLHttpRequest(). The problem is, my PHP is returning letters like á as á. Here's the snippet of PHP that's causing this:

$dom = new DOMDocument;
$dom->loadHTML($meaning);
    foreach ($dom->getElementsByTagName('a') as $node) {
        $link_text =  $node->nodeValue;
        $link_href = $node->getAttribute('href');
        if (strpos($link_href,'www.somelink.com/something/') !== false) {
            $node->setAttribute('href', 'http://localhost:8888/a-s/#/mylink/' . $link_text);
        }
    }
echo $dom->saveHTML();

Taking cue from a couple of answers on similar questions on SO, I tried using HTMLEntities like so:

echo htmlentities($dom->saveHTML(), ENT_QUOTES, 'ISO-8859-15');

However, this further garbled up the output. What this did was throw the entire result in a raw xml format devoid of all formatting. What earlier looked like this:

enter image description here

Now looks like this:

enter image description here

Funny thing is, when I don't use HTMLEntities(), only the first instance of á gets rendered as á. If you look at the first image, the second instance onward, á is rendered as á without any problem!

  • 写回答

1条回答 默认 最新

  • donglu4633 2015-12-11 05:02
    关注

    Add this to your code before outputting:

    utf8_decode();
    echo $dom->saveHTML();
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 用hfss做微带贴片阵列天线的时候分析设置有问题
  • ¥50 我撰写的python爬虫爬不了 要爬的网址有反爬机制
  • ¥15 Centos / PETSc / PETGEM
  • ¥15 centos7.9 IPv6端口telnet和端口监控问题
  • ¥120 计算机网络的新校区组网设计
  • ¥20 完全没有学习过GAN,看了CSDN的一篇文章,里面有代码但是完全不知道如何操作
  • ¥15 使用ue5插件narrative时如何切换关卡也保存叙事任务记录
  • ¥20 海浪数据 南海地区海况数据,波浪数据
  • ¥20 软件测试决策法疑问求解答
  • ¥15 win11 23H2删除推荐的项目,支持注册表等