douwo1862 2012-07-13 12:52
浏览 212
已采纳

解析XML,未知编码的问题

I'm trying to read an XML feed, I'm not sure the encoding is proper, but it's set to UTF-8 and when I try to parse it in PHP via SimpleXML, it errors on "BöðVar" (note the special "o" characters).

libxml_use_internal_errors(TRUE);
$XMLOutputXMLObj = simplexml_load_string($xml_string);
if($XMLOutputXMLObj !== FALSE)
{
//do stuff
}

This is all I get for an error:

Entity 'ouml' not defined

Entity 'eth' not defined

I tried using "mb_convert_encoding", in various ways, but that failed.

How can I resolve this issue for any character? IE WITHOUT manually replacing ö with &214; (with # of course)?

Even better... is there a way to make it so SimpleXML doesn't care what it is parsing, as long as the tags are intact?

Thanks

  • 写回答

1条回答 默认 最新

  • duanliaoyin3171 2012-07-13 12:58
    关注

    Have you tried to escape the XML data in the node using the <![CDATA[ and ]]> tags before and after the node's text/value? E.g.

    <?xml version="1.0" encoding="UTF-8"?>
    <fmsdata>
      <result><![CDATA[Success !@#$%^&*()]]></result>
    </fmsdata>
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 在不同的执行界面调用同一个页面
  • ¥20 基于51单片机的数字频率计
  • ¥50 M3T长焦相机如何标定以及正射影像拼接问题
  • ¥15 keepalived的虚拟VIP地址 ping -s 发包测试,只能通过1472字节以下的数据包(相关搜索:静态路由)
  • ¥20 关于#stm32#的问题:STM32串口发送问题,偶校验(even),发送5A 41 FB 20.烧录程序后发现串口助手读到的是5A 41 7B A0
  • ¥15 C++map释放不掉
  • ¥15 Mabatis查询数据
  • ¥15 想知道lingo目标函数中求和公式上标是变量情况如何求解
  • ¥15 关于E22-400T22S的LORA模块的通信问题
  • ¥15 求用二阶有源低通滤波将3khz方波转为正弦波的电路