从外部页面获取XML数据并使用PHP解析它

I'm trying to create a database of World of Warcraft gems. If I go to this page:

http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=purple&searchType=items

And go to View Source in Firefox, I see a tonne of XML data which is exactly what I want. I wrote up this quick script to try and parse some of it:

<?php

$gemUrls = array(
                 'Blue' => 'http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=blue&searchType=items',
                 'Red' => 'http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=red&searchType=items',
                 'Yellow' => 'http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=yellow&searchType=items',
                 'Meta' => 'http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=meta&searchType=items',
                 'Green' => 'http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=green&searchType=items',
                 'Orange' => 'http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=orange&searchType=items',
                 'Purple' => 'http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=purple&searchType=items',
                 'Prismatic' => 'http://www.wowarmory.com/search.xml?fl[source]=all&fl[type]=gems&fl[subTp]=purple&searchType=items'
                 );


// Get blue gems

$blueGems = file_get_contents($gemUrls['Blue']);

$xml = new SimpleXMLElement($blueGems);

echo $xml->items[0]->item;

?>

But I get a load of errors like this:

Warning: SimpleXMLElement::__construct() [simplexmlelement.--construct]: Entity: line 20: parser error : xmlParseEntityRef: no name in C:\xampp\htdocs\WoW\index.php on line 19

Warning: SimpleXMLElement::__construct() [simplexmlelement.--construct]: if(Browser.iphone && Number(getcookie2("mobIntPageVisits")) < 3 && getcookie2( in C:\xampp\htdocs\WoW\index.php on line 19

I'm not sure what's wrong. I think file_get_contents() is bringing back data that isn't XML, maybe some Javascript files judging by the iPhone parts in the errors.

Is there any way to just get back the XML from that page? Without any HTML or anything?

Thanks :)

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dsfdsfds521521 2010-06-17 02:39
关注
What is returned is an xhtml, it's xml-ish, but not good enough for an XML parser. To use SimpleXMLElement you would need well-formed XML. From the documentation of the constructor:

Method signature:

__construct ( string $data [, int $options [, bool $data_is_url [, string $ns [, bool $is_prefix ]]]] )

$data is described as:

A well-formed XML string or the path or URL to an XML document if data_is_url is TRUE.

So, random webpage will not satisfy this parser. You ask:

"Is there any way to just get back the XML from that page? Without any HTML or anything?"

You can contact the webmasters and find out if they have an XML view of the data. Failing that, you could use a plain HTML parser to try and extract data. I like PHP Simple HTML DOM Parser. Check out How to implement a web scraper in PHP?
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

从外部页面获取XML数据并使用PHP解析它 php xml
2010-06-17 02:13

回答 1 已采纳 What is returned is an xhtml, it's xml-ish, but not good enough for an XML parser. To use SimpleXM
使用PHP从外部XML文件获取数据 php xml
2014-12-14 15:45

回答 1 已采纳 $xml->result->geometry->location->lat
使用XMLReader和PHP获取大型XML文件中的子树数据 php xml
2019-04-22 14:31

回答 1 已采纳 Rather than trying to read the whole document element by element, you can with XMLReader ask it to
php 获取xml,用PHP读取XML数据
2021-03-25 08:40

马汉东的博客用PHP读取XML数据用PHP读取XML数据2009-12-18今天工作上碰到一个问题，由于我们的项目数据太少，所以需要从web search那边借调数据，他们只给我们提供了一个xml的接口。因此，我们需要把xml的数据转化成html呈现给...
PHP：从XML字符串中获取数据 php xml
2018-05-20 03:22

回答 2 已采纳 The var_dump gives you an object of type SimpleXMLElement which has a __toString method which retu
使用PHP从亚马逊MWS API获取订单数据 php xml
2018-08-18 16:30

回答 1 已采纳 You Mock service is overriding the production service instance. See the duplicate. $service = ne
使用PHP从URL获取XML数据 html php xml
2014-03-11 16:28

回答 1 已采纳 Check out $url. You have a space inbetween the '&' and 'apiKey', remove that, first off. I'm ge
php 获取xml参数,通过php来读取xml的数据
2021-03-25 11:58

夜看满天繁星的博客今天工作上碰到一个问题由于我们的项目数据太少所以需要从web search那边借调数据，他们只给我们提供了一个xml的接口。因此，我们需要把xml的数据转化成html呈现给大家。由于项目是基于php的，所以就摒弃了用js来...
如何使用php从Xml属性获取值 php xml
2017-11-09 10:03

回答 2 已采纳 As some elements include namespaces, they don't work well with the simple json encode bit. As a q
使用php获取XML数据（只是图像不是product_id） php xml
2015-12-29 13:30

回答 2 已采纳 You can do it using SimpleXMLElement $xml = simplexml_load_string($xmlString, "SimpleXMLElement")
如何使用PHP获取XML属性 php xml
2017-11-01 11:50

回答 1 已采纳 php file code remove device from $xml->device->attributes()->name; <body style="font
php xml数据,用php读取xml数据
2021-04-21 04:48

柯布西耶的博客今天工作上碰到一个问题由于我们的项目数据太少所以需要从web search那边借调数据，他们只给我们提供了一个xml的接口。因此，我们需要把xml的数据转化成html呈现给大家。由于项目是基于php的，所以就摒弃了用js来...
如何在使用php解析时从XML文件中获取链接和粗体表示法？ php xml
2017-01-25 16:09

回答 2 已采纳 You can use asXML function to ouput the way you want: foreach ($xmls->activity as $xml) {
PHP 和 XML：PHP 中的 XML 解析
2024-04-21 05:15

新华的博客 PHP 中的 XML 解析是 Web 开发中的一项关键任务，涉及从 XML 文档中提取和操作数据。SimpleXML 扩展通过提供一种面向对象的方法来访问 XML 元素，从而简化了此过程。借助 SimpleXML，开发人员可以毫不费力地导航 XML...
php如何取xml数据,用php读取xml数据
2021-04-26 11:20

weixin_39576149的博客今天工作上碰到一个问题由于我们的项目数据太少所以需要从web search那边借调数据，他们只给我们提供了一个xml的接口。因此，我们需要把xml的数据转化成html呈现给大家。由于项目是基于php的，所以就摒弃了用js来...
没有解决我的问题, 去提问

悬赏问题

¥15 求螺旋焊缝的图像处理
¥15 blast算法（相关搜索：数据库）
¥15 请问有人会紧聚焦相关的matlab知识嘛？
¥15 网络通信安全解决方案
¥50 yalmip+Gurobi
¥20 win10修改放大文本以及缩放与布局后蓝屏无法正常进入桌面
¥15 itunes恢复数据最后一步发生错误
¥15 关于#windows#的问题：2024年5月15日的win11更新后资源管理器没有地址栏了顶部的地址栏和文件搜索都消失了
¥100 H5网页如何调用微信扫一扫功能？
¥15 讲解电路图，付费求解

从外部页面获取XML数据并使用PHP解析它

1条回答 默认 最新

悬赏问题

1条回答默认最新