Traversing the XML response of the Yandex API with PHP

I am creating a metasearch engine using the Yandex API. Yandex returns its results in XML format, so I need to traverse the XML response in order to get fields such as the URL, title and description.

The XML response by Yandex is as follows: http://pastebin.com/kAVAVri9

This is how I have implemented it:

$dom5 = new DOMDocument();

if ($dom5->loadXML($site_results)) {

    $results  = $dom5->getElementsByTagName("response");
    $results1 = $results->getElementsByTagName("results");
    $results2 = $results1->getElementsByTagName("group");


    $totals["yandex"] = 1000;


    foreach ($results1 as $link) {

        $url = $link->getElementsByTagName("doc")->item(2)->nodeValue;
        ;
        $url = str_replace('http://', '', $url);
        if (substr($url, -1, 1) == '/') {
            $url = substr($url, 0, strlen($url) - 1);
        }
        $search_results[$i]["url"] = $url;

        $title                       = $link->getElementsByTagName("doc")->item(4)->nodeValue;
        $search_results[$i]["title"] = $title;
        $test                        = $link->getElementsByTagName("doc");
        $test1                       = $test->getElementsByTagName("title");
        $desc                        = $test1->getElementsByTagName("headline")->item(0)->nodeValue;
        $search_results[$i]["desc"]  = $desc;

        $search_results[$i]["engine"]   = 'yandex';
        $search_results[$i]["position"] = $i + 1;
        $i++;

    }
}

I am new to PHP, so please forgive me if I have made a stupid mistake. I am unable to retrieve the results with my implementation. Please help me find the mistake and get the necessary fields from the XML response. Thank you!

1 answer

dtbc37573 2013-03-16 00:42

The method getElementsByTagName() returns a DOMNodeList:

$results  = $dom5->getElementsByTagName("response");

The DOMNodeList does not have a method called getElementsByTagName(), but you call it:

$results1 = $results->getElementsByTagName("results");

That is what triggers the fatal error: whenever you call a method in PHP that the object does not define, you get a fatal error and your script stops running.

Do not call undefined object methods and you should be fine.
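As a minimal sketch of what that means for the DOM code in the question: iterate the DOMNodeList and call getElementsByTagName() on each DOMElement it contains. The sample XML and the helper firstText() are assumptions made for illustration; the element layout is inferred from the paths used in this thread, not taken from the real Yandex response.

```php
<?php
// Hypothetical sample; the real Yandex response may be laid out differently.
$site_results = <<<'XML'
<yandexsearch version="1.0">
  <response>
    <results>
      <grouping>
        <group>
          <doc>
            <url>http://www.example.com/</url>
            <title>Example title</title>
            <headline>Example description</headline>
          </doc>
        </group>
      </grouping>
    </results>
  </response>
</yandexsearch>
XML;

// Hypothetical helper: text of the first descendant named $tag, or '' if none.
function firstText(DOMElement $parent, string $tag): string
{
    $node = $parent->getElementsByTagName($tag)->item(0);
    return $node === null ? '' : trim($node->textContent);
}

$search_results = [];
$dom = new DOMDocument();

if ($dom->loadXML($site_results)) {
    // getElementsByTagName() searches all descendants and returns a
    // DOMNodeList; the list itself has no getElementsByTagName() method.
    foreach ($dom->getElementsByTagName('group') as $group) {
        // Each $group is a DOMElement, which does have the method.
        $url = str_replace('http://', '', firstText($group, 'url'));
        $search_results[] = [
            'url'      => rtrim($url, '/'),
            'title'    => firstText($group, 'title'),
            'desc'     => firstText($group, 'headline'),
            'engine'   => 'yandex',
            'position' => count($search_results) + 1,
        ];
    }
}

print_r($search_results);
```

Appending with `$search_results[]` also avoids the undefined `$i` counter used in the question.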

Apart from these basics, I normally suggest SimpleXML for parsing XML documents like this one. However, this XML file is a little specific, so I suggest extending SimpleXMLElement and adding the features you are likely to need, partly from regular expressions and partly from DOMDocument.

One concept you should know about when parsing these XML files is XPath. For example, to access the elements you had so many problems with above, you can write the path literally:

/*/response/results/grouping/group

In PHP with SimpleXML this looks like:

$url = 'http://pastebin.com/raw.php?i=kAVAVri9';
$xml = simplexml_load_file($url, 'MySimpleXML');
foreach ($xml->xpath('/*/response/results/grouping/group') as $link) {
    # ... operate on $link
}
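The same path also works without SimpleXML. The following sketch uses DOMXPath on a DOMDocument; the sample XML is an assumption that only mirrors the element names in the path above.

```php
<?php
// Hypothetical sample with two result groups.
$xmlString = <<<'XML'
<yandexsearch>
  <response>
    <results>
      <grouping>
        <group><doc><title>First</title></doc></group>
        <group><doc><title>Second</title></doc></group>
      </grouping>
    </results>
  </response>
</yandexsearch>
XML;

$dom = new DOMDocument();
$dom->loadXML($xmlString);

$xpath  = new DOMXPath($dom);
$titles = [];

// query() returns a DOMNodeList of the matching <group> elements.
foreach ($xpath->query('/*/response/results/grouping/group') as $group) {
    // evaluate() with string() resolves a path relative to the context
    // node and returns a plain string ('' if nothing matches).
    $titles[] = $xpath->evaluate('string(doc/title)', $group);
}

print_r($titles);
```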

A larger example:

// Load the remote paste, or point $source at a local copy as done here:
// $source = 'http://pastebin.com/raw.php?i=kAVAVri9';
$source = '../data/yandex.xml';
$xml = simplexml_load_file($source, 'MySimpleXML');
foreach ($xml->xpath('/*/response/results/grouping/group') as $link) {
    // Strip the scheme and any trailing slashes from the URL.
    $url      = $link->doc->url->str()->preg('~^https?://(.*?)/*$~u', '$1');
    $title    = $link->doc->title->text();
    $headline = $link->doc->headline->text();
    printf("<%s> %s\n%s\n\n", $url, $title, wordwrap($headline));
}

And its example output:

<www.facebook.com> " Facebook" - a social networking service
Allows users to find and communicate with friends, classmates and
colleagues, share thoughts, photos and videos, and join various groups.

<en.wikipedia.org/wiki/Facebook>  Facebook - Wikipedia, the free encyclopedia
 Facebook is a social networking service launched in February 2004, owned
and operated by Facebook, Inc. As of September 2012, Facebook has over one
billion active users, more than half of them using Facebook on a mobile
device.

<mashable.com/category/facebook>  Facebook 

...

The PHP code example above needs some more code to work, because it extends SimpleXMLElement for ease of use. This is done with the following code:

class MySimpleXML extends SimpleXMLElement
{
    // Text content of the element, including text inside child elements,
    // with runs of whitespace collapsed.
    public function text()
    {
        $string = null === $this[0] ? ''
            : (dom_import_simplexml($this)->textContent);

        return $this->str($string)->normalizeWS();
    }

    // Wrap the given string (or this element) in a MyString.
    public function str($string = null)
    {
        return new MyString($string ?: $this);
    }
}

class MyString
{
    private $string;

    public function __construct($string)
    {
        $this->string = $string;
    }

    // Regex replace; returns a new MyString so that calls can be chained.
    public function preg($pattern, $replacement)
    {
        return new self(preg_replace($pattern, $replacement, $this));
    }

    // Collapse every run of whitespace into a single space.
    public function normalizeWS()
    {
        return $this->preg('~\s+~', ' ');
    }

    public function __toString()
    {
        return (string) $this->string;
    }
}

This might all be a bit much for a beginner; check out the PHP manual for SimpleXML and the other functions used in the code example.
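If subclassing feels like too much at first, the same fields can be pulled out with plain SimpleXML and preg_replace(). This is only a sketch under assumptions: the sample XML (including the hlword child element) and the helper nodeText() are made up for illustration and are not part of the code above.

```php
<?php
// Hypothetical sample; <hlword> stands in for markup nested inside a title.
$xmlString = <<<'XML'
<yandexsearch>
  <response>
    <results>
      <grouping>
        <group>
          <doc>
            <url>https://en.example.org/wiki/Page/</url>
            <title><hlword>Example</hlword> -   a   page</title>
          </doc>
        </group>
      </grouping>
    </results>
  </response>
</yandexsearch>
XML;

// Hypothetical helper: whitespace-normalized text of the first node that
// matches $path relative to $context, or '' if nothing matches.
function nodeText(SimpleXMLElement $context, string $path): string
{
    $nodes = $context->xpath($path);
    if (empty($nodes)) {
        return '';
    }
    // textContent also includes the text of nested elements.
    $text = dom_import_simplexml($nodes[0])->textContent;
    return trim(preg_replace('~\s+~', ' ', $text));
}

$rows = [];
$xml  = simplexml_load_string($xmlString);

foreach ($xml->xpath('/*/response/results/grouping/group') as $group) {
    $rows[] = [
        // Same pattern as above: drop the scheme and trailing slashes.
        'url'      => preg_replace('~^https?://(.*?)/*$~u', '$1', nodeText($group, 'doc/url')),
        'title'    => nodeText($group, 'doc/title'),
        'headline' => nodeText($group, 'doc/headline'),
    ];
}

print_r($rows);
```

Because nodeText() returns an empty string when nothing matches, a group without a headline does not raise an error.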

This answer was selected as the best answer by the asker.
