Load an external XML file and get the HTTP header information in one call

I have a PHP file that grabs an XML file from another site and then stores that information in my database.

The problem I am having is that their site only allows 360 requests in any one-hour period, so I am trying to code it to check the header information while grabbing the file.

I have it checking the status of the page using:

$requesttest = 'http://www.footballwebpages.co.uk/teams.xml';
if($requesttest == NULL) return false;  
$ch = curl_init($requesttest);  
curl_setopt($ch, CURLOPT_TIMEOUT, 5);  
curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, 5);  
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);  
$data = curl_exec($ch);  
$httpcode = curl_getinfo($ch, CURLINFO_HTTP_CODE);  
curl_close($ch); 

if($httpcode == 429){
    return 'Try again later, too many requests received.';
} else if($httpcode>=200 && $httpcode<300){
    /* run code to grab xml file */
    $comps = array (    0 => 1, /* Premier_League */
                    1 => 2 /* Championship */ 
                    );
    $comps_total = count($comps);
    $comps_no = 0;

    while ($comps_no < $comps_total) {
        $url = 'http://www.footballwebpages.co.uk/teams.xml?comp=' . $comps[$comps_no];
        $full_list = simplexml_load_file($url);
        /* Code for grabbing and storing info from XML */
        $comps_no++;
    }
} else {
    return 'Football Web Pages Offline';
}

At the moment, it checks the main 'teams' page to see whether the request limit has been reached, and then grabs the XML for each competition in the set. The issue is that if only one request is available at the first check, the next stage will fail. How can I check the header info while loading the XML file, without having to call the page once to check the header and then again to grab the XML file?

Basically, I want to load the XML file in one call if the status code is between 200 and 300, so as not to waste two requests to grab one XML page.


1 answer

douju2014 2016-01-24 13:02

You could perhaps use an approach like the following. Drop the first call to the base URL, as it is redundant, and instead use the return value from the function to decide whether further processing should be done:

<?php
    /* utility function to get data and return an object */
    function getxml( $comp=1 ){
        global $ch;
        global $url;

        curl_setopt( $ch, CURLOPT_URL, $url . '?comp=' . $comp );
        $data = curl_exec( $ch );
        $status = curl_getinfo( $ch, CURLINFO_HTTP_CODE ); 

        return (object)array(
            'xmldata'   =>  $data,
            'status'    =>  $status
        );
    }
    /* All the comps available - more than specified! */
    $comps=array( 
        'Barclays_Premier_League' => 1,
        'Sky_Bet_Championship' => 2,
        'Sky_Bet_League_One' => 3,
        'Sky_Bet_League_Two' => 4,
        'National_League' => 5,
        'National_League_North' => 6,
        'National_League_South' => 7,
        'Evo-Stik_Southern_League_Premier_Division' => 8,
        'Evo-Stik_Southern_League_Division_One_Central' => 9,
        'Evo-Stik_Southern_League_Division_One_South_&_West' => 10,
        'Ryman_League_Premier_Division' => 11,
        'Ryman_League_Division_One_North' => 12,
        'Ryman_League_Division_One_South' => 13,
        'Evo-Stik_League_Premier_Division' => 14,
        'Evo-Stik_League_Division_One_North' => 15,
        'Evo-Stik_League_Division_One_South' => 16,
        'Scottish_Premiership' => 17,
        'Scottish_Championship' => 18,
        'Scottish_League_One' => 19,
        'Scottish_League_Two' => 20
    );
    /* only interested in first two */
    $comps=array_slice( $comps, 0, 2, true );


    /* I don't use SimpleXML - DOMDocument is used here to process the xml data */
    $dom=new DOMDocument;

    /* base url */
    $url= 'http://www.footballwebpages.co.uk/teams.xml';

    /* 
        initialise curl request object but 
        set the url for each $comp in the function 
    */
    $ch = curl_init();
    curl_setopt( $ch, CURLOPT_TIMEOUT, 5 );  
    curl_setopt( $ch, CURLOPT_CONNECTTIMEOUT, 5 );  
    curl_setopt( $ch, CURLOPT_RETURNTRANSFER, true );   

    /* 
    If the request limit has already been reached when the loop starts, 
    the 429 case breaks out of the entire loop -
    thus using only 1 request
    */
    foreach( $comps as $key => $comp ){
        $xml=getxml( $comp );
        switch( $xml->status ){
            case 429: echo 'Try again later, too many requests received.'; break 2;
            case 200:
                /* if everything is ok, process $xml */
                $dom->loadXML( $xml->xmldata );


                /* example of processing xml data */
                echo '
                <h1>'.$dom->getElementsByTagName('competition')->item(0)->nodeValue.'</h1>
                    <ul>';

                $col=$dom->getElementsByTagName('team');
                if( $col ){
                    foreach( $col as $team ) echo '<li>'.$team->childNodes->item(1)->nodeValue.', '.$team->childNodes->item(3)->nodeValue.'</li>';
                }
                echo '
                    </ul>';
            break;
            default:/* If no response or an unknown response exit */
                echo 'Football Web Pages Offline';
            break 2;
        }
    }

    curl_close( $ch ); 
    $dom=$ch=$comps=null;
?>
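
As a variation on the above, here is a minimal sketch that keeps the single-request idea but avoids the globals and accepts any 2xx status rather than only 200. The helper names fetch_once(), is_success() and parse_if_ok() are hypothetical (they are not part of the original code), and the sketch assumes the curl and SimpleXML extensions are enabled. It uses simplexml_load_string() on the body that curl already downloaded, so no second request is made.

```php
<?php
/**
 * Fetch a URL with a single request and return the status code and body.
 * fetch_once() is a hypothetical helper name used for illustration.
 */
function fetch_once(string $url, int $timeout = 5): array
{
    $ch = curl_init($url);
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true,   // return the body instead of printing it
        CURLOPT_TIMEOUT        => $timeout,
        CURLOPT_CONNECTTIMEOUT => $timeout,
    ]);
    $body   = curl_exec($ch);                        // false on transport failure
    $status = curl_getinfo($ch, CURLINFO_HTTP_CODE); // 0 when no response arrived
    // The handle is released when $ch goes out of scope.

    return [
        'status' => (int) $status,
        'body'   => $body === false ? null : $body,
    ];
}

/** True for any 2xx status code. */
function is_success(int $status): bool
{
    return $status >= 200 && $status < 300;
}

/**
 * Parse the already-downloaded body, but only for a 2xx response.
 * Returns null for any other status, a missing body, or malformed XML.
 */
function parse_if_ok(array $response): ?SimpleXMLElement
{
    if (!is_success($response['status']) || $response['body'] === null) {
        return null;
    }
    $previous = libxml_use_internal_errors(true); // keep parse warnings quiet
    $xml = simplexml_load_string($response['body']);
    libxml_use_internal_errors($previous);

    return $xml === false ? null : $xml;
}
```

In the loop over $comps, you would call fetch_once() with the base URL plus '?comp=' and the competition id, stop the loop as soon as the returned status is 429, and hand every other response to parse_if_ok(). Each competition then costs exactly one request.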
