如何将PHP转换为XML输出

I have a php code. this code outputs an HTML. I need to modify this code to output an XML. ANy ideas as to how shall I go about doing this. Is there any XML library available that directly does the job or do i have to manually create each node.?

My php code is:

<!DOCTYPE html>
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />

<style>
a {text-decoration:none; color:black;}
</style>
</head>

<body>


<?php

$a=$_POST["title"];
$b=$_POST["name"];

$c="http://www.imdb.com/search/title?title=".urlencode($a)."&title_type=".urlencode($b);
$d=file_get_contents($c);


preg_match_all('/<div id="main">
(No results.)/', $d,$nore);


preg_match_all('#<img src="(.*)"#Us', $d, $img);//image

preg_match_all('/<a\s*href="\/title\/tt[0-9]*\/">((?:[a-z]*(?:&*[.]*)?\s*-*[a-z]*[0-9]*[^<])+)/i',$d,$tit);  //title 

preg_match_all('/<span\sclass="year_type">\s*\(([\d]*)/',$d,$ye); //movie year working fine

preg_match_all('#<span class="credit">
    Dir: (.*)
(?:    With:)?#Us',$d,$dir); //director 

preg_match_all('/<span class="rating-rating"><span class="value">([\w]*.[\w]*)/i',$d,$rat); //rating 

preg_match_all('/<a\shref="(\/title\/tt[0-9]*\/)"\s*[title]+/i',$d,$lin); //link 




for($i=0;$i<5;$i++)
{ 
  if (@$rat[1][$i]=="-")
  $rat[1][$i]="N/A";
}

for($i=0;$i<5;$i++)
{ 
 if(@$dir[1][$i]=="")
 $dir[1][$i]="N/A";
}




if(count($tit[1])>5)
$cnt=5;
else
$cnt=count($tit[1]);



 echo"<center><b>Search Result</b></center>";
echo "<br/>";
echo "<center><b>\"$a\"of type\"$b\":</b></center>";
echo"<br/>";

if(@$nore[1][0]=="No results.")
echo "<center><b>No movies found!</b></center>";
else
{
echo "<center><table border=1><tr><td><center>Image</center></td><td><center>Title</center></td><td><center>Year</center></td><td><center>Director</center></td><td><center>Rating(10)</center></td><td><center>Link to Movie</center></td></tr>";
  for($j=0;$j<$cnt;$j++)
          {
            echo "<tr>";
            echo "<td>".@$img[0][$j+2]."</td>";
            echo "<td><center>".@$tit[1][$j]."</center></td>";
            echo "<td><center>".@$ye[1][$j]."</center></td>";
            echo "<td><center>".@$dir[1][$j]."</center></td>";
            echo "<td><center>".@$rat[1][$j]."</center></td>";
            echo '<td><center><a style="text-decoration:underline; color:blue;" href="http://www.imdb.com'.@$lin[1][$j].'">Details</a></center></td>';
            echo "</tr>";
          }




echo "</table></center>";
}               

?>

</body>
</html>

Expected XML output:

<result cover="http://ia.mediaimdb.com/images      
/M/MV5BMjMyOTM4MDMxNV5BMl5BanBnXkFtZTcwNjIyNzExOA@@._V1._SX54_
CR0,0,54,74_.jpg" title="The Amazing Spider-Man(2012)"year="2012"
director="Marc Webb" rating="7.5"
details="http://www.imdb.com/title/tt0948470"/>

<result cover="http://ia.mediaimdb.
com/images/M/MV5BMzk3MTE5MDU5NV5BMl5BanBnXkFtZTYwMjY3NTY3._V1._SX54_CR0,
0,54,74_.jpg" title="Spider-Man(2002)" year="2002"director="Sam Raimi"
rating="7.3" details="http://www.imdb.com/title/tt0145487"/>

<result cover="http://ia.mediaimdb.
com/images/M/MV5BODUwMDc5Mzc5M15BMl5BanBnXkFtZTcwNDgzOTY0MQ@@._V1._SX54_
CR0,0,54,74_.jpg" title="Spider-Man 3 (2007)" year="2007" director="Sam
Raimi" rating="6.3" details="http://www.imdb.com/title/tt0413300"/>

<result cover="http://i.mediaimdb.
com/images/SF1f0a42ee1aa08d477a576fbbf7562eed/realm/feature.gif" title="
The Amazing Spider-Man 2 (2014)" year="2014" director="Sam Raimi"
rating="6.3" details="http://www.imdb.com/title/tt1872181"/>

<result cover="http://ia.mediaimdb.
com/images/M/MV5BMjE1ODcyODYxMl5BMl5BanBnXkFtZTcwNjA1NDE3MQ@@._V1._SX54_
CR0,0,54,74_.jpg" title="Spider-Man 2 (2004)" year="2004" director="Sam
Raimi" rating="7.5" details="http://www.imdb.com/title/tt0316654"/>
</results>

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

dso0139 2012-11-11 09:40

关注

First thing, you're parsing your html result with regex which is inefficient, unnecessary, and... well, you're answering to the cthulhu call!

Second, parsing IMDB HTML to retrieve results, although valid, might be unnecessary. There are some neat 3rd party APIs that do the job for you, like http://imdbapi.org

If you don't want to use any 3rd party API though, IMHO, you should, instead, parse the HTML using a DOM parser/manipulator, like DOMDocument, for instance, which is safer, better and, at the same time, can solve your HTML to XML problem.

Here's the bit you asked (build XML and HTML from results):

function resultsToHTML($results)
{
    $doc = new DOMDocumet();
    $table = $doc->createElement('table');

    foreach ($results as $r) {
        $row = $doc->createElement('tr');
        $doc->appendChild($row);
        $title = $doc->createElement('td', $r['title']);
        $row->appendChild($title);
        $year = $doc->createElement('td', $r['year']);
        $row->appendChild($year);
        $rating = $doc->createElement('td', $r['rating']);
        $row->appendChild($rating);

        $imgTD = $doc->createElement('td');

        //Creating a img tag (use only on)
        $img = $doc->createElement('img');
        $img->setAttribute('src', $r['img_src']);
        $imgTD->appendChild($img);
        $row->appendChild($imgTD);

        $imgTD = $doc->createElement('td');

        //Importing directly from the old document
        $fauxDoc = new DOMDocument();
        $fauxDoc->loadXML($r['img']);
        $img = $fauxDoc->getElementsByTagName('img')->index(0);
        $importedImg = $doc->importNode('$img', true);
        $imgTD->appendChild($importedImg);
        $row->appendChild($imgTD);
    }
    return $doc;
}

function resultsToXML($results)
{
    $doc = new DOMDocumet();
    $root = $doc->createElement('results');
    foreach ($results as $r) {
        $element = $root->createElement('result');
        $element->setAttribute('cover', $r['img_src']);
        $element->setAttribute('title', $r['title']);
        $element->setAttribute('year', $r['year']);
        $element->setAttribute('rating', $r['rating']);
        $root->appendChild($element);
    }
    $doc->appendChild($root);
    return $doc;
}

to print them you just need to

$xml = resultsToXML($results);
print $xml->saveXML();

Same thing with html

Here's a refactor of your code with DOMDocument, based on your post:

<?php
//Mock IMDB Link
$a = 'The Amazing Spider-Man';
$b = 'title';
$c = "http://www.imdb.com/search/title?title=".urlencode($a)."&title_type=".urlencode($b);

// HTML might be malformed so we want DOMDocument to be quiet
libxml_use_internal_errors(true);
//Initialize DOMDocument parser
$doc = new DOMDocument();

//Load previously downloaded document
$doc->loadHTMLFile($c);

//initialize array to store results
$results = array();

// get table of results and extract a list of rows
$listOfTables = $doc->getElementsByTagName('table');
$rows = getResultRows($listOfTables);

$i = 0;
//loop through all rows to retrieve information
foreach ($rows as $row) {
    if ($title = getTitle($row)) {
        $results[$i]['title'] = $title;
    }
    if (!is_null($year = getYear($row)) && $year) {
        $results[$i]['year'] = $year;
    }
    if (!is_null($rating = getRating($row)) && $rating) {
        $results[$i]['rating'] = $rating;
    }
    if ($img = getImage($row)) {
        $results[$i]['img'] = $img;
    }
    if ($src = getImageSrc($row)) {
        $results[$i]['img_src'] = $src;
    }
    ++$i;
}

//the first result can be a false positive due to the
// results' table header, so we remove it
if (isset($results[0])) {
    array_shift($results);
}

FUNCTIONS

function getResultRows($listOfTables)
{
    foreach ($listOfTables as $table) {
        if ($table->getAttribute('class') === 'results') {
            return $table->getElementsByTagName('tr');
        }
    }
}

function getImageSrc($row)
{
    $img = $row->getElementsByTagName('img')->item(0);
    if (!is_null($img)) {
        return $img->getAttribute('src');
    } else {
        return false;
    }
}

function getImage($row, $doc)
{
    $img = $row->getElementsByTagName('img')->item(0);
    if (!is_null($img)) {
        return $doc->saveHTML($img);
    } else {
        return false;
    }
}


function getTitle($row)
{
    $tdInfo = getTDInfo($row->getElementsByTagName('td'));
    if (!is_null($tdInfo) && !is_null($as = $tdInfo->getElementsByTagName('a'))) {
        return $as->item(0)->nodeValue;
    } else {
        return false;
    }
}


function getYear($row)
{
    $tdInfo = getTDInfo($row->getElementsByTagName('td'));
    if (!is_null($tdInfo) && !is_null($spans = $tdInfo->getElementsByTagName('span'))) {
        foreach ($spans as $span) {
            if ($span->getAttribute('class') === 'year_type') {
                return str_replace(')', '', str_replace('(', '', $span->nodeValue));
            }
        }
    }
}

function getRating($row)
{
    $tdInfo = getTDInfo($row->getElementsByTagName('td'));
    if (!is_null($tdInfo) && !is_null($spans = $tdInfo->getElementsByTagName('span'))) {
        foreach ($spans as $span) {
            if ($span->getAttribute('class') === 'rating-rating') {
                return $span->nodeValue;
            }
        }
    }
}


function getTDInfo($tds)
{
    foreach ($tds as $td) {
        if ($td->getAttribute('class') == 'title') {
            return $td;
        }
    }
}

本回答被题主选为最佳回答 , 对您是否有帮助呢?

报告相同问题？

关注问题

PHP将Array转换为XML php xml
2016-06-03 15:21

回答 2 已采纳 Almost had it! You simply had to pass the $subnode, not $xml into the recursive call of function:
PHP如何将数组转换为xml php xml
2014-12-23 06:34

回答 3 已采纳 Read through all the answers at the link you provided, there are several solutions suggested there
将SOAP请求从XML转换为PHP php xml
2014-08-20 10:38

回答 1 已采纳 After headaches over this, I finally found a working solution, might not be the best but it works
php实现将数组转换为XML的方法
2020-10-24 14:34

在PHP编程中，将数组转换为XML格式是一项非常实用的技能，尤其是在需要将结构化数据进行网络传输或者进行数据交换时。本文介绍了使用PHP语言将数组转换为XML的方法，并通过实例分析了PHP操作数组和XML格式文件的技巧...
PHP - 将动态XML对象转换为HTML列表 html php xml
2017-11-07 12:30

回答 3 已采纳 I'm not entirely sure I understand exactly what you are after. Perhaps this does it for you or is
将XML nodeValue转换为PHP / HTML字符串 php xml
2016-12-08 10:01

回答 1 已采纳 The problem you are getting arises from the fact that your code adds spaces and a plus sign to the
将XML转换为PHP数组会导致转换后丢失属性数据 php xml
2018-04-19 13:26

回答 1 已采纳 The problem is (must admit I don't totally understand the ins and outs of the code)... In this co
php实现xml转换数组的方法示例
2020-10-20 13:06

在示例的最后，通过调用`xml2array`函数并传入之前得到的XML对象，将XML数据转换成数组，并通过`var_dump`函数输出转换结果，以供开发者查看数据结构是否正确转换。在处理XML数据时，我们还会遇到需要对XML进行...
XML内容转换为PHP变量 php xml
2015-01-21 13:51

回答 1 已采纳 That is not valid XML, if you make the XML valid it will work So change the file to <stores
将SOAP XML响应转换为PHP对象或数组 php xml
2015-06-23 17:42

回答 1 已采纳 The location of the WSDL-file is going to depend on the SOAP API you're calling. Check their docum
如何将json响应数据转换为xml [duplicate] json php xml
2018-05-31 08:37

回答 1 已采纳 function array2xml($array, $xml = false){ if($xml === false){ $xml = new SimpleXMLEle
PHP处理数组和XML之间的互相转换
2020-10-22 05:21

转换可以简单地通过遍历数组，将数组的key/value对转换为XML节点，并通过echo输出XML字符串。更复杂一点的转换，可以使用DOMDocument类来创建一个XML结构，然后递归地将数组的值添加为相应的XML节点。这种通过DOM...
PHP读取并输出XML文件数据的简单实现方法
2020-10-18 21:02

本文将介绍如何使用PHP读取并输出XML文件中的数据，涵盖了载入、遍历、读取和输出XML数据的技术和操作技巧。首先，我们需要有一个XML文件，该文件中存储了我们想要读取的数据。XML文件通常是以一种层级结构的方式...
APSArrayToXML.class.php:将数组转换为XMLString或XML对象
2021-05-20 14:49

2. `arrayToXML()`: 这个方法是类的核心，接受一个PHP数组作为参数，然后将其转换为XML字符串。它可能会采用递归的方式来处理嵌套的数组，将每个键值对转换为XML元素。 3. `createXMLElement()`: 这个辅助方法可能...
PHP实现的数组和XML文件相互转换功能示例
2020-10-18 16:22

在PHP中实现数组与XML文件的相互转换是一个比较实用的技能，尤其在处理与外部API交互时，例如微信支付操作返回数据为XML格式，这就需要我们将其转换成PHP数组以便于处理。首先，我们来看如何将XML转换为数组。实现...
没有解决我的问题, 去提问

悬赏问题

¥15 如何让企业微信机器人实现消息汇总整合
¥50 关于#ui#的问题：做yolov8的ui界面出现的问题
¥15 如何用Python爬取各高校教师公开的教育和工作经历
¥15 TLE9879QXA40 电机驱动
¥20 对于工程问题的非线性数学模型进行线性化
¥15 Mirare PLUS 进行密钥认证？（详解）
¥15 物体双站RCS和其组成阵列后的双站RCS关系验证
¥20 想用ollama做一个自己的AI数据库
¥15 关于qualoth编辑及缝合服装领子的问题解决方案探寻
¥15 请问怎么才能复现这样的图呀

码龄粉丝数原力等级 --

如何将PHP转换为XML输出

1条回答默认最新

码龄粉丝数原力等级 --

悬赏问题

如何将PHP转换为XML输出

1条回答 默认 最新

悬赏问题

1条回答默认最新