XPath does not return all content after an element

I'm retrieving a ul > li list of ingredients from my site, then using foreach to loop through each li.

Each <li></li> contains information in the following format: 1-2 tablespoons <a href="link">coconut oil</a> (to taste). Not every item contains a hyperlink; it varies.

All I'm trying to do is break up the data so I can put it into an array like so:

array(
    0 => array(
        'amount' => '2 ounces',
        'ingredients' => 'pre-cooked chicken'
    ),
    1 => array(
        'amount' => '1-2 tablespoons',
        'ingredients' => 'coconut oil (to taste)'
    )
);

All while keeping the HTML <a> link in the coconut oil part.

Here is the code that I'm using.

// $string is an array with the li content
foreach ($string as $data) {
    $try = new \DOMDocument;
    $try->loadHTML($data);
    $find = new \DOMXPath($try);

    // from this point on is where I'm having problems
    $x = $find->query('//li');
    foreach ($x as $node) {
        // pass true so print_r() returns the dump instead of printing it directly
        echo '<pre>', print_r($node, true), '</pre>';
    }
}

The print_r($node) call outputs the following DOMElement objects (along with other empty keys such as parentNode, childNodes, firstChild, previousSibling):

DOMElement Object (
    [tagName] => li
    [schemaTypeInfo] => 
    [nodeName] => li
    [nodeValue] => 2 ounces pre-cooked chicken
    [nodeType] => 1
    [attributes] => (object value omitted)
    [ownerDocument] => (object value omitted)
    [localName] => li
    [textContent] => 2 ounces pre-cooked chicken
)
DOMElement Object (
    [tagName] => li
    [schemaTypeInfo] => 
    [nodeName] => li
    [nodeValue] => 1-2 tablespoons coconut oil (to taste)
    [nodeType] => 1
    [attributes] => (object value omitted)
    [ownerDocument] => (object value omitted)
    [localName] => li
    [textContent] => 1-2 tablespoons coconut oil (to taste)
)

I thought it would be best to break up the information: with one query I get all of the data inside the strong tag. The issue I'm having is getting all of the content after the strong tag.

Here I try to get all of the content after the strong tag:

$list = $find->query('//strong/following-sibling::text()');
foreach($list as $data){
    $i[] = $try->saveHTML($data);
}

If I print_r($i) I get the following:

Array
(
    [0] =>  pre-cooked chicken
    [1] =>  
    [2] =>  (to taste)
)

But if I change the query to $list = $find->query('//strong/following-sibling::*'), all I get is the following, which is just the hyperlink:

Array
(
    [0] => coconut oil
)
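Both results above are consistent with how XPath node tests work: text() matches only text nodes, * matches only elements, and node() matches both kinds. A minimal sketch, assuming the second input string from the update below wrapped in an <li> (the variable names are illustrative, not from the original code), compares the three:

```php
<?php
// Hypothetical sample: one ingredient line, wrapped in <li> for parsing.
$html = '<li><strong>1-2 tablespoons</strong> <a href="/link">coconut oil</a> (to taste)</li>';

libxml_use_internal_errors(true); // keep parser notices out of the output
$doc = new DOMDocument();
$doc->loadHTML($html);
$xpath = new DOMXPath($doc);

$parts = array();
foreach (array('text()', '*', 'node()') as $test) {
    $out = '';
    foreach ($xpath->query('//strong/following-sibling::' . $test) as $node) {
        // saveHTML($node) serializes the node itself, keeping any markup.
        $out .= $doc->saveHTML($node);
    }
    $parts[$test] = trim($out);
}
print_r($parts);
```

Only the node() variant returns the hyperlink together with the surrounding text, which is what the ingredients value needs.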

Update:

Input array:

Array (
    [0] => <strong>2 ounces</strong> pre-cooked chicken
    [1] => <strong>1-2 tablespoons</strong> <a href="/link">coconut oil</a> (to taste)
)

Expected output:

array(
    0 => array(
        'amount' => '2 ounces',
        'ingredients' => 'pre-cooked chicken'
    ),
    1 => array(
        'amount' => '1-2 tablespoons',
        'ingredients' => '<a href="/link">coconut oil</a> (to taste)'
    )
);
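One way to get from the input array to this output with DOM methods alone might look like the sketch below. The helper name splitIngredient, the <div> wrapper, and the fallback for lines without a <strong> tag are assumptions made for illustration; they are not part of the original code.

```php
<?php
// Hypothetical helper: splits one ingredient line into the <strong> amount
// and everything that follows it, keeping any markup in the remainder.
function splitIngredient(string $html): array
{
    libxml_use_internal_errors(true); // tolerate imperfect markup quietly
    $doc = new DOMDocument();
    // Wrap the fragment so that all of its nodes share one parent element.
    $doc->loadHTML('<div>' . $html . '</div>');
    $xpath = new DOMXPath($doc);

    $strong = $xpath->query('//strong')->item(0);
    if ($strong === null) {
        // Assumed fallback: no amount found, return the whole line as-is.
        return array('amount' => '', 'ingredients' => trim($html));
    }

    $ingredients = '';
    // node() selects text nodes and elements alike, in document order.
    foreach ($xpath->query('following-sibling::node()', $strong) as $node) {
        $ingredients .= $doc->saveHTML($node);
    }

    return array(
        'amount'      => trim($strong->textContent),
        'ingredients' => trim($ingredients),
    );
}

$input = array(
    '<strong>2 ounces</strong> pre-cooked chicken',
    '<strong>1-2 tablespoons</strong> <a href="/link">coconut oil</a> (to taste)',
);
$result = array_map('splitIngredient', $input);
print_r($result);
```

Passing $strong as the context node keeps the second query relative to that element, so the same helper would still work if the document contained more than one <strong>.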

doufei1893 2017-09-08 09:09
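Are you expecting something like this? Hope this helps. Here we use preg_match to capture the text inside <strong> as the amount and everything after </strong> as the ingredients.

<?php
ini_set('display_errors', 1);

$result = array();
// Input strings as given in the question's update
$array = array(
    0 => '<strong>2 ounces</strong> pre-cooked chicken',
    1 => '<strong>1-2 tablespoons</strong> <a href="/link">coconut oil</a> (to taste)'
);

foreach ($array as $data) {
    // Group 1: text inside <strong>; group 2: everything after </strong>
    if (preg_match('/<strong>(.*?)<\/strong>\s*(.*)/s', $data, $matches)) {
        $result[] = array(
            "amount" => $matches[1],
            "ingredients" => $matches[2]
        );
    }
}
print_r($result);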
This answer was selected by the asker as the best answer.
