doucang6739
2018-08-16 20:11
浏览 164
已采纳

通过xpath中的子li节点进行迭代

I have the following HTML:

$page = '<html>
<head>
<title>Page</title>
</head>

<body>

<div>
    <div>
        <div>
        </div>
        <div class="this one">
            <h2>Ignore</h2>
            <p>Text</p>

            <h2>Header 1</h2>
            <ul><li>List Value 1</li></ul>

            <h2>Header 2</h2>
            <ul><li>List Value 2</li></ul>

            <h2>Ignore</h2>
            <ul><li>List Value 3</li></ul>

            <h2>Header 3</h2>
            <ul>
                <li>List Value A</li>
                <li>List Value B</li>
                <li>List Value C</li>
            </ul>

            <h2>Ignore</h2>
            <p>Text</p>
        </div>
    </div>
</div>

</body>
</html>';

I am trying to get the li list for Header 3 only and the following code doesn't work;

$doc->loadHTML($page);   
$xpath = new DomXPath($doc);

$nodes = $xpath->query("//div[@class='this one']/h2[.='Header 3']/ul/li");
foreach($nodes as $node) { 
    echo $node->nodeValue . "<br />";
}

I am expecting the output:

List Value A<br />
List Value B<br />
List Value C<br />

图片转代码服务由CSDN问答提供 功能建议

我有以下HTML:

  $ page ='  &lt; html&gt; 
&lt; head&gt; 
&lt; title&gt; Page&lt; / title&gt; 
&lt; / head&gt; 
 
&lt; body&gt; 
 
&lt; div&gt; 
&lt; div&gt; 
&lt; div&gt  ; 
&lt; / div&gt; 
&lt; div class =“this one”&gt; 
&lt; h2&gt;忽略&lt; / h2&gt; 
&lt; p&gt;文字&lt; / p&gt; 
 
&lt; h2&gt;  ;标题1&lt; / h2&gt; 
&lt; ul&gt;&lt; li&gt;列表值1&lt; / li&gt;&lt; / ul&gt; 
 
&lt; h2&gt;标题2&lt; / h2&gt; 
&lt; ul&gt;&lt;  ; li&gt;列表值2&lt; / li&gt;&lt; / ul&gt; 
 
&lt; h2&gt;忽略&lt; / h2&gt; 
&lt; ul&gt;&lt; li&gt;列表值3&lt; / li&gt;&lt; / ul&gt;  
 
&lt; h2&gt;标题3&lt; / h2&gt; 
&lt; ul&gt; 
&lt; li&gt;列表值A&lt; / li&gt; 
&lt; li&gt;列表值B&lt; / li&gt; 
&lt;  li&gt;列表值C&lt; / li&gt; 
&lt; / ul&gt; 
 
&lt; h2&gt; Ignor  e&lt; / h2&gt; 
&lt; p&gt;文字&lt; / p&gt; 
&lt; / div&gt; 
&lt; / div&gt; 
&lt; / div&gt; 
 
&lt; / body&gt; 
&lt; / html&gt;  '; 
   
 
 

我正在尝试仅获取 Header 3 li 列表,以下代码没有 工作;

  $ doc-&gt; loadHTML($ page);  
 $ xpath = new DomXPath($ doc); 
 
 $ nodes = $ xpath-&gt; query(“// div [@ class ='this one'] / h2 [。='Header 3'] /  ul / li“); 
foreach($ nodes as $ node){
 echo $ node-&gt; nodeValue。  “&lt; br /&gt;”; 
} 
   
 
 

我期待输出:

  List 值A&lt; br /&gt; 
列出值B&lt; br /&gt; 
列出值C&lt; br /&gt; 
   
 
  • 写回答
  • 关注问题
  • 收藏
  • 邀请回答

1条回答 默认 最新

  • dtz46697 2018-08-16 20:19
    已采纳

    This is the expression that you want:

    //div[@class = 'this one']/h2[text() = 'Header 3']/following-sibling::ul[1]/li
    

    Broken down a bit:

    • //div[@class = 'this one'] - Match all <div>s in the document with the specified class attribute value

    • …/h2[text() = 'Header 3'] - Match all <h2>s that are children of those <div>s that have the specified text content

    • …/following-sibling::ul - Use the following-sibling axis to match <ul>s that appear after the <h2>s

    • …[1] - Match only the first <ul> that is a sibling of the matched <h2> (… remembering that indexes are 1-based in XPath expressions)

    • …/li - And match all of the <li>s which are children of that <ul>

    已采纳该答案
    打赏 评论

相关推荐 更多相似问题