dongyuan7981 2015-03-23 23:01

DOMCrawler: find a tag by its inner HTML text

I'm trying to use Goutte to scrape a web page, and I can't find a DOMCrawler method that searches by the actual text of an element. Say there's a td with no class or ID. I need to find the td whose text is, for example, "Title" and then get that td's next sibling.

1 answer

  • dongyu1614 2015-03-24 02:48

    First, I would refer you to the DomCrawler documentation on Node Filtering, specifically the Crawler::reduce() method, and to the section on Accessing Node Values.
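    As a rough sketch of the reduce() approach: filter all td nodes down to the one whose text matches, then take its first following sibling with nextAll(). The markup and the "Title" label are made up for illustration, and the snippet assumes symfony/dom-crawler and symfony/css-selector are installed through Composer. With Goutte, the $crawler returned by the client's request() call would take the place of the one built here.

    ```php
    <?php
    // Assumption: Composer autoloader with symfony/dom-crawler and symfony/css-selector.
    require __DIR__ . '/vendor/autoload.php';

    use Symfony\Component\DomCrawler\Crawler;

    // Hypothetical markup standing in for the scraped page.
    $html = '<table>'
          . '<tr><td>Title</td><td>Example Book</td></tr>'
          . '<tr><td>Author</td><td>Jane Doe</td></tr>'
          . '</table>';

    $crawler = new Crawler($html);

    // reduce() drops every node for which the closure returns false,
    // so only <td> nodes whose trimmed text is exactly "Title" remain.
    $label = $crawler->filter('td')->reduce(function (Crawler $node, $i) {
        return trim($node->text()) === 'Title';
    });

    // nextAll() yields the following sibling elements; first() is the adjacent cell.
    // The count() guard avoids an exception when no label cell was found.
    $value = $label->count() > 0
        ? trim($label->nextAll()->first()->text())
        : null;

    echo $value, PHP_EOL;
    ```

    reduce() compares text in PHP, so the match can be as loose or strict as needed (case-insensitive, regex, and so on).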

    If those accessors aren't enough, you can always drop down to the underlying DOMNode and use its properties directly.
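    A minimal sketch of the DOMNode route, using PHP's built-in DOM extension only. The helper name nextElement() and the sample markup are hypothetical. Note that nextSibling can be a whitespace text node rather than the next td, which is why the helper skips non-element nodes. A node obtained from a crawler via getNode(0) could be passed to the same helper.

    ```php
    <?php
    // Hypothetical helper: walk nextSibling until an element node is found,
    // skipping whitespace text nodes and comments.
    function nextElement(DOMNode $node): ?DOMNode
    {
        for ($n = $node->nextSibling; $n !== null; $n = $n->nextSibling) {
            if ($n->nodeType === XML_ELEMENT_NODE) {
                return $n;
            }
        }
        return null;
    }

    $doc = new DOMDocument();
    // Hypothetical markup; note the whitespace between the two cells.
    $doc->loadHTML('<table><tr><td>Title</td> <td>Example Book</td></tr></table>');

    $value = null;
    foreach ($doc->getElementsByTagName('td') as $td) {
        // textContent is the concatenated text of the cell and its descendants.
        if (trim($td->textContent) === 'Title') {
            $next  = nextElement($td);
            $value = $next !== null ? trim($next->textContent) : null;
            break;
        }
    }

    echo $value, PHP_EOL;
    ```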

    Of course, you might have better luck with XPath queries, which are extremely powerful.
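    For this particular case, a single XPath expression can match the td by its text and step to the adjacent cell. The sketch below passes it to filterXPath(); the markup is invented for illustration, and symfony/dom-crawler is assumed to be installed through Composer.

    ```php
    <?php
    // Assumption: Composer autoloader with symfony/dom-crawler available.
    require __DIR__ . '/vendor/autoload.php';

    use Symfony\Component\DomCrawler\Crawler;

    // Hypothetical markup standing in for the scraped page.
    $html = '<table>'
          . '<tr><td> Title </td><td>Example Book</td></tr>'
          . '<tr><td>Author</td><td>Jane Doe</td></tr>'
          . '</table>';

    $crawler = new Crawler($html);

    // normalize-space(.) trims and collapses whitespace before comparing;
    // following-sibling::td[1] selects the td immediately after the match.
    $cell = $crawler->filterXPath(
        '//td[normalize-space(.)="Title"]/following-sibling::td[1]'
    );

    // Guard against an empty result, since text() throws on an empty crawler.
    $value = $cell->count() > 0 ? trim($cell->text()) : null;

    echo $value, PHP_EOL;
    ```

    For a partial match, contains(., "Title") could replace the normalize-space() comparison in the predicate.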
