dongzenglin8292 2015-05-14 12:39

DOMDocument / DOMXPath - How to get HTML DOM elements by itemprop and img src

I am working on a script that extracts data from HTML DOM elements.

Here is my code:

$url = 'http://www.sportsdirect.com/nike-satire-mens-skate-shoes-242188?colcode=24218822';

// Collect libxml parse warnings internally instead of printing them for malformed HTML.
libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTMLFile($url);

$xpath = new DOMXPath($doc);

// Text content of the first <span id="ProductName">.
$Name = $xpath->query('//span[@id="ProductName"]')->item(0)->nodeValue;

echo $Name;

This code simply takes the text inside <span id="ProductName"></span>. I know how to get data from elements that have a specific class or id.

What I don't know is how to get the src value, e.g. src="http://adres-to-image.com/img.png" (purely an example), from an image tag, or how to get elements that have no id or class but do have an attribute such as itemprop, for example <div itemprop="name"></div>.

  1. How can I get the image src?
  2. How can I get elements with itemprop?

2 answers

  • dtuqxb3884 2015-05-14 12:52

    For your examples:

    $xpath->query('//img/@src')->item(0)->nodeValue
    

    This means

    Select the src attributes of all img tags and get the value of the first one.
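To try the src query without fetching the live page, here is a minimal sketch. The inline HTML is a hypothetical stand-in for the product page (the image URLs reuse the placeholder address from the question), and the variable names $srcNodes, $firstSrc and $allSrc are my own. It also checks that the query matched something, since item(0) returns null when there is no match.

```php
<?php
// Hypothetical markup standing in for the fetched product page.
$html = '<html><body>'
      . '<img src="http://adres-to-image.com/img.png" alt="first">'
      . '<img src="http://adres-to-image.com/other.png" alt="second">'
      . '</body></html>';

libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTML($html);
$xpath = new DOMXPath($doc);

// The query returns attribute nodes, one per matching <img>.
$srcNodes = $xpath->query('//img/@src');

// Guard against an empty result before dereferencing item(0).
$firstSrc = $srcNodes->length > 0 ? $srcNodes->item(0)->nodeValue : null;

// Collect every src value, not only the first.
$allSrc = [];
foreach ($srcNodes as $attr) {
    $allSrc[] = $attr->nodeValue;
}

echo $firstSrc, PHP_EOL;
```

If you already hold the img element itself (for example from a query for '//img'), calling getAttribute('src') on it should give the same value.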

    $xpath->query('//div[@itemprop="name"]')->item(0)->nodeValue
    

    This means

    Select all divs whose itemprop attribute equals "name" and get the value of the first one.
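The itemprop query can be sketched the same way. The markup, the product name and the price below are hypothetical sample data, not taken from the real page, and $nameNodes, $name and $props are illustrative names. The second query uses '//*[@itemprop]', which is my generalisation for matching any element that carries the attribute rather than only div elements.

```php
<?php
// Hypothetical markup; the real page's structure and values may differ.
$html = '<html><body>'
      . '<div itemprop="name">Nike Satire Mens Skate Shoes</div>'
      . '<span itemprop="price">39.99</span>'
      . '</body></html>';

libxml_use_internal_errors(true);
$doc = new DOMDocument();
$doc->loadHTML($html);
$xpath = new DOMXPath($doc);

// First <div> whose itemprop attribute equals "name".
$nameNodes = $xpath->query('//div[@itemprop="name"]');
$name = $nameNodes->length > 0 ? trim($nameNodes->item(0)->nodeValue) : null;

// Every element that has an itemprop attribute, keyed by the attribute's value.
$props = [];
foreach ($xpath->query('//*[@itemprop]') as $el) {
    $props[$el->getAttribute('itemprop')] = trim($el->nodeValue);
}

print_r($props);
```

Note that some pages put the value in an attribute instead of the text (for example a meta tag with a content attribute); in that case nodeValue would be empty and getAttribute('content') is probably what you want.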

    This answer was accepted by the asker as the best answer.