Extracting href values from an HTML page with PHP

I am trying to extract the news headlines and the link (href) of each headline using the code below, but the link extraction is not working: only the headline is returned. Please help me find out what's wrong with the code.

Page I want to extract the headlines and links from: http://web.tmxmoney.com/news.php?qm_symbol=BCM

<?php
$data = file_get_contents('http://web.tmxmoney.com/news.php?qm_symbol=BCM');
$dom = new domDocument;
@$dom->loadHTML($data);
$dom->preserveWhiteSpace = true;
$xpath = new DOMXPath($dom);
$rows = $xpath->query('//div');

foreach ($rows as $row) {

    $cols = $row->getElementsByTagName('span');

    $newstitle = $cols->item(0)->nodeValue;

    $link = $cols->item(0)->nodeType === HTML_ELEMENT_NODE ? $cols->item(0)->getElementsByTagName('a')->item(0)->getAttribute('href') : '';

echo $newstitle . '<br>';
echo $link . '<br><br>';
}
?>

Thanks in advance for your help!


2 answers

ds19891231 2016-11-17 20:52

Try this. It selects every <a> element under <body> and prints each href that is a valid absolute URL:

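<?php
  $data = file_get_contents('http://web.tmxmoney.com/news.php?qm_symbol=BCM');
  if ($data === false) {
      exit('Could not download the page.');
  }

  // Collect parser warnings about malformed HTML instead of silencing them with @.
  libxml_use_internal_errors(true);
  $dom = new DOMDocument();
  $dom->loadHTML($data);
  libxml_clear_errors();

  $xpath = new DOMXPath($dom);
  // Select every <a> element under <body> that has an href attribute.
  $hrefs = $xpath->query('/html/body//a[@href]');

  foreach ($hrefs as $href) {
      $url = filter_var($href->getAttribute('href'), FILTER_SANITIZE_URL);

      // FILTER_VALIDATE_URL accepts only absolute URLs, so relative links are skipped.
      if (filter_var($url, FILTER_VALIDATE_URL) !== false) {
          $safe = htmlspecialchars($url, ENT_QUOTES);
          echo '<a href="' . $safe . '">' . $safe . '</a><br />';
      }
  }
?>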
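
As for why the code in the question returns only the headline: HTML_ELEMENT_NODE is not a constant defined by PHP's DOM extension (the element constant is XML_ELEMENT_NODE), so the comparison likely never succeeds and $link stays empty. The loop also visits every <div>, including ones with no <span> or no <a> inside, where item(0) returns null. A minimal sketch that pairs each headline with its link is shown below. It assumes the headlines are <a> elements directly inside a <span>. The sample markup and the function name extractHeadlines are made up for illustration, and the real page may be structured differently, so the XPath expression will probably need adjusting.

```php
<?php
// Hypothetical markup standing in for the news page; the real page may differ.
$html = <<<HTML
<html><body>
<div><span><a href="/news/1">First headline</a></span></div>
<div><span>No link here</span></div>
<div><span><a href="http://example.com/news/2"> Second headline </a></span></div>
</body></html>
HTML;

// Returns a list of ['title' => ..., 'href' => ...] pairs, one per matching link.
function extractHeadlines(string $html): array
{
    // Buffer libxml warnings about malformed HTML rather than printing them.
    $previous = libxml_use_internal_errors(true);
    $dom = new DOMDocument();
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($dom);
    $items = [];

    // Query the anchors directly, so spans without a link are never touched.
    foreach ($xpath->query('//span/a[@href]') as $a) {
        $items[] = [
            'title' => trim($a->textContent),
            'href'  => $a->getAttribute('href'),
        ];
    }
    return $items;
}

foreach (extractHeadlines($html) as $item) {
    echo $item['title'], ' => ', $item['href'], PHP_EOL;
}
```

To run it against the live page, pass the result of file_get_contents() to extractHeadlines() instead of the sample string. Unlike the answer above, this keeps relative hrefs as they appear in the markup, so they may need to be resolved against the page URL before use.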
