如何从html页面获取文本链接？ [重复]

This question already has an answer here:

How do you parse and process HTML/XML in PHP? 30 answers

I want to get the links "http://www.w3schools.com/default.asp" & "http://www.google.com" from this webpage.I want the links of <a> tags inside <div class="link">,there are many other <a> tags in this page and I don't want them. How can I retrieve the particular links only? Can anyone help me?

<div class="link">
<a href="http://www.w3schools.com/default.asp">
<h4>W3 Schools</h4>
</a>
</div>
<div class="link">
<a href="http://www.google.com">
<h4>Google</h4>
</a>
</div>

</div>

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
douzhe3516 2013-12-19 11:22
关注
Use a DOM Parser such as DOMDocument to achieve this:

$dom = new DOMDocument; $dom->loadHTML($html); // $html is a string containing the HTML foreach ($dom->getElementsByTagName('a') as $link) { echo $link->getAttribute('href').'<br/>'; }

Output:

http://www.w3schools.com/default.asp http://www.google.com

Demo.

UPDATE: If you only want the links inside the specific <div>, you can use an XPath expression to find the links inside the div, and then loop through them to get the href attribute:

$dom = new DOMDocument; $dom->loadHTML($html); $xpath = new DOMXPath($dom); $links_inside_div = $xpath->query("//*[contains(@class, 'link')]/a"); foreach ($links_inside_div as $link) { echo $link->getAttribute('href').'<br/>'; }

Demo.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(2条)

报告相同问题？

关注问题

如何从html页面获取文本链接？ [重复] html php
2013-12-19 11:19

回答 3 已采纳 Use a DOM Parser such as DOMDocument to achieve this: $dom = new DOMDocument; $dom->loadHTML($
如何获取HTML文档中一个没有标签的文本？ html 有问必答
2021-12-31 14:18

回答 3 已采纳 document.body.childNodes
如何使用PHP Simple HTML dom获取此文本？ html php
2015-10-30 23:20

回答 2 已采纳 Maybe this will give you the result you are looking for: foreach($info_html->find('div.info p'
前端HTML5+CSS3学习笔记
2021-11-17 17:57

Baucc的博客前端HTML5+CSS3学习笔记
如何从php脚本中获取响应文本？ javascript php
2019-08-09 01:02

回答 1 已采纳 What you're looking for is the body of the response. It's a readable stream though so you'll need
jquery怎么能获取或修改文本节点? javascript jquery
2021-09-06 16:25

回答 4 已采纳获取div子元素然后循环找到[3]
如何在PHP中获取标签内的链接html（html为纯文本）？ html php
2017-10-23 16:06

回答 1 已采纳 Use the DOMDocument $dom = new DOMDocument; // load your html $dom->loadHTML($input); // loop
前端简介、HTTP、 HTML
2022-08-22 20:50

nana_cx的博客学习前端与后端差别、HTTP的四个特性、数据格式、响应状态码，HTML的标签(body内常见标签、列表标签、表格标签、表单标签)
富文本框数据回显后携带HTML标签的问题 html 前端
2021-12-22 11:27

回答 1 已采纳 js 里用 innerHTml属性。vue里v-html 指令。react里 dangerouslySetInnerHTML
后端返回html代码，如何重定向到一个临时页面，将后端返回的html代码插入其中？ html javascript 前端
2022-02-25 16:34

回答 4 已采纳重定向到window.location.href = ‘about:blank’，将后端返回的html写入页面documen.write(res)
如何从HTML页面中提取文本块？ html php
2011-03-08 23:15

回答 2 已采纳 I use phpQuery. Are you familiar with jQuery? they share the same syntax. You might be concerned a
SEO是什么？前端如何进行SEO优化
2021-11-13 22:54

万物之恋的博客前端如何进行SEO优化 SEO是什么？ seo又称网站优化，也称搜索引擎优化，英文名（Search Engine Optimization），简称：seo。 seo是一种基础搜索引擎的网络营销推广方式，通过搜索引擎平台的规则来优化，以实现产品...
如何用python获取这个网页的HTML（超文本链接语言）？ python 开发语言
2020-03-10 12:56

回答 2 已采纳实验了一下，加了个请求头，试过可以获取，我的代码 ``` import requests import html headers = {"User-Agent": "Mozilla/5.0
前端开发基础 HTML+CSS+JS
2021-04-28 00:22

小白典的博客 HTML文本是由HTML命令组成的描述性文本，HTML命令用来说明文字、图像、视频、表格、链接等，目前广泛使用的是HTML5 CSS介绍 CSS(Cascading Style Sheets)是层叠样式表，用来表现HTML文件样式的标准语言。CSS用于定义...
2023前端面试题汇总
2023-03-09 17:58

下雪不过冬天的博客 2023前端基础面试题汇总
没有解决我的问题, 去提问

悬赏问题

¥15 求帮我调试一下freefem代码
¥15 matlab代码解决，怎么运行
¥15 R语言Rstudio突然无法启动
¥15 关于#matlab#的问题：提取2个图像的变量作为另外一个图像像元的移动量，计算新的位置创建新的图像并提取第二个图像的变量到新的图像
¥15 改算法，照着压缩包里边，参考其他代码封装的格式写到main函数里
¥15 用windows做服务的同志有吗
¥60 求一个简单的网页(标签-安全|关键词-上传)
¥35 lstm时间序列共享单车预测，loss值优化，参数优化算法
¥15 Python中的request，如何使用ssr节点，通过代理requests网页。本人在泰国，需要用大陆ip才能玩网页游戏，合法合规。
¥100 为什么这个恒流源电路不能恒流？

如何从html页面获取文本链接？ [重复]

3条回答 默认 最新

悬赏问题

3条回答默认最新