如何从网页上抓取数据？

I need to show some news from web page, so I need to extract data from web site. But I am unable to extract data as the following code:

$html=file_get_html("http://listverse.com/2014/12/01/10-times-us-foreign-policy-was-wildly-inconsistent/");
     foreach($html->find('article h2') as $element)
     {
        echo "<h2>".$element->plaintext."</h2>"."<br>";

        foreach ($html->find('article h2 p') as $element1) {

            echo "<pre>";print_r($element1->plaintext );
        }

But I got correct header but each paragraph is redundant.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doupu2722 2014-12-02 12:51
关注
The paragraphs follow the headings, they aren't descendants of them (and HTML doesn't allow paragraphs to descend from headings anyway).

Having got the headings, you need to look at their siblings (e.g. looping over them until you get one that isn't a paragraph or is another heading).

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

php访问国外的一个网页网页抓取json数据 json php
2018-11-15 06:12

回答 1 已采纳问题已解决，token问题，具体看是哪一个token，不同页面的token不同
使用curl从asp抓取数据 php
2014-12-07 05:51

回答 1 已采纳 We don't need to write values for ViewState and EventValidation as we can get this values dynamica
php使用curl爬取页面,json数据获取不完整 json php 有问必答
2021-08-02 16:03

回答 2 已采纳你访问的是同一个url?你爬取的是列表内容。并没有去请求详细内容
如何使用php抓取基于javascript和ajax的网页数据 ajax javascript php
2014-12-18 09:39

回答 1 已采纳 PHP doesn't render JS, so you can't do what you are asking. But, that page is making a request wh
求教大神！我抓取到一个页面，但是我想得到这个页面中的值，该怎么办？ java
2017-03-14 08:35

回答 1 已采纳 getElementsByClassName() 这个函数可以，你可以查查用法
自动更新sql中的网页数据 php sql
2016-05-30 18:25

回答 2 已采纳 Turns out the best way to solve this was using Cron Jobs. I run a PHP script every day and I modi
php如何抓取网页内容,php如何抓取网页数据？
2021-03-26 12:01

weixin_39678304的博客 php抓取网页数据header("Content-type: text/html; charset=utf-8");//$url = "https://www.cnblogs.com/chenliyang/p/6554647.html";//$html = file_get_contents($url);////如果出现中文乱码使用下面代码////$...
python抓取网页上的表格写入CSV，0开头的数字，怎么能完整的写入csv？ python 爬虫
2022-08-20 08:49

回答 3 已采纳 import pandas as pd df = pd.DataFrame() for i in range(1, 5): url = f'http://vip.stock.finance
在php中抓取https json url [关闭] https json php
2013-04-22 12:30

回答 2 已采纳 What's the error? See this (possible dupe) Unless, is there any reason why you can't use curl, o
（PHP）使用Curl获取空白页面（Mytischtennis） php
2017-12-06 16:11

回答 1 已采纳 You need to set two more options in your curl request: // Add some more headers here if you need
PHP的cURL库功能简介抓取网页、POST数据及其他
2020-12-18 17:58

无论是你想从从一个链接上取部分数据，或是取一个XML文件并把其导入数据库，那怕就是简单的获取网页内容，反应釜cURL 是一个功能强大的PHP库。本文主要讲述如果使用这个PHP库。启用 cURL 设置首先，我们得先要确定...
抓取HTML表格数据并创建XML或JSON文档 php xml
2012-06-05 15:41

回答 3 已采纳 Here's a quick example to get you started using only dom functions: $dom = new DOMDocument(); @$d
PHP网页抓取之抓取百度贴吧邮箱数据代码分享
2020-09-21 17:14

本文给大家介绍PHP网页抓取之抓取百度贴吧邮箱数据代码分享，程序实现了一键抓取帖子全部邮箱和分页抓取邮箱两个功能，感兴趣的朋友一起学习吧
PHP中4种常用的抓取网络数据方法
2020-10-24 04:03

主要介绍了PHP中4种常用的抓取网络数据方法,本文讲解使用file_get_contents函数、fopen函数、curl库三种常见方法抓取网络数据,并给出了代码实例,需要的朋友可以参考下
没有解决我的问题, 去提问

悬赏问题

¥15 Vue3 大型图片数据拖动排序
¥15 划分vlan后不通了
¥15 GDI处理通道视频时总是带有白色锯齿
¥20 用雷电模拟器安装百达屋apk一直闪退
¥15 算能科技20240506咨询（拒绝大模型回答）
¥15 自适应 AR 模型参数估计Matlab程序
¥100 角动量包络面如何用MATLAB绘制
¥15 merge函数占用内存过大
¥15 使用EMD去噪处理RML2016数据集时候的原理
¥15 神经网络预测均方误差很小但是图像上看着差别太大

如何从网页上抓取数据？

1条回答 默认 最新

悬赏问题

1条回答默认最新