从网站检索特定数据

I am currently building a scraper to scrape certain information from a website.

For example, I would like to get a restaurant name, address, opening hours & telephone number from a website.

By using curl, I managed to get the data from the website:

    $url = "http://localhost/test.html";
    $ch = curl_init(); 
    curl_setopt($ch, CURLOPT_URL, $url); 
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1); 
    $data = curl_exec($ch); 
    curl_close($ch);

However, I need some ideas on how would I be able to pin point my scraper to the exact location to scrape these information out.

I have tried regular expressions, but was unable to get it to work.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongwang3066 2012-10-05 12:48
关注
Use SimpleHTMLDom parser for php:
http://simplehtmldom.sourceforge.net/

Download here:
http://sourceforge.net/projects/simplehtmldom/files/

Documentation here:
http://simplehtmldom.sourceforge.net/manual.htm

That is as I have experience with parsing the best tool for parsing HTML with php...

Also you don't need to use curl for getting content if it is not necessary, for simpleHTMLDom parser just use:

$remote_html = file_get_html("http://www.somesite.com/");
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

从网站检索特定数据 html php
2012-10-05 12:47

回答 2 已采纳 Use SimpleHTMLDom parser for php: http://simplehtmldom.sourceforge.net/ Download here: http://sou
从sql查询中检索特定数据 php sql
2013-10-08 18:36

回答 1 已采纳 Modify your php loop like so: $downloaded=0; $redeemed=0; while ($row = mysql_fetch_row($result))
PHP循环从已解码的JSON中检索特定数据的每个实例 json php
2019-03-06 17:13

回答 2 已采纳 You can merge the second level arrays (shirts and pants keys) and take the uid column from the res
浏览器前端数据快速检索
2021-02-08 10:32

烤辣椒放醋的博客我相信很多人Web开发人员都遇到过前端数据联想输入的问题。例如：百度搜索时会联想出你想要检索的信息。像这样的需求可能还是相当比较简单的，因为这种输入一般输入的速度并不是很快，通过keyup+ajax是可以满足...
如何从数据库mysql中检索特定数据 mysql php
2013-11-25 08:31

回答 1 已采纳 You could just check which option the person has selected and depending on that selected option yo
JQuery：从Html页面检索特定用户数据 html jquery php
2014-02-14 13:27

回答 4 已采纳 Missing "" in your html code <a class="peoplePage" data-username="<?php echo $theUsernameDa
从数据库中检索阿拉伯数据 mysql php
2018-11-13 06:52

回答 4 已采纳 instead of $sSQL= 'SET CHARACTER SET utf8'; mysqli_query($conn,$sSQL); try mysqli_set_chars
前端面试八股文（超详细）
2022-03-06 17:42

小泽今天早睡的博客比如说直接使用缓存而不发起请求，或者发起了请求但后端存储的数据和前端一致，那么就没有必要再将数据回传回来，这样就减少了响应数据。强制缓存就是向浏览器缓存查找该请求结果，并根据该结果的缓存规则来决定...
从html.Node检索原始数据
2018-02-04 17:48

回答 1 已采纳 I get what you mean, I use a lot of this in tests. What you need is already in the same x/net/htm
从嵌套对象中检索特定值 php
2016-10-26 09:09

回答 3 已采纳 If you want to get values of Pro_photourl and id from std class, you can just use: <?php forea
使用goquery从网站检索文本 html
2017-12-21 20:50

回答 2 已采纳 I somehow don't like the idea of using regex to parse html. I feel it to be too fragile against mi
基于Python的信息检索与信息抽取系统-课程设计.rar
2023-06-15 16:28

本项目利用Python实现了一个信息检索与信息抽取系统，包括数据、前端和后端代码。信息检索（Information Retrieval）是用户进行信息查询和获取的主要方式，是查找信息的方法和手段。狭义的信息检索仅指信息查询...
从桥表laravel中检索数据 laravel php
2017-06-03 03:59

回答 1 已采纳 Add the pivot table column in your relationship public function ingredients() { return $this-
2023前端面试题汇总
2023-03-09 17:58

下雪不过冬天的博客 2023前端基础面试题汇总
Element UI的数据表格数据检索方法
2022-04-25 10:18

傻傻的羊的博客 Element UI的数据表格数据检索方法，单元格内容检索，表头样式检索。
没有解决我的问题, 去提问

悬赏问题

¥20 RL+GNN解决人员排班问题时梯度消失
¥15 统计大规模图中的完全子图问题
¥15 使用LM2596制作降压电路，一个能运行，一个不能
¥60 要数控稳压电源测试数据
¥15 能帮我写下这个编程吗
¥15 ikuai客户端l2tp协议链接报终止15信号和无法将p.p.p6转换为我的l2tp线路
¥15 phython读取excel表格报错 ^7个 SyntaxError: invalid syntax 语句报错
¥20 @microsoft/fetch-event-source 流式响应问题
¥15 ogg dd trandata 报错
¥15 高缺失率数据如何选择填充方式

从网站检索特定数据

2条回答 默认 最新

悬赏问题

2条回答默认最新