Using cURL to get information from an HTML table

I need to get some information about some plants and put it into a MySQL table. My knowledge of cURL and the DOM is practically nil, but I've come up with this:

    set_time_limit(0); // no PHP execution time limit
    include('simple_html_dom.php');

    $ch = curl_init("http://davesgarden.com/guides/pf/go/1501/");

    curl_setopt($ch, CURLOPT_USERAGENT, "Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.1) Gecko/2008070208 Firefox/3.0.1");
    curl_setopt($ch, CURLOPT_HTTPHEADER, array("Accept-Language: es-es,en"));
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1); // return the response as a string instead of printing it
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, 1); // follow redirects
    curl_setopt($ch, CURLOPT_BINARYTRANSFER, 1);
    curl_setopt($ch, CURLOPT_TIMEOUT, 0);        // 0 = no cURL timeout
    curl_setopt($ch, CURLOPT_SSL_VERIFYPEER, false);
    $data = curl_exec($ch);
    curl_close($ch);

    // Parse the fetched page with simple_html_dom
    $html = str_get_html($data);

    // The index is zero-based, so this is the ninth <table> on the page
    $e = $html->find("table", 8);

    echo $e->innertext;

Now I'm really lost about how to move on from this point. Can you please guide me?

Thanks!

douyimiao1993 2012-04-23 22:08
This is a mess.

But at least it's a (somewhat) consistent mess.

If this is a one-time extraction and not an ongoing project, personally I'd use a quick and dirty regex on this instead of simple_html_dom. You'll be there all day twiddling with the tags otherwise.

For example, this regex pulls out the majority of title/data pairs:

$pattern = "/<b>(.*?)<\/b>\s*<br>(.*?)<\/?(td|p)>/si";

You'll need to do some pre- and post-cleaning before it will get them all, though.
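As a rough sketch of how that pattern might be applied, the snippet below runs it with `preg_match_all` and does some simple post-cleaning. The sample markup in `$data` is made up for illustration and is only an assumption about the page's layout (in practice `$data` would be the string returned by `curl_exec`). The `$pairs` array and the cleaning steps are likewise just one possible approach. Because `/` is the pattern delimiter, the slashes inside the closing tags are escaped.

```php
<?php
// Hypothetical sample markup; the real page's layout may differ.
$data = <<<HTML
<table>
<tr><td><b>Family:</b> <br>Rosaceae</td></tr>
<tr><td><b>Genus:</b><br><a href="#">Rosa</a> </td></tr>
<tr><td><p><b>Height:</b><br>4-6 ft. (1.2-1.8 m)</p></td></tr>
</table>
HTML;

// Title in <b>...</b>, then <br>, then the data up to the next <td>/<p> tag.
$pattern = '/<b>(.*?)<\/b>\s*<br>(.*?)<\/?(td|p)>/si';

$pairs = [];
if (preg_match_all($pattern, $data, $matches, PREG_SET_ORDER)) {
    foreach ($matches as $m) {
        // Post-cleaning: drop nested tags, decode entities, trim whitespace,
        // and remove the trailing colon from the title.
        $title = rtrim(trim(html_entity_decode(strip_tags($m[1]))), ':');
        $value = trim(html_entity_decode(strip_tags($m[2])));
        $pairs[$title] = $value;
    }
}

print_r($pairs);
```

On this sample, `$pairs` ends up with three entries (`Family`, `Genus`, `Height`), which could then be written to a MySQL table one row per plant. Real pages will likely need extra cleaning for the rows the pattern misses.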

I don't envy you having this task...