使用div的PHP web抓取

I have tried everything, I have read on the other questions however it doesn't work.

I want from this website:

http://www.interparcel.com/tracking.php?action=dotrack&trackno=RE367831140GR

To extract this:

Sorry, no consignment was found with these details.Error - No xml data received

I have also tried with the websites parcelforce.com and dhl.com: The same procedures, it results zero matches.

Things I have tried (among st many):

$curl = curl_init('http://www.interparcel.com/tracking.php?action=dotrack&trackno=$nummm');
curl_setopt($curl, CURLOPT_RETURNTRANSFER, TRUE);

$page = curl_exec($curl);

if(curl_errno($curl)) // check for execution errors
{
    echo 'Scraper error: ' . curl_error($curl);
    exit;
}

curl_close($curl);

$regex = '/<div class="header-description">(.*?)</div>/s';
if ( preg_match($regex, $page, $list) )
    echo $list[0];
else 
    print "Not found"; 

<?php // File: MatchAllDivMain.php

// Read html file to be processed into $data variable
$data = file_get_contents('test.html');

// Commented regex to extract contents from <div class="main">contents</div>
//  where "contents" may contain nested <div>s.
//  Regex uses PCRE's recursive (?1) sub expression syntax to recurs group 1
$pattern_long = '{           # recursive regex to capture contents of "main" DIV
<div\s+class="main"\s*>              # match the "main" class DIV opening tag
  (                                   # capture "main" DIV contents into $1
    (?:                               # non-cap group for nesting * quantifier
      (?: (?!<div[^>]*>|</div>). )++  # possessively match all non-DIV tag chars
    |                                 # or 
      <div[^>]*>(?1)</div>            # recursively match nested <div>xyz</div>
    )*                                # loop however deep as necessary
  )                                   # end group 1 capture
</div>                               # match the "main" class DIV closing tag
}six';  // single-line (dot matches all), ignore case and free spacing modes ON

// short version of same regex
$pattern_short = '{<div\s+class="main"\s*>((?:(?:(?!<div[^>]*>|</div>).)++|<div[^>]*>(?1)</div>)*)</div>}si';

$matchcount = preg_match_all($pattern_long, $data, $matches);
// $matchcount = preg_match_all($pattern_short, $data, $matches);
echo("<pre>
");
if ($matchcount > 0) {
    echo("$matchcount matches found.
");
    //  print_r($matches);
    for($i = 0; $i < $matchcount; $i++) {
        echo("
Match #" . ($i + 1) . ":
");
        echo($matches[1][$i]); // print 1st capture group for match number i
    }
} else {
    echo('No matches');
}
echo("
</pre>");
?>

Methods described in:

all without success, any help on what I'm doing wrong?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

报告相同问题？

关注问题

chrome调试前端的div显示问题 chrome 前端
2017-06-08 02:11

回答 1 已采纳左边是padding，右边是内容块，下面是margin，到时窗口右边的有布局 ![图片说明](https://img-ask.csdn.net/upload/201706/08/1496888796
使用Php后端和C＃前端聊天应用程序 c# php xamarin
2017-01-22 07:10

回答 1 已采纳 So I end up using RestFul Services to complete my Task
如何使用php从facebook抓取一个关键字 facebook php
2014-10-04 07:07

回答 1 已采纳 For Twitter: https://dev.twitter.com/rest/reference/get/statuses/mentions_timeline For Facebook i
Web 前端基础知识面试大全
2022-04-01 18:29

studyer网的博客如果使用apply或call方法，那么this指向他们的第一个参数，apply的第二个参数是一个参数数组，call的第二个及其以后的参数都是数组里面的元素，就是说要全部列举出来； Bind:返回绑定函数，传入参数数列 Apply:传入...
web前端调试时浏览器报错 type：Status report
2016-03-04 08:13

回答 5 已采纳你的Dialog来自哪个js文件？
js-web-screen-shot截图空白 javascript vue.js 前端
2022-06-01 18:06

回答 2 已采纳 <template> <div class="toolbarOutline"> 名称<el-input v-model="name"
使用php从输入框中获取值 html javascript php
2018-06-07 18:04

回答 1 已采纳 Use a form with the method attribute set to ‘get’ or ‘post’ like this: <form method=“post”>
web前端开发php面试题及答案,web前端js面试题及参考答案
2021-05-05 06:33

weixin_39814126的博客 web前端js面试题及参考答案面试题在web前端js求职者在面试求职考核中重要的组成部分，以下是小编为大家整理的：web前端js面试题及参考答案，仅供大家参考!web前端js面试题及参考答案1.WEB标准以及W3C标准是什么?标签...
使用PHP CURL下载MP4文件 php
2017-09-14 00:56

回答 1 已采纳 You must open the file in binary mode to ensure file is saved to disk correctly. $file = fopen($d
requests抓取html, 为什么div中的内容没有被抓取 ajax html5 python 有问必答
2021-04-28 20:30

回答 4 已采纳 imgs = bf1.find_all('div',id='cp_img').get_text()，还要调用get_text()这个方法获取，find_all只是找到这个标签对象。如果觉得有帮忙，
使用PHP发送TCP数据 php
2017-04-18 15:05

回答 1 已采纳 Use PHP sockets : <?php while(true){ sleep 30; $fp = fsockopen("www.example.com", 23,
Web前端基础知识总结
2019-03-19 11:40

随便起的名字也被占用的博客除此之外，Web Storage拥有setItem,getItem,removeItem,clear等方法，不像cookie需要前端开发者自己封装setCookie，getCookie。但是Cookie也是不可以或缺的：Cookie的作用是与服务器进行交互，作为HTTP规范的一部分...
使用php添加背景图片 php
2016-02-21 10:21

回答 2 已采纳 A css file contains CSS. Just CSS. You can't write html or PHP into a CSS file. If you want to gen
Web前端开发应该必备的编码原则
2019-11-19 09:36

web前端新手学习之家的博客今天小编要跟大家分享的文章是关于Web前端开发应该必备的编码原则。HTML已经走过了20几年的发展历程，它几乎见证了整个互联网的发展。但是，即便到现在，有很多基础的概念和原则依然需要开发者高度注意。下面，向...
web前端开发面试题
2020-07-06 21:32

书亦何欢*的博客 3.如何看待前端开发？ 4.平时是如何学习前端开发的？ 5.未来三到五年的规划是怎样的？ position的值， relative和absolute分别是相对于谁进行定位的？ § absolute :生成绝对定位的元素，相对于最近一级的定位不是...
没有解决我的问题, 去提问

悬赏问题

¥15 微信会员卡等级和折扣规则
¥15 微信公众平台自制会员卡可以通过收款码收款码收款进行自动积分吗
¥15 随身WiFi网络灯亮但是没有网络，如何解决？
¥15 gdf格式的脑电数据如何处理matlab
¥20 重新写的代码替换了之后运行hbuliderx就这样了
¥100 监控抖音用户作品更新可以微信公众号提醒
¥15 UE5 如何可以不渲染HDRIBackdrop背景
¥70 2048小游戏毕设项目
¥20 mysql架构，按照姓名分表
¥15 MATLAB实现区间[a,b]上的Gauss-Legendre积分

码龄粉丝数原力等级 --

使用div的PHP web抓取

0条回答默认最新

悬赏问题

使用div的PHP web抓取

0条回答 默认 最新

悬赏问题

0条回答默认最新