刮书价格

I'm trying to write a scrape app, and I'm running in to problems. My PHP Curl code isn't pulling up the pages with the price of the books. It's returning me to the web root of the domain.

I'm trying to search the site by ISBN.

I've been bashing my head against the wall for days. Any help will be most appreciated!

Code:

<form method="post" for="new-search" name="SearchTerm" class='form-validate' id="SearchTerm" action="index.php">
    <textarea rows="3" name="SearchTerm" id="SearchTerm" cols="40" class="validate-required error"></textarea><div class="error" id="SearchTerm-error">
    <br>                        
    <button class="search primary" type="submit">continue</button>

</form>


<?php

/*
echo("<pre>");print_r($_GET);echo("</pre>");
echo("<pre>");print_r($_POST);echo("</pre>");
*/

$isbn = $_POST['SearchTerm'];


$userAgent = 'User-Agent: Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US;rv:1.8.1.16) Gecko/20080702 Firefox/2.0.0.16';

$fields = array(
    'url' => ("http://www.bookleberry.com/Search/SearchKeyword"),
    'qurl' => ("http://www.bookleberry.com/Search/SearchKeyword/" . $_POST['SearchTerm']),
    'SearchTerm' => ($_POST['SearchTerm']),
    'Page' => ('1'),
    'class' => ('textfield validate-required'),
    'for' => ('new-search'),
    'result-count' => ('1'),
    'status' => 'success',
);

$SearchTerm = ($fields['SearchTerm']);
$url = ($fields['url']);
$Page = ($fields['Page']);


echo("<pre>");
print_r($fields);
echo("</pre>");

if ($isbn != NULL){

    //open connection
    $ch = curl_init($url);
    //set the url, number of POST vars, POST data
    curl_setopt($ch, CURLOPT_HEADER, $userAgent);
    curl_setopt($ch, CURLOPT_URL, $url);
        echo "before curl_exec:<br>";
        echo "curl_errno=". curl_errno($ch) ."<br>";
        echo "curl_error=". curl_error($ch) ."<br>";
    curl_setopt($ch,CURLOPT_POST,count($fields));
    curl_setopt($ch, CURLOPT_POST, 1);
    curl_setopt($ch, CURLOPT_POSTFIELDS, "?SearchTerm=$SearchTerm");
    curl_setopt($ch, CURLOPT_HTTPGET, 1);
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, 1);
    curl_setopt($ch, CURLOPT_TIMEOUT, 9999999);
     curl_setopt($ch,CURLOPT_HTTPHEADER,array (
        "Accept: application/json"
    ));




    $info = curl_getinfo($ch);

    //execute post
    $result = curl_exec($ch);
    print $result;


print "<pre>
";
print_r(curl_getinfo($ch));  // get error info

?>

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongzhang7961 2011-02-24 19:21
关注
Don't hurt your head, use it!

Install fiddler.

Do a request using the browser, look in fiddler to exactly what is posted. This includes all headers, cookies and form variables.

Do a post using your code, examine fiddler again

Compare the differences between the two and adjust your script.

Repeat.

Also it helps to install firebug. Using the copy Xpath, and putting that into a php DOM xpath query makes scraping fun and easy!
解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

用PHP刮取页面 php
2019-01-08 10:14

回答 1 已采纳 A very quick look at the page https://www.soccerstats.com/matches.asp showed that what the "cookie
[php]图书检索下拉菜单模糊查询的问题 php
2018-03-21 15:29

回答 6 已采纳下拉菜单条件1 条件2 条件3 条件4 按钮 php $option = $_POST['$option']; $search = $_P
PHP str_replace使用通配符刮取内容？ php
2018-08-17 20:23

回答 3 已采纳 Well maybe my question wasn't that good written. I had a table which I needed to scrape from a web
淘宝天猫各平台APP端页面详情api接口调用
2022-04-07 10:30

是有头发的程序猿的博客为了进行此平台API的调用，首先我们需要做下面几件事情。... “spuId”: “2256953934”, “subtitle”: “耐刮钢化玻璃桌面多抽屉分类收纳钢琴烤漆”, “taobaoDescUrl”: “//market.m.taobao....
PHP输出1-100的质数 php
2021-05-16 11:53

回答 1 已采纳 <?php header("content-type:text/html;charset=utf-8"); function getPrime($num){ $s=""; for (
刮google图像结果php php
2011-01-13 01:00

回答 2 已采纳 Google provides an image search api: http://code.google.com/apis/imagesearch/. You should try to u
通过php curl得取淘宝单个价格和特征资料 php
2019-11-14 17:45

回答 1 已采纳 https://blog.csdn.net/m0_37683054/article/details/76101048
网页抓取 - 完整指南
2023-01-28 11:57

海拥✘的博客它具有价格监控、媒体监控、情感分析等多种用途。数据现在已成为市场上的新石油。如果使用得当，企业可以通过领先于竞争对手来实现目标。这样，他们就可以利用这一优势来超越竞争对手。你拥有的相关数据越多，你做出...
php for循环请求接口超时 php
2019-02-23 14:16

回答 2 已采纳建议使用异步消息队列既可以完成你的需求体验感也不会差
php怎么对视频播放地址进行加密 javascript php
2020-09-04 15:33

回答 1 已采纳 https://blog.csdn.net/qincidong/article/details/82781699
无法找到包php7.3-gd php
2019-08-07 07:46

回答 1 已采纳 It looks like you do not have the appropriate repo added. Try : sudo add-apt-repository ppa:ondre
Linux 就该这么学
2018-08-14 00:44

蔚1的博客窃以为，一名技术高超的导师不应该仅仅是技术的搬运工，而应该是优质知识的提炼者，所以在写作本书的过程中，我不希望也不会将自己了解掌握的所有技术知识都写到书里，借此来炫技，而是从真正贴近于新人学习特点的...
php curl返回400 Bad Request php
2019-07-16 16:26

回答 1 已采纳 @everyone. Thanks for your tips. Finally, my code works with following configuration almost time.
计算机类专业毕业设计（课程设计）题目大全
2019-05-15 22:15

askunix_hjh的博客计算机散件报价系统电子商务网站设计 ( 网上商品销售系统 ) 供求信息网基于 WEB的设备管理系统基于 Web的网上物流系统网络考试系统人力资源管理系统基于 WEB的购物系统汽车销售管理信息系统 ...
我用维权失败经历告诉你，在淘宝上买到假货只能忍气吞声
2019-01-13 19:49

encoderlee的博客这张发票疑点太多，根据北京国税局提供的发票识别方法，都和真发票特征不符，最为关键的是，右下角的发票密码，居然是半透明的，不用刮开都能看到数字。然后我把发票代码和发票密码，拿到北京国税局网站上查询，虽然...
没有解决我的问题, 去提问

悬赏问题

¥15 用windows做服务的同志有吗
¥60 求一个简单的网页(标签-安全|关键词-上传)
¥35 lstm时间序列共享单车预测，loss值优化，参数优化算法
¥15 Python中的request，如何使用ssr节点，通过代理requests网页。本人在泰国，需要用大陆ip才能玩网页游戏，合法合规。
¥100 为什么这个恒流源电路不能恒流？
¥15 有偿求跨组件数据流路径图
¥15 写一个方法checkPerson，入参实体类Person，出参布尔值
¥15 我想咨询一下路面纹理三维点云数据处理的一些问题，上传的坐标文件里是怎么对无序点进行编号的，以及xy坐标在处理的时候是进行整体模型分片处理的吗
¥15 一直显示正在等待HID—ISP
¥15 Python turtle 画图

刮书价格

1条回答 默认 最新

悬赏问题

1条回答默认最新