如何在禁用cURL和allow_url_fopen时抓取网站

I know the question regarding PHP web page scrapers has been asked time and time and using this, I discovered SimpleHTMLDOM. After working seamlessly on my local server, I uploaded everything to my online server only to find out something wasn't working right. A quick look at the FAQ lead me to this. I'm currently using a free hosting service so edit any php.ini settings. So using the FAQ's suggestion, I tried using cURL, only to find out that this too is turned off by my hosting service. Are there any other simple solutions to scrape contents of a of another web page without the use or cURL or SimpleHTMLDOM?

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

4条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doutui4649 2010-10-07 10:23
关注
If cURL and allow_url_fopen are not enabled you can try to fetch the content via

fsockopen — Open Internet or Unix domain socket connection

In other words, you have to do HTTP Requests manually. See the example in the manual for how to do a GET Request. The returned content can then be further processed. If sockets are enabled, you can also use any third party lib utilitzing them, for instance Zend_Http_Client.

On a sidenote, check out Best Methods to Parse HTML for alternatives to SimpleHTMLDom.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(3条)

报告相同问题？

关注问题

如何在禁用cURL和allow_url_fopen时抓取网站 php
2010-10-07 10:12

回答 4 已采纳 If cURL and allow_url_fopen are not enabled you can try to fetch the content via fsockopen — Ope
Sendgrid PHP使用未定义的常量CURL_SSLVERSION_TLSv1_2 php
2015-04-14 17:34

回答 4 已采纳 It looks like you might have an outdated build of CURL installed on your machine. CURL_SSLVER
在curl_init（）中将变量插入url时的语法 php
2016-01-13 21:13

回答 1 已采纳 It's probably because you're using single quotes. Try any of the following: $ch = curl_init("htt
curl_setopt — 设置 cURL 传输选项
2018-02-08 23:58

weixin_30867015的博客 curl_setopt (PHP 4 >= 4.0.2, PHP 5, PHP 7) curl_setopt—设置 cURL 传输选项 bool curl_setopt ( resource $ch , int $option , mixed $value ) 为 cURL 会话句柄设置选项。参数 ch 由...
从JSON php curl中提取数据：json_decode无法正常工作 json php
2018-10-18 19:25

回答 1 已采纳 Buoyed by your answers - @Andreas and @YvesLeBorg, I redid the code and got what I want. I am putt
cURL不使用GET var和fopen - PHP php
2016-10-25 20:54

回答 2 已采纳 You need to initialize $ch: $ch = curl_init(); url_setopt_array( $ch, array( CURLOPT_
allow_url_fopen的simplexml补丁用于simplehtmldom php
2011-05-05 14:34

回答 2 已采纳 You'd need to edit the simple_html_dom source. Its easier to just create your own function that do
PHP下通过file_get_contents()方法不能正常获取远程网页内容
2017-06-20 15:13

HikingTsang的博客本文介绍了PHP下通过file_get_contents()方法不能正常获取远程网页内容的解决方法。
PHP Curl在浏览器中返回不同的URL结果 php
2018-06-01 11:31

回答 2 已采纳 Because JavaScript is the root of all evil. the website gets the search results you want with AJAX
如何将变量作为cURL数组中的url参数传递给CURLOPT_URL php
2014-06-04 08:42

回答 3 已采纳 Using cURL is not needed for this and was loading very slow. Here is a very simple way to get the
使用CURL multi时报警[curl_multi_remove_handle（）] php
2015-09-16 20:48

回答 1 已采纳 In that last foreach you seem to assume that both $urls and $conn arrays have same indexes, which
PHP函数常用的抓取页面方式
2018-07-25 23:41

宋同学灬的博客简单将自己常用的页面抓取的函数分享给大家！ 1、files(); 2、file_get_contents(); 3、fopen();...使用file_get_contents之前必须开启allow_url_fopen 在php.ini 中设置allow_url_fopen= on...
PHP CURL - 当你只知道id时刮掉seo url php
2018-08-10 10:52

回答 2 已采纳 Curl provides the option CURLOPT_FOLLOWLOCATION. curl_setopt($curl, CURLOPT_FOLLOWLOCATION, true)
CTFSHOW_WEB入门1-112
2024-01-02 11:42

萝北哦的博客 gitsvnhg对于此类题目应该比较敏感网站备份信息泄露.zip.rar.7z.tar,gzbak.swp.txt···
PHP 采集大全采集原理分析禁用采集各种采集方法详解采集的攻于防采集性能应用协议分析...
2014-03-11 14:34

HookPHP的博客做了N年的PHP，采集了N家数据，由初学者菜鸟，到现在的熟手，采集天猫、淘宝 ...
PHP代码审计8—SSRF 漏洞
2022-08-02 16:23

W0ngk的博客 4、常见的防御绕过方法 1）DNS重绑定原理：攻击者控制了或者拥有一台DNS服务器，将一个子域绑定到了两个不同的IP，IP地址再不断轮换，目标服务器在检测URL和访问URL时指向的IP地址不同，导致白名单检测被绕过。...
PHP文件包含漏洞复现
2019-09-17 20:10

春日野穹ㅤ的博客文件包含漏洞的产生原因是在通过 PHP 的函数引入文件时，由于传入的文件名没有经过合理的校验，从而操作了预想之外的文件，就可能导致意外的文件泄露甚至恶意的代码注入。 php 中引发文件包含漏洞的通常是以下四个...
没有解决我的问题, 去提问

悬赏问题

¥15 wpf界面一直接收PLC给过来的信号，导致UI界面操作起来会卡顿
¥15 init i2c:2 freq:100000[MAIXPY]: find ov2640[MAIXPY]: find ov sensor是main文件哪里有问题吗
¥15 运动想象脑电信号数据集.vhdr
¥15 三因素重复测量数据R语句编写，不存在交互作用
¥15 微信会员卡等级和折扣规则
¥15 微信公众平台自制会员卡可以通过收款码收款码收款进行自动积分吗
¥15 随身WiFi网络灯亮但是没有网络，如何解决？
¥15 gdf格式的脑电数据如何处理matlab
¥20 重新写的代码替换了之后运行hbuliderx就这样了
¥100 监控抖音用户作品更新可以微信公众号提醒

如何在禁用cURL和allow_url_fopen时抓取网站

4条回答 默认 最新

悬赏问题

4条回答默认最新