dongwen5870 2014-07-01 09:44

Exclude crawlers from a subdomain using .htaccess

I want to stop crawlers from crawling the subdomain tools.subdomain.com. I found a snippet on the internet that shows the following rewrite rule:

RewriteCond %{HTTP_USER_AGENT} (googlebot|bingbot|Baiduspider) [NC]
RewriteRule .* - [R=403,L]

How can I block those crawlers on this subdomain, or only allow current, up-to-date browsers to visit the subdomain? I want to manage this through .htaccess, because not every crawler respects robots.txt. For robots.txt I have the following rewrite condition:

RewriteCond %{HTTP_HOST} =testing.subdomain.com
RewriteRule ^robots\.txt$ /robots_testing.txt [L]
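
For context, the robots_testing.txt served by that rule is just a blanket disallow; a minimal sketch of such a file (the actual contents here are an assumption) could look like:

# Ask all compliant crawlers to stay out of the testing subdomain entirely
User-agent: *
Disallow: /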

Cheers

Sven

1 answer

  • dongtao4890 2014-07-01 10:13

    It depends on your server layout.

    Segregated subdomain

    If the subdomain has its own document root, it's enough to place an .htaccess file in the subdomain's document root and add the directives you specified:

    RewriteCond %{HTTP_USER_AGENT} (googlebot|bingbot|Baiduspider) [NC]
    RewriteRule .* - [R=403,L]
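
    For completeness, a minimal sketch of the whole .htaccess file for that case, assuming mod_rewrite is available and rewriting is not already enabled elsewhere:

    # Enable mod_rewrite processing for this directory
    RewriteEngine On

    # Send a 403 to the listed crawlers for every request on this subdomain
    RewriteCond %{HTTP_USER_AGENT} (googlebot|bingbot|Baiduspider) [NC]
    RewriteRule .* - [R=403,L]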
    

    Shared subdomain

    If the subdomain shares the same document root as the top-level domain, it's enough to add a RewriteCond for the host name to the rules above:

    RewriteCond %{HTTP_HOST} ^tools\.subdomain\.com$
    RewriteCond %{HTTP_USER_AGENT} (googlebot|bingbot|Baiduspider) [NC]
    RewriteRule .* - [R=403,L]
    

    Please note (1): the pattern ^tools\.subdomain\.com$ is needed to match the entire host name exactly; and since it is a regular expression, the dots must be escaped with a backslash.

    Please note (2): the syntax of the last RewriteCond may vary according to the bots you want to exclude.
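
    For example, to cover a few more common bots, the user-agent condition could be widened like this (the list is only an illustration; match it to the crawlers you actually see in your access logs):

    RewriteCond %{HTTP_USER_AGENT} (googlebot|bingbot|Baiduspider|yandex|duckduckbot) [NC]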

    This answer was accepted by the asker.
