dp411805872 2012-01-31 04:30

Allow search engines to index documents while restricting access to paid members. Is it possible?

I'm creating a website where the admin uploads documents that are available only to the site's paid members. However, I do want search engines to crawl and index the documents so that they appear in search results. The documents include DOC, DOCX and PDF files.

For example, I have a document that contains this text: "the quick brown fox jumped over the lazy dog". Now someone Googles "brown fox". Assuming my page ranks well enough, I would want the document to appear in the Google results. When the user clicks on it, I want the user to land on a page, instead of the document itself, that shows a preview of the text with a link to become a member and view the full document.

I planned to save the preview of the document into the database when the document is uploaded, so that it is easily visible and crawlable. For the full document, the only approach I could figure out is to allow the full document to be crawled. But I think that if I allow the search engine to crawl it, I'll be giving access to users as well. And if I use htaccess to keep the documents from being accessed directly, then I'm shutting the crawlers out too.
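The preview step described above could be sketched roughly as follows. This is a minimal illustration, assuming the document text has already been extracted to a plain string at upload time; the function name `make_preview` and the `max_words` parameter are hypothetical and not part of any existing system.

```python
def make_preview(full_text: str, max_words: int = 50) -> str:
    """Return the first `max_words` words of the text.

    An ellipsis is appended only when the text was actually truncated.
    """
    words = full_text.split()
    if len(words) <= max_words:
        return " ".join(words)
    return " ".join(words[:max_words]) + " ..."


if __name__ == "__main__":
    # The result is what would be stored in the database and shown
    # on the public landing page.
    text = "the quick brown fox jumped over the lazy dog"
    print(make_preview(text, max_words=4))
```

Only the preview ends up on the public page, so a search engine can match queries against the preview text but not against the rest of the document.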

I also considered extracting all of the document text and putting it in the database, but I read somewhere that it is very difficult to distinguish between a user and a spider, and that relying on user agents is a bad idea because they are very easy to spoof.

So I'm confused as to how I should go about this. Any help will be appreciated. Thank you in advance!

2 answers

  • dougu7546 2012-01-31 04:33

    No, not possible. Any user can pretend to be a search engine by changing their User-Agent.
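To illustrate why a User-Agent check cannot be trusted, here is a small sketch. The function `looks_like_googlebot` is a hypothetical example of a naive server-side check, and the URL is a placeholder; no request is actually sent.

```python
from urllib.request import Request


def looks_like_googlebot(user_agent: str) -> bool:
    """Naive check that simply trusts whatever the client sent."""
    return "googlebot" in user_agent.lower()


# A User-Agent string of the kind Googlebot sends.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

# Any client can attach the same header to its own request.
# The URL is a placeholder; the request is only constructed, never sent.
spoofed = Request(
    "https://example.com/docs/report.pdf",
    headers={"User-Agent": GOOGLEBOT_UA},
)

if __name__ == "__main__":
    # urllib stores header names capitalized as "User-agent".
    print(looks_like_googlebot(spoofed.get_header("User-agent")))
```

The naive check accepts the spoofed request, so gating the full document on the User-Agent header alone would hand it to anyone who copies that string.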

    You could use IP-address-based restrictions or heuristic-based detection, but you're likely to block legitimate crawlers by accident.
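One form of IP-based check is a reverse DNS lookup followed by a forward confirmation. The sketch below assumes the crawler's hostnames end in `googlebot.com` or `google.com`, as Google's published guidance describes; the function names and the injectable resolver parameters are hypothetical, added so the logic can be exercised without network access. It handles IPv4 only.

```python
import socket
from typing import Callable, List

# Assumption: hostname suffixes for one specific crawler.
# Every other crawler would be rejected by this list.
TRUSTED_SUFFIXES = (".googlebot.com", ".google.com")


def _reverse_lookup(ip: str) -> str:
    """Resolve an IP address to its primary hostname."""
    return socket.gethostbyaddr(ip)[0]


def _forward_lookup(hostname: str) -> List[str]:
    """Resolve a hostname to its list of IPv4 addresses."""
    return socket.gethostbyname_ex(hostname)[2]


def is_verified_crawler(
    ip: str,
    reverse: Callable[[str], str] = _reverse_lookup,
    forward: Callable[[str], List[str]] = _forward_lookup,
) -> bool:
    """Return True only if reverse DNS gives a trusted hostname
    and that hostname resolves back to the same IP."""
    try:
        hostname = reverse(ip)
        if not hostname.endswith(TRUSTED_SUFFIXES):
            return False
        return ip in forward(hostname)
    except OSError:
        # Covers socket.herror and socket.gaierror (lookup failures).
        return False
```

As the answer warns, a check like this rejects any crawler that is not on the list or whose DNS lookup fails, so legitimate crawlers can end up blocked.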

    Perhaps you should give users a number of free page views per day, or consider a different method of monetization.
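A daily free-view allowance could be sketched along these lines. This is only an in-memory illustration; the class name `DailyViewMeter`, the default limit, and the way a visitor is identified (for example by cookie or IP address) are all assumptions, and a real site would need persistent storage.

```python
from collections import defaultdict
from datetime import date
from typing import Optional


class DailyViewMeter:
    """Counts document views per visitor per calendar day."""

    def __init__(self, free_views_per_day: int = 3):
        self.limit = free_views_per_day
        # Keys are (visitor_id, day); values are views used that day.
        self._counts = defaultdict(int)

    def allow_view(self, visitor_id: str, today: Optional[date] = None) -> bool:
        """Record a view and return True if the visitor still has
        free views left today; otherwise return False."""
        key = (visitor_id, today or date.today())
        if self._counts[key] >= self.limit:
            return False
        self._counts[key] += 1
        return True


if __name__ == "__main__":
    meter = DailyViewMeter(free_views_per_day=2)
    for _ in range(3):
        print(meter.allow_view("visitor-123"))
```

Because crawlers and visitors are treated the same way here, there is no need to tell them apart; everyone gets the same limited number of full views per day.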

    This answer was selected by the asker as the best answer.