duanjiaonie6097 2013-05-11 05:48

How can I use wget to count the number of pages on a website?

Prospective clients often ask us to redesign their websites. When I ask, "How many pages does your website have?", the answer I too often get is, "I don't know." Counting would be simple for a 10-page website, but these are larger websites with perhaps hundreds of pages.

Is there a way to count all the pages on a website without doing it manually? I know wget can be used to download pages from a website, but I don't want to download all of their pages. Besides, that would just give me a collection of files, not a page count.

Is wget the solution to this? If so, how could it be used to count the pages of a website? If not wget, is there another solution that would work? Remember, I don't have internal access to their website, so the count has to be done from the web. Or does counting the internal links amount to counting the pages?


1 answer

  • doujiabing1228 2013-05-11 05:55

    You can easily count the pages yourself if the website is static or small.

    If the website is very large, like StackOverflow, you can use Google's index instead.

    Go to Google and search for: site:stackoverflow.com

    The results page reports the number of indexed pages: About 17,000,000 results

    You can put site: in front of any domain, and Google will show roughly how many of that website's pages it has indexed.
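
For a small or mostly static website, the count can also be approximated from the outside by following internal links. The sketch below is one possible way to do that with only the Python standard library, not a definitive tool. The names `LinkCollector`, `fetch_html`, `count_pages` and the `max_pages` cap are made up for this illustration. It assumes that a "page" means a distinct same-host URL (ignoring `#fragment` parts) that returns HTML, and it does not check robots.txt or find pages that nothing links to.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urldefrag, urljoin, urlparse
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def fetch_html(url):
    """Return the body as text, or None if the URL fails or is not HTML."""
    try:
        with urlopen(url, timeout=10) as response:
            if response.headers.get_content_type() != "text/html":
                return None
            charset = response.headers.get_content_charset() or "utf-8"
            return response.read().decode(charset, errors="replace")
    except (OSError, ValueError, LookupError):
        return None


def count_pages(start_url, fetch=fetch_html, max_pages=1000):
    """Breadth-first crawl; return the set of same-host HTML pages found.

    `fetch` is injectable so the crawl logic can be tried without a network.
    `max_pages` is an assumed safety cap, not a property of any website.
    """
    host = urlparse(start_url).netloc
    start = urldefrag(start_url).url
    seen = {start}
    queue = deque([start])
    pages = set()
    while queue and len(pages) < max_pages:
        url = queue.popleft()
        html = fetch(url)
        if html is None:
            continue  # broken link or non-HTML resource: not counted
        pages.add(url)
        parser = LinkCollector()
        parser.feed(html)
        for href in parser.hrefs:
            # Resolve relative links and drop the #fragment part.
            link = urldefrag(urljoin(url, href)).url
            parts = urlparse(link)
            same_site = parts.scheme in ("http", "https") and parts.netloc == host
            if same_site and link not in seen:
                seen.add(link)
                queue.append(link)
    return pages
```

Calling `len(count_pages("https://example.com/"))` with the client's home page URL in place of the placeholder gives the count. URLs that differ only in their query string are treated as separate pages here, so a website with many filtered or paginated views may be overcounted.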

    This answer was selected by the asker as the best answer.
