Python's requests.get() does not return the correct page source

res = requests.get('https://image.baidu.com/search/index?tn=baiduimage&ps=1&ct=201326592&lm=-1&cl=2&nc=1&ie=utf-8&word=%E5%B0%8F%E7%8B%97')

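I want to use this line of code to fetch the Baidu image search results for "小狗" (puppy), but the HTML source I get back is not the correct one. Why is that?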
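As a quick sanity check, the URL from the question can be decoded with the standard library to confirm that it really asks for the search term "小狗". This is only a sketch for inspecting the query string; it sends no request, and the variable names are chosen here for illustration.

```python
from urllib.parse import urlsplit, parse_qs

# URL copied from the question, split across two literals for readability.
url = ("https://image.baidu.com/search/index?tn=baiduimage&ps=1&ct=201326592"
       "&lm=-1&cl=2&nc=1&ie=utf-8&word=%E5%B0%8F%E7%8B%97")

parts = urlsplit(url)
query = parse_qs(parts.query)  # percent-decodes values as UTF-8 by default

print(parts.netloc)      # image.baidu.com
print(query["word"][0])  # 小狗
```

The URL itself is well formed, so the problem is more likely in how the request is sent than in what it asks for.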


2 answers

CSDN专家-HGJ 2021-06-21 15:06


You need to add request headers, such as a browser-like User-Agent:

import requests

url = 'https://image.baidu.com/search/index?tn=baiduimage&ps=1&ct=201326592&lm=-1&cl=2&nc=1&ie=utf-8&word=%E5%B0%8F%E7%8B%97'
headers = {
    # Browser-like headers, so the request does not identify itself as python-requests.
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36',
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.9',
    # 'br' is left out: requests only decodes Brotli responses when a Brotli package is installed.
    'Accept-Encoding': 'gzip, deflate',
}
res = requests.get(url, headers=headers)
res.encoding = 'utf-8'  # decode the body as UTF-8
print(res.text)


This answer was selected by the asker as the best answer.
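To see what the headers fix actually changes, the sketch below prepares the same request twice without sending it and compares the User-Agent that would go out. It assumes the `requests` package is installed; the helper name `prepare` and the constants are made up for this illustration. Whether Baidu really branches on the User-Agent is not confirmed by the thread, only that adding headers resolved the problem for the asker.

```python
import requests

URL = ("https://image.baidu.com/search/index?tn=baiduimage&ps=1&ct=201326592"
       "&lm=-1&cl=2&nc=1&ie=utf-8&word=%E5%B0%8F%E7%8B%97")

# User-Agent string taken from the accepted answer.
BROWSER_HEADERS = {
    "User-Agent": ("Mozilla/5.0 (Windows NT 6.3; WOW64) AppleWebKit/537.36 "
                   "(KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36"),
}


def prepare(headers=None):
    """Build the request as a Session would send it, without any network I/O."""
    session = requests.Session()
    request = requests.Request("GET", URL, headers=headers)
    # prepare_request merges the session's default headers with the given ones.
    return session.prepare_request(request)


default = prepare()
custom = prepare(BROWSER_HEADERS)

print(default.headers["User-Agent"])  # starts with "python-requests/"
print(custom.headers["User-Agent"])   # the browser-like string above
```

Without explicit headers, the request announces itself as `python-requests/<version>`, which makes it easy for a server to tell a script from a browser and serve it a different page.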

