星光斑斓里放歌 2023-01-29 22:36 采纳率: 85%
浏览 36

使用python的requests库爬取百度中的数据显示乱码

使用python的requests库爬取百度中的数据

使用response.encoding = 'utf8'转换后

response.text仍然乱码

结果:

img

代码如下:


```python

import urllib.request
import urllib.parse
import requests
from bs4 import BeautifulSoup

# 模拟百度搜索
def baiduAPI(params):
    headers = {
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/104.0.0.0 Safari/537.36",
        "Cookie": "HMACCOUNT_BFESS=BD73DB1D35AF871A; BDUSS_BFESS=VJJdkswYnVrT0JHUmFGTFRpTWhUUn5wLVlLbkJvTzA1WTBDcnZwQTU2UU9pYlZqRVFBQUFBJCQAAAAAAAAAAAEAAABcJyX5wO7KzcH6NwAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA78jWMO~I1jc; BAIDUID_BFESS=2B61874755F437B5EC6BEF21B3134DF7:FG=1; ZFY=K9QfSyPCgzUKQk2s8CZ4eAp9owXYz:BJRF7ftEin2pJ8:C; H_PS_PSSID=36543_37552_38092_38052_37990_37796_36803_37930_38088_38041_26350_38009_37881; ab_sr=1.0.1_NzYyZWM1ZDU3OGQ2NGM2ODMxYmQ3MTZmNjE4YWI0OGY2NzIzYWY2YzQ3MjI3YTg5M2U2ODljYWFjZWY2NTg0ODE5NWY2YjhiZDk2YTQ2ZjU0NzZhM2MyZmY2YzFlOGM0YWRmZGU3ZDNkN2VjZjY4MjYwYzQ5MzNhYWQzOTdkYTExOWZkMDhhZTRlMzI0MWZkZGJkNDU3ZTk5YjY2N2ZhZDM3NjAwNDZhMGMxNjg4N2U4N2UwMzU4MTg3ZTFhNTY3",
        "Accept": "image/avif,image/webp,image/apng,image/svg+xml,image/*,*/*;q=0.8",
        "Accept-Encoding": "gzip, deflate, br",
        'Accept-Language': 'zh-CN,zh;q=0.9'
    }

    url = "https://www.baidu.com/s?" + params
    response = requests.get(url, headers=headers)
    response.encoding = 'utf8'

    print(response.text)

    #with open("zhongguo.html",'wb') as f:
    #    f.write(response.text)
    return response


if __name__ == "__main__":
    wd = {"wd": "中国"}
    params = urllib.parse.urlencode(wd)
    print(params)  # 'wd=aa'
    response = baiduAPI(params)
    # print(response.content)

```

  • 写回答

3条回答 默认 最新

  • Halifax ‎ 2023-01-29 23:11
    关注

    望采纳!!!

    设置一下编码集即可。

    response = requests.get(url=url,headers=headers)
    content = response.content.decode('utf8')
    print(content)
    
    评论 编辑记录

报告相同问题?

问题事件

  • 创建了问题 1月29日

悬赏问题

  • ¥15 ue5 .3之前好好的现在只要是激活关卡就会崩溃
  • ¥50 MATLAB实现圆柱体容器内球形颗粒堆积
  • ¥15 python如何将动态的多个子列表,拼接后进行集合的交集
  • ¥20 vitis-ai量化基于pytorch框架下的yolov5模型
  • ¥15 如何实现H5在QQ平台上的二次分享卡片效果?
  • ¥15 python爬取bilibili校园招聘网站
  • ¥30 求解达问题(有红包)
  • ¥15 请解包一个pak文件
  • ¥15 不同系统编译兼容问题
  • ¥100 三相直流充电模块对数字电源芯片在物理上它必须具备哪些功能和性能?