爬虫请求服务器被拒,代码412
编写爬虫程序向B站发送requests请求,想爬取B站弹幕数据,请求被拒,返回代码412。
希望有码神出来指点一下刚学习爬虫 ^.^
import requests
import re
# url = 'https://comment.bilibili.com/1473879133.xml'
url = 'https://api.bilibili.com/x/v1/dm/list.so?oid=1473879133'
headers = \
{'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/123.0.0.0 Safari/537.36'}
response = requests.get(url, headers)
response.encoding = 'utf-8'
print(response.text)
# {"code":-412,"message":"request was banned","ttl":1}
print(response.status_code)
# 412