HeartLikesstars 2022-03-30 21:34 采纳率: 100%
浏览 295
已结题

爬虫实战-豆瓣Top250爬取实战 ip被禁

问题遇到的现象和发生背景

实战练习导致IP被禁,尝试代理ip无法解决

问题相关代码,请勿粘贴截图 #使用代理
    proxies = [{'https':'115.75.5.17:38351'},{'https':'14.215.212.37:9168'}]
    proxy = random.choice(proxies)
    print(proxy)
    resp = requests.get(url, headers=headers,proxies=proxy)

运行结果及报错内容
{'https': '115.75.5.17:38351'}
Traceback (most recent call last):
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\connection.py", line 174, in _new_conn
    conn = connection.create_connection(
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\util\connection.py", line 95, in create_connection
    raise err
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\util\connection.py", line 85, in create_connection
    sock.connect(sa)
TimeoutError: [WinError 10060] 由于连接方在一段时间后没有正确答复或连接的主机没有反应,连接尝试失败。

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\connectionpool.py", line 700, in urlopen
    self._prepare_proxy(conn)
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\connectionpool.py", line 994, in _prepare_proxy
    conn.connect()
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\connection.py", line 358, in connect
    self.sock = conn = self._new_conn()
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\connection.py", line 179, in _new_conn
    raise ConnectTimeoutError(
urllib3.exceptions.ConnectTimeoutError: (<urllib3.connection.HTTPSConnection object at 0x000002712D2D5A20>, 'Connection to 115.75.5.17 timed out. (connect timeout=None)')

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\requests\adapters.py", line 440, in send
    resp = conn.urlopen(
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\connectionpool.py", line 785, in urlopen
    retries = retries.increment(
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\urllib3\util\retry.py", line 592, in increment
    raise MaxRetryError(_pool, url, error or ResponseError(cause))
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='movie.douban.com', port=443): Max retries exceeded with url: /top250?start=0&filter= (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x000002712D2D5A20>, 'Connection to 115.75.5.17 timed out. (connect timeout=None)'))

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "E:\Users\18081\PycharmProjects\数据解析\实战演练——爬取豆瓣TOP250.py", line 66, in <module>
    main()
  File "E:\Users\18081\PycharmProjects\数据解析\实战演练——爬取豆瓣TOP250.py", line 61, in main
    detail_urls = get_detail_uel(url)
  File "E:\Users\18081\PycharmProjects\数据解析\实战演练——爬取豆瓣TOP250.py", line 15, in get_detail_uel
    resp = requests.get(url, headers=headers,proxies=proxy)#headers=headers)
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\requests\api.py", line 75, in get
    return request('get', url, params=params, **kwargs)
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\requests\api.py", line 61, in request
    return session.request(method=method, url=url, **kwargs)
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\requests\sessions.py", line 529, in request
    resp = self.send(prep, **send_kwargs)
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\requests\sessions.py", line 645, in send
    r = adapter.send(request, **kwargs)
  File "C:\Users\18081\AppData\Local\Programs\Python\Python310\lib\site-packages\requests\adapters.py", line 507, in send
    raise ConnectTimeout(e, request=request)
requests.exceptions.ConnectTimeout: HTTPSConnectionPool(host='movie.douban.com', port=443): Max retries exceeded with url: /top250?start=0&filter= (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x000002712D2D5A20>, 'Connection to 115.75.5.17 timed out. (connect timeout=None)'))

我的解答思路和尝试过的方法

使用开放代理ip的方法,但是一直报错

我想要达到的结果

使用代理IP进行实战课程演练

  • 写回答

1条回答 默认 最新

  • RE_ABANDON 2022-03-31 14:01
    关注

    随便找的免费代理ip质量不行,十个未必能有一个有用的,需要花钱买

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

问题事件

  • 系统已结题 4月29日
  • 已采纳回答 4月21日
  • 创建了问题 3月30日

悬赏问题

  • ¥15 metadata提取的PDF元数据,如何转换为一个Excel
  • ¥15 关于arduino编程toCharArray()函数的使用
  • ¥100 vc++混合CEF采用CLR方式编译报错
  • ¥15 coze 的插件输入飞书多维表格 app_token 后一直显示错误,如何解决?
  • ¥15 vite+vue3+plyr播放本地public文件夹下视频无法加载
  • ¥15 c#逐行读取txt文本,但是每一行里面数据之间空格数量不同
  • ¥50 如何openEuler 22.03上安装配置drbd
  • ¥20 ING91680C BLE5.3 芯片怎么实现串口收发数据
  • ¥15 无线连接树莓派,无法执行update,如何解决?(相关搜索:软件下载)
  • ¥15 Windows11, backspace, enter, space键失灵