lzyzsere 2016-04-21 16:33 采纳率: 0%
浏览 2794

python3 爬取https指向的图片链接 问题

初学python爬虫部分,昨天动手做了做爬取网页图片并且下载的小程序。
发现网页中有的图片可以被下载并且保存在本地,有的却不能。
查看了下网页代码原来可以被爬取的是

而不能被爬取的是
用urllib包好像处理不了

请问如何解决

附上程序报错提示
Traceback (most recent call last):
File "D:\Python34\lib\urllib\request.py", line 1182, in do_open
h.request(req.get_method(), req.selector, req.data, headers)
File "D:\Python34\lib\http\client.py", line 1088, in request
self._send_request(method, url, body, headers)
File "D:\Python34\lib\http\client.py", line 1126, in send_request
self.endheaders(body)
File "D:\Python34\lib\http\client.py", line 1084, in endheaders
self._send_output(message_body)
File "D:\Python34\lib\http\client.py", line 922, in _send_output
self.send(msg)
File "D:\Python34\lib\http\client.py", line 857, in send
self.connect()
File "D:\Python34\lib\http\client.py", line 1231, in connect
server_hostname=server_hostname)
File "D:\Python34\lib\ssl.py", line 365, in wrap_socket
_context=self)
File "D:\Python34\lib\ssl.py", line 583, in __init
_
self.do_handshake()
File "D:\Python34\lib\ssl.py", line 810, in do_handshake
self._sslobj.do_handshake()
ssl.SSLError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:600)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "D:\py_practice\pachong_rumen_1024.py", line 45, in
bytes = urllib.request.urlopen(url);
File "D:\Python34\lib\urllib\request.py", line 161, in urlopen
return opener.open(url, data, timeout)
File "D:\Python34\lib\urllib\request.py", line 463, in open
response = self._open(req, data)
File "D:\Python34\lib\urllib\request.py", line 481, in _open
'_open', req)
File "D:\Python34\lib\urllib\request.py", line 441, in _call_chain
result = func(*args)
File "D:\Python34\lib\urllib\request.py", line 1225, in https_open
context=self._context, check_hostname=self._check_hostname)
File "D:\Python34\lib\urllib\request.py", line 1184, in do_open
raise URLError(err)
urllib.error.URLError:

  • 写回答

3条回答

  • lzyzsere 2016-04-21 16:37
    关注

    ```可以被爬取的是
    不可以被爬取的是

    
    
    
    
    评论

报告相同问题?

悬赏问题

  • ¥20 腾讯企业邮箱邮件可以恢复么
  • ¥15 有人知道怎么将自己的迁移策略布到edgecloudsim上使用吗?
  • ¥15 错误 LNK2001 无法解析的外部符号
  • ¥50 安装pyaudiokits失败
  • ¥15 计组这些题应该咋做呀
  • ¥60 更换迈创SOL6M4AE卡的时候,驱动要重新装才能使用,怎么解决?
  • ¥15 让node服务器有自动加载文件的功能
  • ¥15 jmeter脚本回放有的是对的有的是错的
  • ¥15 r语言蛋白组学相关问题
  • ¥15 Python时间序列如何拟合疏系数模型