2 lzyzsere lzyzsere 于 2016.04.22 00:33 提问

python3 爬取https指向的图片链接 问题

初学python爬虫部分,昨天动手做了做爬取网页图片并且下载的小程序。
发现网页中有的图片可以被下载并且保存在本地,有的却不能。
查看了下网页代码原来可以被爬取的是

而不能被爬取的是
用urllib包好像处理不了

请问如何解决

附上程序报错提示
Traceback (most recent call last):
File "D:\Python34\lib\urllib\request.py", line 1182, in do_open
h.request(req.get_method(), req.selector, req.data, headers)
File "D:\Python34\lib\http\client.py", line 1088, in request
self._send_request(method, url, body, headers)
File "D:\Python34\lib\http\client.py", line 1126, in send_request
self.endheaders(body)
File "D:\Python34\lib\http\client.py", line 1084, in endheaders
self._send_output(message_body)
File "D:\Python34\lib\http\client.py", line 922, in _send_output
self.send(msg)
File "D:\Python34\lib\http\client.py", line 857, in send
self.connect()
File "D:\Python34\lib\http\client.py", line 1231, in connect
server_hostname=server_hostname)
File "D:\Python34\lib\ssl.py", line 365, in wrap_socket
_context=self)
File "D:\Python34\lib\ssl.py", line 583, in __init
_
self.do_handshake()
File "D:\Python34\lib\ssl.py", line 810, in do_handshake
self._sslobj.do_handshake()
ssl.SSLError: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:600)

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
File "D:\py_practice\pachong_rumen_1024.py", line 45, in
bytes = urllib.request.urlopen(url);
File "D:\Python34\lib\urllib\request.py", line 161, in urlopen
return opener.open(url, data, timeout)
File "D:\Python34\lib\urllib\request.py", line 463, in open
response = self._open(req, data)
File "D:\Python34\lib\urllib\request.py", line 481, in _open
'_open', req)
File "D:\Python34\lib\urllib\request.py", line 441, in _call_chain
result = func(*args)
File "D:\Python34\lib\urllib\request.py", line 1225, in https_open
context=self._context, check_hostname=self._check_hostname)
File "D:\Python34\lib\urllib\request.py", line 1184, in do_open
raise URLError(err)
urllib.error.URLError:

4个回答

lzyzsere
lzyzsere   2016.04.22 00:37

```可以被爬取的是
不可以被爬取的是




CSDNXIAOD
CSDNXIAOD   2016.04.22 00:42

Python3爬取图片
----------------------biu~biu~biu~~~在下问答机器人小D,这是我依靠自己的聪明才智给出的答案,如果不正确,你来咬我啊!

oyljerry
oyljerry   Ds   Rxr 2016.04.22 11:26

图片的网址对比一下是否有问题,两者图片地址用浏览器分别打开试试

qq_23214785
qq_23214785   2016.05.25 11:32

写爬虫,用神箭手云爬虫吧,比其他爬虫框架快多了,官方客服mm很热情

Csdn user default icon
上传中...
上传图片
插入图片
准确详细的回答,更有利于被提问者采纳,从而获得C币。复制、灌水、广告等回答会被删除,是时候展现真正的技术了!