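Looking for help with the full workflow of an XPath-based scraper. The wallpaper sites I found before all had //img elements, so I could use this code directly: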
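import requests
from lxml import etree
from urllib import request

# u holds the wallpaper page URL (assumed to be defined earlier)
result = requests.get(u)
data = etree.HTML(result.content)
# '//img' matches <img> elements anywhere in the page; '/img' matches nothing
d = data.xpath('//img')
cnt = 1
for i in d:
    newurl = i.xpath('./@src')[0]
    # File name prefix: '科幻朋克壁纸' means "sci-fi punk wallpaper"
    newname = '科幻朋克壁纸' + str(cnt)
    # The src values on those sites were protocol-relative ('//host/path')
    newurl = 'https:' + str(newurl)
    print(newurl)
    request.urlretrieve(newurl, r'D:\pictures\科幻朋克壁纸' + newname + '.jpg')
    print('<%s> downloaded' % newname)
    cnt += 1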
But when I tried to scrape the page below, I found that it has no img elements. How can I solve this? Any pointers would be appreciated.
https://www.360kuai.com/pc/huabao?rawurl=http%3A%2F%2Fhuabao.360.cn%2F324822&tj_url=983b3c7c28db4456f&nsid=08e323dfa268ac2381cee5ea90b18279&scene=1&refer_scene=0_dhsrc:lm&sign=360_36e79513&uid=234898968.614991739000600700.1685096948516.9038&djsource=XsgsZg&tmprtp=expf%3Dcomindexo2043%2Copenscreenv3%2Ccube_gw%2Csearch_beta%2Ccube_new2021%26reqid%3D08e323dfa268ac2381cee5ea90b18279&hsid=70b9f698df143cf4
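
One direction that might be worth trying, as a rough sketch only: when a downloaded page has no img elements, the image URLs are sometimes still present in the raw text, for example inside an inline script or JSON blob, where slashes may be escaped as "\/". I have not confirmed that this particular page works that way. The helper name find_image_urls, the regular expression, and the list of extensions below are all assumptions made for illustration.

```python
import re

# Assumed pattern: absolute or protocol-relative URLs that end in a
# common image extension. Adjust the extension list as needed.
IMG_URL_RE = re.compile(
    r'(?:https?:)?//[^\s"\'<>\\]+?\.(?:jpg|jpeg|png|webp|gif)',
    re.IGNORECASE,
)


def find_image_urls(html_text):
    """Return image URLs found anywhere in the raw page text, in order, without duplicates."""
    # Inline JSON often escapes slashes as "\/"; undo that before matching.
    unescaped = html_text.replace('\\/', '/')
    seen = set()
    urls = []
    for url in IMG_URL_RE.findall(unescaped):
        # Turn protocol-relative URLs ('//host/path') into https URLs.
        if url.startswith('//'):
            url = 'https:' + url
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls


# Possible usage with the page text (u is the page URL, as in the code above):
# html = requests.get(u).text
# for cnt, url in enumerate(find_image_urls(html), start=1):
#     print(cnt, url)
```

If this returns an empty list, the pictures are probably not in the initial HTML at all and are fetched by a separate request after the page loads. In that case the browser's developer tools (Network tab) may show which request returns them.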