muddyoussy 2023-04-15 19:21 采纳率: 0%
浏览 29

爬虫爬取报错,报出一大堆file:,一个下午没看出问题来,请各位告知报的错什么意思

各位好,我不知道为什么会报错,请帮帮忙谢谢,就是粗体那段,不知道为什么

import requests
from bs4 import BeautifulSoup
import os

url='https://pic.netbian.com/4kmeinv/'
resp=requests.get(url)
resp.encoding='gbk'
print(resp.status_code)
html=resp.text
  #print(html)

soup=BeautifulSoup(html,'html.parser')
imgs=soup.find_all('img')
for img in imgs:
    src=img['src']
    if '/uploads/'not in src:
        continue
    src=f'https://pic.netbian.com{src}'
    print(src)


filename=os.path.basename(src)
with open(f'图片/{filename}','wb')as f:
    resp_img=requests.get(src)
    f.write(resp_img.content)


C:\Users\ASUS\PycharmProjects\pythonProject\venv\Scripts\python.exe C:\Users\ASUS\PycharmProjects\pythonProject\venv\爬取美女图片\爬取单页图片.py 
200
https://pic.netbian.com/uploads/allimg/230414/113810-16814434903cb8.jpg
https://pic.netbian.com/uploads/allimg/230412/004747-16812316675904.jpg
https://pic.netbian.com/uploads/allimg/221216/095854-1671155934fe32.jpg
https://pic.netbian.com/uploads/allimg/230224/131035-1677215435a04e.jpg
https://pic.netbian.com/uploads/allimg/230405/000245-16806241657d31.jpg
https://pic.netbian.com/uploads/allimg/230414/011844-16814063241a9d.jpg
https://pic.netbian.com/uploads/allimg/230409/001253-168097037325bd.jpg
https://pic.netbian.com/uploads/allimg/230209/002530-1675873530bece.jpg
https://pic.netbian.com/uploads/allimg/230329/160731-1680077251990e.jpg
https://pic.netbian.com/uploads/allimg/210831/102129-16303764895142.jpg
https://pic.netbian.com/uploads/allimg/230311/002822-16784657029f31.jpg
https://pic.netbian.com/uploads/allimg/230327/195510-1679918110f654.jpg
https://pic.netbian.com/uploads/allimg/230318/000902-1679069342ab7c.jpg
https://pic.netbian.com/uploads/allimg/230331/234835-16802777159ca8.jpg
https://pic.netbian.com/uploads/allimg/230325/002032-167967483278ba.jpg
https://pic.netbian.com/uploads/allimg/230313/232601-1678721161bf92.jpg
https://pic.netbian.com/uploads/allimg/230322/235707-16795006270c8e.jpg
https://pic.netbian.com/uploads/allimg/230302/004645-16776892053278.jpg
https://pic.netbian.com/uploads/allimg/230331/235935-1680278375a731.jpg
https://pic.netbian.com/uploads/allimg/230309/002455-1678292695339b.jpg
**Traceback (most recent call last):
  File "C:\Users\ASUS\PycharmProjects\pythonProject\venv\爬取美女图片\爬取单页图片.py", line 24, in <module>
    resp_img=requests.get(src)
             ^^^^^^^^^^^^^^^^^
  File "C:\Users\ASUS\PycharmProjects\pythonProject\venv\Lib\site-packages\requests\api.py", line 73, in get
    return request("get", url, params=params, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\ASUS\PycharmProjects\pythonProject\venv\Lib\site-packages\requests\api.py", line 59, in request
    return session.request(method=method, url=url, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\ASUS\PycharmProjects\pythonProject\venv\Lib\site-packages\requests\sessions.py", line 573, in request
    prep = self.prepare_request(req)
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\ASUS\PycharmProjects\pythonProject\venv\Lib\site-packages\requests\sessions.py", line 484, in prepare_request
    p.prepare(
  File "C:\Users\ASUS\PycharmProjects\pythonProject\venv\Lib\site-packages\requests\models.py", line 368, in prepare
    self.prepare_url(url, params)
  File "C:\Users\ASUS\PycharmProjects\pythonProject\venv\Lib\site-packages\requests\models.py", line 439, in prepare_url
    raise MissingSchema(
requests.exceptions.MissingSchema: Invalid URL '/static/img/qqonline.jpg': No scheme supplied. Perhaps you meant https:///static/img/qqonline.jpg?
**
进程已结束,退出代码1

```

  • 写回答

2条回答 默认 最新

  • 清忖灬 2023-04-15 21:51
    关注

    看样子是前面几张图片能正常爬,到有一张时爬不了,可以看看这张图片的结构是否和其他的图片结构一致,一般是特定的某张图片结构不同导致的

    评论

报告相同问题?

问题事件

  • 创建了问题 4月15日

悬赏问题

  • ¥100 复现论文:matlab仿真代码编写
  • ¥15 esp32驱动GC9A01循环播放视频
  • ¥15 惠普360g9的最新bios
  • ¥30 这个功能用什么软件发合适?
  • ¥60 微信小程序,取消订单,偶尔订单没有改变状态
  • ¥15 用pytorch实现PPO算法
  • ¥15 关于调制信号的星座图?
  • ¥30 前端传参时,后端接收不到参数
  • ¥15 这是有什么问题吗,我检查许可证了但是显示有呢
  • ¥15 机器学习预测遇到的目标函数问题