python3爬虫no host given错误

# -*- codeing = utf-8 -*-
# @Time : 2021/1/22 13:08
# @Author : 贾维斯
# @File : spider.py
# @software : PyCharm

from bs4 import BeautifulSoup                              #网页解析，获取数据
import re                               #正则表达式，进行文字匹配
import urllib.request,urllib.error      #指定URL，获取网页数据
import xlwt                             #进行excel操作
import sqlite3                          #进行SQlite数据库操作

def main():
    baseurl = "https: //movie.douban.com/top250?start="
    #1.爬取网页
    datalist = getDate(baseurl)
    savepath = ".\\豆瓣电影TOP250.xls"            #或者 r“.\”
    #3.保存数据
    # saveData(savepath)

    askURL("https: //movie.douban.com/top250?start=0")



#爬取网页
def getDate(baseurl):
    datalist = []
    # 2.逐一解析数据
    return datalist

def askURL(url):
    # 模拟浏览器头部信息，向豆瓣服务器发送消息
    # 用户代理，告诉豆瓣服务器，我们是什么类型的机器，浏览器（本质上是告诉浏览器，我们可以接受什么水平的文件内容）
    head = {"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:84.0) Gecko/20100101 Firefox/84.0"}
    # head["U"]     #多个用数组
    request = urllib.request.Request(url=url,headers=head)
    html = ""
    try:
        response = urllib.request.urlopen(request)
        html = response.read().decode("utf-8")
        print(html)
    except urllib.error.URLError as e:
        if hasattr(e,"code"):       #判断e这个对象里是否包含code这个属性
            print(e.code)
            print("code")
        if hasattr(e,"reason"):
            print(e.reason)
            print("reason")

    #return html








#保存数据
def saveData(savepath):
    print("save...")




if __name__ == "__main__":          #当程序执行时
    #调用函数
    main()

出现结果：

这个怎么回事呀

写回答
好问题 0 提建议
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

4条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
腾v 2021-01-24 17:50
关注
我又把这个代码复制回去试了下问题在于你说的那个链接那块 https：//后面多了个空格感谢

解决 1
无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

PYTHON 3 爬虫出现<urlopen error no host given> 问题
2016-09-20 17:33

正版RX-0的博客 python3爬虫手把手教 python官方手册
Python - 爬虫：解析 email （一）
2023-11-09 16:47

yfdxb_u的博客 Python - 爬虫：解析 email
python 爬虫图片打不开_python爬虫抓取图片终端报错 <urlopen error no hsot given> 是什么原因？...
2021-02-09 12:57

教室君的博客 python利用urllib爬虫，图片获取二十几张后就报错python版本3.6 windows系统下运行urllib.error.URLError:代码如下：#!/usr/bin/python# -*- coding:utf-8 -*-import urllibimport requestsimport refrom bs4 import...
Python爬虫
2021-09-30 12:41

_森罗万象的博客 Python爬虫
open python error_python爬虫爬图，报错<urlopen error no hsot given>.
2020-12-06 12:01

weixin_39533307的博客 ......改进了代码.写入了try.虽然还是会报错........./usr/bin/python# -*- coding:utf-8 -*-import urllibimport requestsimport refrom bs4 import BeautifulSoupimport csvimport socketsocket.setdefaul...
python 网络爬虫
2023-06-11 09:50

圆弧创意的博客 python 网络爬虫，爬取文字、图片、音频等综合信息。
淘宝代写 python_python爬虫代写代做python爬虫
2021-01-30 20:35

别列夫的博客 ABOUTThis is the base implementation of a full crawler that uses a spacetime cache server to receive requests.CONFIGURATIONStep 1: Install dependenciesIf you do not have Python 3.6+:Check if pip is in...
python爬虫报告_python爬虫分析报告
2020-11-22 15:56

weixin_39534395的博客在python课上布置的作业，第一次进行爬虫，走了很多弯路，也学习到了很多知识，借此记录。1. 获取学堂在线合作院校页面要求：1.确定目标打开页面，通过查看网页源代码并没有相关内容。可以猜测具体数据由前端通过...
python程序使用代理IP，出现407错误如何解决
2022-12-14 13:13

亿牛云爬虫专家的博客 python使用代理IP，出现 407 错误响应的响应处理。
Python中的爬取缓存
2022-06-16 14:48

小陈步吃人的博客缓存机制，可以帮助我们抓取相同数据时效率提高好几倍，但并不是所有的爬虫项目都需要构建缓存机制，这一节，讲解缓存机制的使用场景，以及磁盘缓存和数据库缓存。
没有解决我的问题, 去提问

python3爬虫no host given错误

4条回答 默认 最新

4条回答默认最新