weixin_40825517
罗家马德里球迷
Acceptance rate: 22.2%
2018-11-25 09:05 · 2.3k views

Could someone tell me where my Python crawler code goes wrong? (Task: scrape the listings on the Maoyan Movies Top-100 board.)

The code is as follows:

import requests
from requests.exceptions import RequestException
import time
from bs4 import BeautifulSoup

def get_one_page(url):
    try:
        headers={'User-Agent':'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36'}
        response = requests.get(url,headers=headers)
        if response.status_code==200:
            return response.text
        return None
    except RequestException:
        return None

def page(offset):
    url='http://maoyan.com/board/6?offset='+str(offset)
    return url

for j in range(10):
    html_doc = get_one_page(page(j*10))
    soup = BeautifulSoup(html_doc,'lxml')
    i = 1
    for dd in soup.select("dd"):
        print(dd.find("i","board-index board-index-"+str(i+j*10)).get_text()
              +dd.find("p","name").get_text()
              +dd.find("p","star").get_text().strip()
              +dd.find("p","releasetime").string
              +dd.find("p","score").get_text()+'\n')
        i = i + 1
    time.sleep(1)

Running it produces the following traceback:

 Traceback (most recent call last):

  File "<ipython-input-8-95f75b1c7bd0>", line 1, in <module>
    runfile('H:/程序语言学习用文件夹/Spider/beautifulSoup.py', wdir='H:/程序语言学习用文件夹/Spider')

  File "C:\Users\pc1\Anaconda3\lib\site-packages\spyder\utils\site\sitecustomize.py", line 705, in runfile
    execfile(filename, namespace)

  File "C:\Users\pc1\Anaconda3\lib\site-packages\spyder\utils\site\sitecustomize.py", line 102, in execfile
    exec(compile(f.read(), filename, 'exec'), namespace)

  File "H:/程序语言学习用文件夹/Spider/beautifulSoup.py", line 29, in <module>
    soup = BeautifulSoup(html_doc,'lxml')

  File "C:\Users\pc1\Anaconda3\lib\site-packages\bs4\__init__.py", line 192, in __init__
    elif len(markup) <= 256 and (

TypeError: object of type 'NoneType' has no len()

1 answer

  • weixin_39416561 lyhsdy 2018-11-26 03:05

    I couldn't reproduce your error on my side, so 'lxml' is probably not installed correctly. Try print(soup) and see what it shows, or switch to
    soup = BeautifulSoup(html_doc,'html.parser')

    In my test it was dd.find("p","score").get_text() that failed, because no matching element was found. I'd suggest checking how that field is fetched, or removing it.
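    For reference, here is a minimal sketch of the same crawler that guards against get_one_page() returning None, which is exactly what triggers the "TypeError: object of type 'NoneType' has no len()" in the traceback (BeautifulSoup is handed None instead of HTML). Two assumptions of mine: the Top-100 board lives at /board/4 (the question's code requests /board/6), and Maoyan's markup still uses the class names from the question; parse_page and crawl_top100 are illustrative helper names, not part of the original code.

    ```python
    import time

    import requests
    from requests.exceptions import RequestException
    from bs4 import BeautifulSoup

    HEADERS = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 '
                             '(KHTML, like Gecko) Chrome/63.0.3239.132 Safari/537.36'}

    def get_one_page(url):
        """Fetch one page; return its HTML, or None on a non-200 status or network error."""
        try:
            response = requests.get(url, headers=HEADERS)
            if response.status_code == 200:
                return response.text
            return None
        except RequestException:
            return None

    def parse_page(html):
        """Pull index/name/star/releasetime/score out of each <dd> entry."""
        soup = BeautifulSoup(html, 'html.parser')  # stdlib parser, no lxml dependency
        items = []
        for dd in soup.select('dd'):
            items.append({
                # find('i', 'board-index') matches class="board-index board-index-N"
                # for any N, so there is no need to compute the rank by hand
                'index': dd.find('i', 'board-index').get_text(),
                'name': dd.find('p', 'name').get_text(),
                'star': dd.find('p', 'star').get_text().strip(),
                'releasetime': dd.find('p', 'releasetime').get_text(),
                'score': dd.find('p', 'score').get_text(),
            })
        return items

    def crawl_top100():
        for offset in range(0, 100, 10):
            html = get_one_page('http://maoyan.com/board/4?offset=%d' % offset)
            if html is None:
                # This guard is what the original loop was missing: passing None
                # into BeautifulSoup() raises the TypeError from the traceback.
                print('request failed at offset', offset)
                continue
            for item in parse_page(html):
                print(item)
            time.sleep(1)
    ```

    Calling crawl_top100() walks the ten pages. Whether each field is still present depends on Maoyan's current markup and anti-crawler measures, so treat the selectors above as a starting point rather than a guaranteed fix.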

