用BS爬取网页内容之后提取标签属性，显示AttributeError: 'NoneType' object has no attribute 'text'。用print可以成功提取出文本内容，放在循环里就出错。

用BS爬取网页内容之后标签属性一直出错，显示AttributeError: 'NoneType' object has no attribute 'text'

我用print在循环之前试过是可以成功提取出文本内容的，不知道为什么在循环里就不行。求大神解惑！

#s = content[0].find('h5',class_="result-sub-header")
#print(s.text.strip())

#遍历content，取出结果
#因为find_all返回的是一个list，再对list用find_all时，需要指定元素[0]
for i in range(len(content)): 
    #提取标题
    t = content[i].find('a',class_="title")
    title = t.text.strip()
    #提取链接
    url = 'https://www.forrester.com'+t['href']
    #提取摘要
    s = content[i].find('h5',class_="result-sub-header")
    summary = s.text.strip()

    #将提取的内容放在列表paper中
    paper = [title,'Cloud Migration',url,summary]
    #把每个paper加到paperlist
    paperlist.append(paper)

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

报告相同问题？

关注问题

爬虫AttributeError: 'NoneType' object has no attribute 'find' 的问题 python
2022-08-02 10:22

回答 5 已采纳你这个代码是在循环中多次执行的你不是每次循环 son_page中都有figure标签,第一次循环 son_page中就没有figure标签, son_tu = son_page.find("figur
代码运行错误：AttributeError: 'NoneType' object has no attribute 'find' html pycharm python
2022-01-12 13:49

回答 2 已采纳我发现你采集的数据里没有 class_="enry-footer" 和 class_="tags-links" 的内容注释掉了，就正常运行了你检查一下这两个关键字，改成正确的内容
python提示 AttributeError: 'NoneType' object has no attribute 'text' python 有问必答
2021-09-17 21:08

回答 1 已采纳因为最后一个news_li没有h2对象，所以article.h2为None，在调用text就出错了。需要先判断h2是否存在在获取text内容有帮助麻烦点个采纳【本回答右上角】，谢谢~~ for ar
Python爬虫报错（属性报错）：AttributeError: ‘NoneType‘ object has no attribute ‘children‘
2023-01-20 12:40

北辰远_code的博客 Python爬虫报错（属性错误）：AttributeError: 'NoneType' object has no attribute 'children'，加入erllib3模块，关闭ssl警告。
Python: 'NoneType' object has no attribute 'string' python 爬虫
2022-05-12 17:23

回答 2 已采纳我估计是link下边没有a，你取了没有的元素再调用string属性就报这个错了，你打印一下link看看吧
'NoneType' object has no attribute 'string'错误 python
2022-05-03 22:58

回答 2 已采纳看错误提示是因为p[1].find('span')没有找到这个标签，是一个空值
求助，python 报错：AttributeError: module 'log0' has no attribute 'out'怎么办？ python 开发语言
2020-03-02 11:24

回答 2 已采纳 import log0 as log 这个log0是哪里来的？ imagecodecs_lite 这个包似乎也没有安装对
Python 爬取网页信息 AttributeError :’NoneType’ object has no attribute ’attrs’
2020-12-13 13:26

我叫_W的博客小白一枚我不知道哪里错了请各位大神们解决一下吧代码附上 coding=gbk” 模拟访问页面 ...GET请求-------大多数网页的访问 HTTP请求------用户需要传输数据...2 对接收的文本进行筛选获取想要的内容用于筛选文本的包
爬取图片网站想获得href时报错，如何解决？(标签-BeautifulSoup|关键词-requests) python
2022-07-31 21:57

回答 1 已采纳 alist=main_page.find("div",class_="main-left left").find_all("a").find("div",class_="main-left left"
为什么class对应的属性值是唯一的，但拿不到任何值 python
2022-08-02 22:32

回答 2 已采纳
为啥我取不到p标签的数据？ python
2021-09-30 13:34

回答 1 已采纳 temp = html.find_all("div", class_="col-lg-4")首先一个页面含有多个div，并不是每个div里面都会含有p标签你要判断啊，不能默认肯定能找到p标签
AttributeError: ‘NoneType‘ object has no attribute ‘text‘
2023-02-20 15:19

m0_66116157的博客 import requests from bs4 ...product_price = soup.find('span', {'class': 'a-price-whole'}).text.strip() # 输出提取的数据 print(f'Product Title: {product_title}') print(f'Product Price: {product_price}')
爬虫程序返回值只有中文是乱码 python 爬虫
2021-09-15 13:49

回答 2 已采纳帮你修改了下 import requests from bs4 import BeautifulSoup #爬取所有的章节标题和章节内容 # https://www.xbiquge.la/13/13
解决python爬虫时遇到AttributeError: ‘NoneType‘ object has no attribute ‘find_all‘
2020-09-16 22:38

小朱小朱绝不服输的博客最近在练习学到的爬虫实例遇到AttributeError: ‘NoneType’ object has no attribute 'find_all’的错误。爬虫要求如下：任务描述：https://movie.douban.com/cinema/later/beijing/ 这个页面描述了北京最近上映...
AttributeError: 'NoneType' object has no attribute 'get_text'
2019-08-29 12:06

fwpevil的博客目标：为了找一些好看的电影，爬取猫眼电影排行榜前100的电影信息，看大家的选择是否适合自己工具：pycharm 第三方库：requests,bs4 代码思路：模拟浏览器请求，得到网页源码通过解析库获取需要的标签信息 ...
没有解决我的问题, 去提问

悬赏问题

¥15 电力市场出清matlab yalmip kkt 双层优化问题
¥30 ros小车路径规划实现不了，如何解决？(操作系统-ubuntu)
¥20 matlab yalmip kkt 双层优化问题
¥15 如何在3D高斯飞溅的渲染的场景中获得一个可控的旋转物体
¥88 实在没有想法，需要个思路
¥15 MATLAB报错输入参数太多
¥15 python中合并修改日期相同的CSV文件并按照修改日期的名字命名文件
¥15 有赏，i卡绘世画不出
¥15 如何用stata画出文献中常见的安慰剂检验图
¥15 c语言链表结构体数据插入

码龄粉丝数原力等级 --

用BS爬取网页内容之后提取标签属性，显示AttributeError: 'NoneType' object has no attribute 'text'。用print可以成功提取出文本内容，放在循环里就出错。

0条回答

悬赏问题