求解爬虫时出现str' object has no attribute 'get_text'问题

问题遇到的现象和发生背景当我爬取小量页面的时候就没问题，可页面多起来的时候就会出现str' object has no attribute 'get_text'问题，应该如何改下列代码，然后爬取多点页面数据，不出问题呢？十分感谢

import requests
import time
from  bs4 import BeautifulSoup
headers={
    'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/95.0.4638.69 Safari/537.36 Edg/95.0.1020.53'
}
def get_info(url):
    res=requests.get(url,headers=headers)
    soup=BeautifulSoup(res.text,'lxml')
    name=soup.select('#proName > a') if len(soup.select('#proName > a'))>0 else '无'
    price=soup.select('body > div.product-con > div.product-con > div.fl.pro-left > div.cell-con > div.cell-price > span.red')if len(soup.select('body > div.product-con > div.product-con > div.fl.pro-left > div.cell-con > div.cell-price > span.red'))>0 else '无'
    operator=soup.select('#cell-con-table > ul:nth-child(2) > li:nth-child(1) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(2) > li:nth-child(1) > div.right > p'))>0 else '无'
    fast=soup.select('#cell-con-table > ul:nth-child(4) > li:nth-child(6) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(4) > li:nth-child(6) > div.right > p'))>0 else '无'
    size=soup.select('#cell-con-table > ul:nth-child(6) > li:nth-child(1) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(6) > li:nth-child(1) > div.right > p'))>0 else '无'
    texture=soup.select('#cell-con-table > ul:nth-child(6) > li:nth-child(2) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(6) > li:nth-child(2) > div.right > p'))>0 else '无'
    resolution=soup.select('#cell-con-table > ul:nth-child(6) > li:nth-child(3) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(6) > li:nth-child(3) > div.right > p'))>0 else '无'
    pixel=soup.select('#cell-con-table > ul:nth-child(6) > li:nth-child(4) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(6) > li:nth-child(4) > div.right > p'))>0 else '无'
    system=soup.select('#cell-con-table > ul:nth-child(8) > li:nth-child(1) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(8) > li:nth-child(1) > div.right > p'))>0 else '无'
    cpu=soup.select('#cell-con-table > ul:nth-child(8) > li:nth-child(2) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(8) > li:nth-child(2) > div.right > p'))>0 else '无'
    memory=soup.select('#cell-con-table > ul:nth-child(8) > li:nth-child(7) > div.right > p > a')if len(soup.select('#cell-con-table > ul:nth-child(8) > li:nth-child(7) > div.right > p > a'))>0 else '无'
    capacity=soup.select('#cell-con-table > ul:nth-child(8) > li:nth-child(9) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(8) > li:nth-child(9) > div.right > p'))>0 else '无'
    sensor=soup.select('#cell-con-table > ul:nth-child(10) > li:nth-child(1) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(10) > li:nth-child(1) > div.right > p'))>0 else '无'
    rearcamera=soup.select('#cell-con-table > ul:nth-child(10) > li:nth-child(2) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(10) > li:nth-child(2) > div.right > p'))>0 else '无'
    frontcamera=soup.select('#cell-con-table > ul:nth-child(10) > li:nth-child(3) > div.right > p')if len(soup.select('#cell-con-table > ul:nth-child(10) > li:nth-child(3) > div.right > p'))>0 else '无'
    for name,price,operator,fast,size,texture,resolution,pixel,system,cpu,memory,capacity,sensor,rearcamera,frontcamera in zip(name,price,operator,fast,size,texture,resolution,pixel,system,cpu,memory,capacity,sensor,rearcamera,frontcamera):
       data={
            'name':name.get_text().strip(),
            'price': price.get_text().split('\n')[-1].strip(),
            'operator': operator.get_text().split('\n')[1].strip(),
             'fast':fast.get_text().strip(),
            'size':size.get_text().strip(),
            'texture':texture.get_text().strip(),
            'resolution':resolution.get_text().strip(),
            'pixel':pixel.get_text().strip(),
            'system':system.get_text().strip(),
            'cpu':cpu.get_text().strip(),
            'memory': memory.get_text().strip(),
            'capacity': capacity.get_text().strip(),
            'sensor':sensor.get_text().strip(),
            'rearcamera': rearcamera.get_text().strip(),
            'frontcamera':frontcamera.get_text().strip(),
        }
       print(data)





if __name__ == '__main__':
    urls=['https://product.cnmo.com/1626/162{}/param.shtml'.format(("%04d" % i)) for i in range(5700,5732)]

    for url in urls:
        get_info(url)

    time.sleep(1)

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

报告相同问题？

关注问题

pyspark报错，'DataFrame' object has no attribute '_jdf' python spark
2022-04-29 16:55

回答 1 已采纳可以参考一下
AttributeError: 'PandasArray' object has no attribute '_str_contains dashboard python
2021-12-28 16:52

回答 4 已采纳 dff = dff.loc[dff[col_name].str.contains(filter_value)] 把这个".loc"删掉试试？
'str' object has no attribute 'to_period' python 有问必答
2021-07-19 22:27

回答 1 已采纳报错信息是字符串对象没有to_period属性，使用pandas.Series.dt.to_period，见： https://pandas.pydata.org/pandas-docs/stable
《Python基础教程》内容总览篇（持续更新中）
2023-08-26 07:45

爱编程的喵喵的博客大家好，我是爱编程的喵喵。双985硕士毕业，现担任全栈工程师一职，热衷于将数据思维应用到工作与生活中。从事机器学习以及相关的前后...个人精心开设的《Python基础课程》专栏订阅量接近900，帮助不少同学解决了Bug。
python问题 AttributeError:xx object has no attribute python
2022-11-09 12:37

回答 4 已采纳初始化函数名字有问题还有其他的若干问题，给你修复了，你着❤️画的不错 import random from math import sin, cos, pi, log from tkinter im
Python报错：AttributeError: 'HomeSpider' object has no attribute 'get_page_all', 请教各位? python
2021-09-02 17:51

回答 2 已采纳后面那几个成员函数缩进不对,应该在class内部而不是和class同级
AttributeError: 'WebDriver' object has no attribute '_timeout'问题如何解决 python selenium 有问必答
2021-09-26 17:03

回答 1 已采纳 waitObject那条不需要，如果是想等页面元素加载完，可以用： driver.implicitly_wait(30)
python csv模块dictwrite_新手求助大神~关于csv模块-将数据写入csv文件总是报错，求解!...
2020-11-30 11:05

weixin_39626369的博客 dict_to_list(rowdict)) TypeError: *str* does not support the buffer interface 意思貌似是说writerow传的参数-字典类型的有问题？可到底是什么原因呢，csv模块中给的就是字典类型的呀，我快崩溃了，本人是初学者...
爬虫出现'NoneType' object has no attribute 'find_all错误 html 爬虫
2022-07-26 23:44

回答 2 已采纳你打印你的text看下，是乱码，要设置字符集
代码运行报错'NoneType' object has no attribute 'get' python 有问必答
2022-03-18 00:07

回答 2 已采纳你content字典中没有content属性content.get('content')获取的是 None你 print(content) 输出下字典看看获取的内容正确吗? 是不是错误信息可能是Co
AttributeError: 'str' object has no attribute 'keys' python 开发语言
2022-07-07 12:00

回答 2 已采纳第6/7行这种方法是按照字符读出文件，city_infos是一段字符串，city_infos=[0]是其中一个字符而已，不能当做字典使用。
小甲鱼零基础入门学习python笔记
2019-08-14 11:06

亦我飞也的博客小甲鱼老师零基础入门学习Python全套资料百度云(包括小甲鱼零基础入门学习Python全套视频+全套源码+全套PPT课件+全套课后题及Python常用工具包链接、电子书籍等）请往我的资源... 000 愉快的开始 ...
'str' object has no attribute 'get'咋解决？ python
2018-01-27 05:59

回答 9 已采纳 1.获取客户端提交的表达数据，数据类型为ImmutableMultiDict formData = request.form 2.将提取的数据转化成字典 formDict = formData.
爬虫教程（ 6 ） --- 爬虫进阶、扩展
2022-07-11 07:35

「已注销」的博客 1. 前言 1. 先看一个最简单的爬虫。...r = requests.get(url) print(r.text) 2. 一个正常的爬虫程序上面那个最简单的爬虫，是一个不完整的残疾的爬虫。因为爬虫程序通常需要做的事情如下： 1)给定的种子 URLs，...
Python初级教程-廖雪峰Python教程
2022-09-04 14:59

阿木霖的博客 # =为赋值，相当于在内存中创建了一个getVar的变量与5，并将getVar变量指向5 print(getVar) getVar = 3.1415 print(getVar) #两种写法均为将getVar加上5后再重新赋给getVar getVar = getVar + 5 getVar += 5 print...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
系统已结题 11月27日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 11月19日

悬赏问题

¥15 lvgl v8.2定时器提前到来
¥15 qtcp 发送数据时偶尔会遇到发送数据失败？用的MSVC编译器(标签-qt|关键词-tcp)
¥15 cam_lidar_calibration报错
¥15 拓扑学，凸集，紧集。。
¥15 如何扩大AIS数据容量
¥15 单纯型python实现编译报错
¥15 c++2013读写oracle
¥15 c++ gmssl sm2验签demo
¥15 关于模的完全剩余系(关键词-数学方法)
¥15 有没有人懂这个博图程序怎么写，还要跟SFB连接，真的不会，求帮助

求解爬虫时出现str' object has no attribute 'get_text'问题

问题遇到的现象和发生背景 当我爬取小量页面的时候就没问题，可页面多起来的时候就会出现str' object has no attribute 'get_text'问题，应该如何改下列代码，然后爬取多点页面数据，不出问题呢？十分感谢

0条回答 默认 最新

问题事件

悬赏问题

问题遇到的现象和发生背景当我爬取小量页面的时候就没问题，可页面多起来的时候就会出现str' object has no attribute 'get_text'问题，应该如何改下列代码，然后爬取多点页面数据，不出问题呢？十分感谢

0条回答默认最新