Symptoms and background of the problem
import requests
from bs4 import BeautifulSoup
url = 'https://www.shicimingju.com/book/sanguoyanyi.html'
headers = {"user-agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.198 Safari/537.36"}
url = "https://www.shicimingju.com/book/sanguoyanyi.html"
response = requests.get(url=url,headers=headers).content.decode('utf-8')
soup = BeautifulSoup(response,'lxml')
print("Requesting chapter content")
gettitle = soup.select("#main>#main_left>.book-mulu a").text
print(title)
Output and error message
C:\Users\Administrator\AppData\Local\Programs\Python\Python310\python.exe E:/编程/python/作品/实验/pycharm项目/爬虫/爬取三国演义.py
Traceback (most recent call last):
File "E:\编程\python\作品\实验\pycharm项目\爬虫\爬取三国演义.py", line 7, in <module>
soup = BeautifulSoup(response,'lxml').select("#main>#main_left>.book-mulu a").text
File "C:\Users\Administrator\AppData\Local\Programs\Python\Python310\lib\site-packages\bs4\element.py", line 2253, in __getattr__
raise AttributeError(
AttributeError: ResultSet object has no attribute 'text'. You're probably treating a list of elements like a single element. Did you call find_all() when you meant to call find()?
Process finished with exit code 1
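I think the problem is the .text at the end of the second-to-last line, but none of the fixes I found online worked. I was told the object might not have been instantiated, but I did instantiate it.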
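For reference, the error message itself points at the likely cause: select() returns a ResultSet (a list of matching tags), and .text exists on each individual tag, not on the list. The following is a minimal sketch of reading the text per element. It assumes a simplified, made-up HTML snippet in place of the real page (the chapter names and href values are placeholders, not the site's actual markup) and uses the built-in html.parser so it runs without a network request or lxml.

```python
from bs4 import BeautifulSoup

# Hypothetical, simplified stand-in for the chapter list on the real page.
# The chapter names and hrefs below are placeholders for illustration only.
html = """
<div id="main">
  <div id="main_left">
    <div class="book-mulu">
      <ul>
        <li><a href="/chapter-1.html">Chapter 1</a></li>
        <li><a href="/chapter-2.html">Chapter 2</a></li>
      </ul>
    </div>
  </div>
</div>
"""

soup = BeautifulSoup(html, "html.parser")

# select() returns a ResultSet, i.e. a list of Tag objects.
links = soup.select("#main > #main_left > .book-mulu a")

# .text is available on each Tag, so read it element by element.
titles = [a.text for a in links]
print(titles)

# If only the first match is needed, select_one() returns a single Tag
# (or None when nothing matches), so .text works on it directly.
first = soup.select_one("#main > #main_left > .book-mulu a")
first_title = first.text if first is not None else None
print(first_title)
```

In the original script, the same idea would mean looping over the result of soup.select(...) and printing each element's .text, and printing the variable that was actually assigned (gettitle) instead of the undefined name title.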
Scraping Romance of the Three Kingdoms (三国演义)