Error while scraping: ValueError: can only parse strings

The source code is as follows:
import requests
import json
from requests.exceptions import RequestException
import time
from lxml import etree

def get_one_page(url):
    try:
        headers = {
            'User-Agent': 'Mozilla/5.0(Macintosh;Intel Mac OS X 10_13_3) AppleWebKit/537.36(KHTML,like Gecko) Chorme/65.0.3325.162 Safari/537.36'
        }
        response = requests.get(url,headers = headers)
        if response.status_code == 200:
            return response.text
        return None
    except RequestException:
        return None

def parse_one_page(html):
    html_coner = etree.HTML(html)
    pattern = html_coner.xpath('//div[@id="container"]/div[@id="main"/div[@class = "ywnr_box"]//a/text()')
    return pattern

def write_to_file(content):
    with open('results.txt','a',encoding='utf-8') as f:
        f.write(json.dumps(content,ensure_ascii=False)+'\n')

def main(offset):
    url = 'http://www.cdpf.org.cn/yw/index_'+str(offset)+'.shtml'
    html = get_one_page(url)
    for item in parse_one_page(html):
        print(item)
        write_to_file(item)

if __name__ == '__main__':
    for i in range(6):
        main(offset=i*10)
        time.sleep(1)
Could anyone tell me where exactly this goes wrong?

1 answer
threenewbee 2019-10-12 17:37
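Python code depends on indentation, and what you pasted has no indentation at all.
When Python reports an error, it also gives the line number and even the content of the offending line; you did not include that either.
Check it carefully yourself: somewhere a string is required, and the variable you pass there is not a string.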
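
To make that last point concrete: the only call in the question that parses a string is etree.HTML(html), and get_one_page() returns None whenever the status code is not 200 or a RequestException is raised. Passing that None to etree.HTML is the likely source of "ValueError: can only parse strings", although without the traceback this remains an assumption. A minimal sketch of a guarded parse_one_page, which also adds the "]" that is missing after @id="main" in the original XPath:

```python
from lxml import etree


def parse_one_page(html):
    """Return the link texts, or an empty list when no page was fetched."""
    # get_one_page() returns None on a non-200 status or a RequestException.
    # etree.HTML() accepts only str or bytes, so None must be filtered out here.
    if not isinstance(html, str) or not html.strip():
        return []
    tree = etree.HTML(html)
    # Same expression as in the question, with the missing "]" restored.
    return tree.xpath(
        '//div[@id="container"]/div[@id="main"]/div[@class="ywnr_box"]//a/text()'
    )
```

With this guard, main() simply writes nothing for a page that could not be fetched. Printing the URL and response.status_code inside get_one_page() before returning None would show which requests fail and why.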
