RedisSpider爬虫报错 2020-09-02 14:44:36 [twisted] CRITICAL: Unhandled error in Deferred:

# -*- coding: utf-8 -*-
import scrapy,re
from bs4 import BeautifulSoup
from scrapy_redis.spiders import RedisSpider
from urllib import parse

class TtttSpider(RedisSpider):
    name = 'tttt'
    allowed_domains = ['chinanews.com']
    redis_key = "tttt"

    def parse(self, response):
        html = response.text
        soup = BeautifulSoup(html, 'html.parser')
        data = soup.find_all('a')
        for item in data:
            index = {}
            if item.string is not None and item['href'] != 'javascript:;' and item.get('href') and item['href'] != '#':
                url = parse.urljoin(response.url, item.get('href'))
                index[url] = item.string
                print("index", index)
                print(url)
                yield scrapy.Request(url, callback=self.next_parse, meta={"item": index})

    def next_parse(self, response):
        print("11111111")

不是立即报错，调用了parse函数，就不在往下执行了。报错

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
threenewbee 2020-09-02 15:19
关注
这一般是原生dll内存泄露或者调用错误。
https://blog.csdn.net/z564359805/article/details/80803730

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

python运行scrapy框架出现报错 NameError: name 'imp' is not defined python
2022-04-28 23:20

回答 7 已采纳如果你不记得改了什么的话，重装吧。毕竟你改了啥，怎么改回去就只有神才知道了。环境里面的.py文件改了的话基本没什么方法，除了重装。按报错来看，playwright, pyee,twisted,win3
scrapy部署在服务器运行一段时间出现ERROR: Error downloading selenium ubuntu 爬虫
2022-08-09 17:43

回答 1 已采纳服务器掉网？？应该不会，你在服务器的那个控制平台不是可以看网络监控嘛？应该不是断网的问题。可能就是被反爬，你爬的数据多，一段时间内请求多，一般都会被反扒，而你又没有设置代码
爬虫scrapy框架爬不出来，但是request可以出来 http python 爬虫
2022-05-06 00:26

回答 2 已采纳你应该继承 scrapy.SpiderCrawlSpider 不要自定义 parse 函数。
pycharm +scrapy 运行报错：[twisted] CRITICAL: Unhandled error in Deferred
2017-02-24 20:01

11122323l123的博客 ... 这个问题可能是你的pywin32版本不太对，无论你的电脑是32位还是64位，你要安装的pywin32一定要与你Python的版本相一致， ...CRITICAL: Unhandled error in Deferred: （2） Scrapy 0.25 文档
python中twisted的安装问题 python 开发语言
2022-04-13 22:51

回答 1 已采纳你没有下载这个包或包的路径不对。最好使用：pip install Twisted[windows_platform]
在以瀑布流方式翻页的网站,使用scrapy网络爬虫,但是只爬取了第一页数据,没有爬取第二页. python 爬虫
2021-09-05 19:18

回答 2 已采纳那叫ajax，
scrapy如何手动停止爬虫？ python
2021-05-10 09:54

回答 1 已采纳 Ctrl+C 只是终止主线程,你的其他线程没有守护,所以 Ctrl+C 后它们继续运行。另外scrapy中的 Ctrl+C 是暂停，并不是完全停止，Ctrl+C 是断点续爬的基础。
记mac上解决scrapy执行爬虫项目报错：[twisted] CRITICAL: Unhandled error in Deferred
2021-06-16 23:20

房东为什么不退押金的博客 2021-06-16 23:14:28 [twisted] CRITICAL: Unhandled error in Deferred: 解决什么解决，直接重新创建一个项目！首先在pycharm上创建一个新的项目选择创建一个虚拟环境安装scrapy模块 python解释器选择为刚刚...
爬虫关于xpath在代码中返回为空的问题_美剧天堂电影爬取的案例 python 有问必答
2021-05-13 22:48

回答 3 已采纳少了一个空格，没有选中li元素
Yii render和renderPartial：谷歌地图没有正确显示[重复] php
2014-07-28 08:33

回答 1 已采纳 Try this <style> #map-canvas img { max-width
如何用python获取这个网页的HTML（超文本链接语言）？ python 开发语言
2020-03-10 12:56

回答 2 已采纳实验了一下，加了个请求头，试过可以获取，我的代码 ``` import requests import html headers = {"User-Agent": "Mozilla/5.0
【求助】 python3-scrapy [twisted] CRITICAL: Unhandled error in Deferred
2019-04-29 17:50

Darkofnoon的博客各位大佬好，小弟在用python3写scrapy框架爬虫时遇到一些问题，代码与问题如下 .\spider\quotes.py import scrapy from tutorial.items import QuoteItem class QuotesSpider(scrapy.Spider): name = '...
如何在网格中点击背景图像？ css html javascript php
2018-01-17 21:02

回答 2 已采纳 A purely CSS HTML method would be to add a cover anchor element: <div class="f f1"> &lt
运行scrapy 报错：CRITICAL: Unhandled error in Deferred [Errno 11] Resource temporarily unavailable
2019-11-21 12:00

数据知道的博客报错如下： 2019-11-21 03:56:07 [engine.py:256] INFO: Spider opened 2019-11-21 03:56:07 [logstats.py:48] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 ...Unhandled error in Deferred:...
运行scrapy demo时报错：[twisted] CRITICAL: Unhandled error in Deferred
2019-06-11 02:14

CrazZy651314的博客报错+分析原始报错 ... 2019-06-11 01:23:22 [scrapy.core.engine] INFO: Spider opened ...2019-06-11 01:23:22 [twisted] CRITICAL: Unhandled error in Deferred: Traceback (most recent ca...
Scrapy [twisted] CRITICAL: Unhandled error in Deferred:错误
2016-01-08 12:10

忆梦涟的博客最近刚开始使用爬虫框架scrapy，开始就遇到这么个问题，严重挫伤学习心情，终于耗费一天时间解决这个问题解决方法：在安装scrapy的时候下载了pywin32，但是忘记安装这个模块进入python27目录下，手动安装，...
关于解决Unhandled error in Deferred或提示NameError: name 'xxPipeline' is not defined
2018-08-28 22:24

自封的羽球大佬的博客解决Unhandled error in Deferred或提示NameError: name 'xxPipeline' is not defined，错误描述如下： [root@Uu tutu]# scrapy crawl tutu 2018-08-26 18:18:12 [scrapy.utils.log] INFO: Scrapy 1.5.1 started ...
没有解决我的问题, 去提问

悬赏问题

¥15 微信公众号自制会员卡没有收款渠道啊
¥15 stable diffusion
¥100 Jenkins自动化部署—悬赏100元
¥15 关于#python#的问题：求帮写python代码
¥20 MATLAB画图图形出现上下震荡的线条
¥15 关于#windows#的问题：怎么用WIN 11系统的电脑克隆WIN NT3.51-4.0系统的硬盘
¥15 perl MISA分析p3_in脚本出错
¥15 k8s部署jupyterlab，jupyterlab保存不了文件
¥15 ubuntu虚拟机打包apk错误
¥199 rust编程架构设计的方案有偿

RedisSpider爬虫报错 2020-09-02 14:44:36 [twisted] CRITICAL: Unhandled error in Deferred:

1条回答 默认 最新

悬赏问题

1条回答默认最新