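The code below has a bug that I can't track down

Where is the bug in the code below?
I have run it several times and it never runs to completion.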
from urllib import request
from bs4 import BeautifulSoup
import re
import sys

if __name__ == "__main__":
    # Create the output txt file
    file = open('一念永恒.txt', 'w', encoding='utf-8')
    # Table-of-contents URL for the novel 《一念永恒》
    target_url = 'http://www.biqukan.com/1_1094/'
    head = {}
    head['User-Agent'] = 'Mozilla/5.0 (Linux; Android 4.1.1; Nexus 7 Build/JRO03D) AppleWebKit/535.19 (KHTML, like Gecko) Chrome/18.0.1025.166 Safari/535.19'
    target_req = request.Request(url = target_url, headers = head)
    target_response = request.urlopen(target_req)
    target_html = target_response.read().decode('gbk','ignore')
    listmain_soup = BeautifulSoup(target_html)
    # Find the div tags whose class is listmain
    chapters = listmain_soup.find_all('div',class_ = 'listmain')
    download_soup = BeautifulSoup(str(chapters))
    # Compute the number of chapters
    numbers = (len(download_soup.dl.contents) - 1) / 2 - 8
    index = 1
    begin_flag = False
    for child in download_soup.dl.children:
        if child != '\n':
            # Find the start of the main volume, 《一念永恒》正文卷
            if child.string == u"《一念永恒》正文卷":
                begin_flag = True
            # Follow each chapter link and download its content
            if begin_flag == True and child.a != None:
                download_url = "http://www.biqukan.com" + child.a.get('href')
                download_req = request.Request(url = download_url, headers = head)
                download_response = request.urlopen(download_req)
                download_html = download_response.read().decode('gbk','ignore')
                download_name = child.string
                soup_texts = BeautifulSoup(download_html)
                texts = soup_texts.find_all(id = 'content', class_ = 'showtxt')
                soup_text = BeautifulSoup(str(texts))
                write_flag = True
                file.write(download_name + '\n\n')
                # Write the scraped content to the file
                for each in soup_text.div.text.replace('\xa0',''):
                    if each == 'h':
                        write_flag = False
                    if write_flag == True and each != ' ':
                        file.write(each)
                    if write_flag == True and each == '\r':
                        file.write('\n')
                    print('Writing section {0}'.format(index))
                    index+=1
                file.write('\n\n')
                # Print download progress
                sys.stdout.write("Downloaded: %.3f%%" % float(index/numbers) + '\r')
                sys.stdout.flush()
                index += 1
    file.close()

2 answers

CSDN专家-showbo 2022-10-26 14:11

When you post code, format it with the </> button in the toolbar. Otherwise the Python loses its indentation and the code is unreadable.

# from urllib import request  (replaced by requests below)
import sys

import requests
from bs4 import BeautifulSoup


# Create the output txt file
file = open('一念永恒.txt', 'w', encoding='utf-8')
# Table-of-contents URL for the novel 《一念永恒》
target_url = 'http://www.biqukan.com/1_1094/'
head = {}
head['User-Agent'] = 'Mozilla/5.0 (Linux; Android 4.1.1; Nexus 7 Build/JRO03D) AppleWebKit/535.19 (KHTML, like Gecko) Chrome/18.0.1025.166 Safari/535.19'

# 'ignore' skips bytes that are not valid GBK instead of raising an error
target_html = requests.get(target_url, headers=head).content.decode('gbk', 'ignore')
print(target_html)  # debug output: dump the table-of-contents page
# Name the parser explicitly so BeautifulSoup does not have to guess one
listmain_soup = BeautifulSoup(target_html, features="html.parser")
# Find the div tags whose class is listmain
chapters = listmain_soup.find_all('div', class_='listmain')
download_soup = BeautifulSoup(str(chapters), features="html.parser")

# Compute the number of chapters
numbers = (len(download_soup.dl.contents) - 1) / 2 - 8
index = 1
begin_flag = False
for child in download_soup.dl.children:
    if child != '\n':
        # Find the start of the main volume, 《一念永恒》正文卷
        if child.string == "《一念永恒》正文卷":
            begin_flag = True
        # Follow each chapter link and download its content
        if begin_flag == True and child.a != None:
            download_url = "http://www.biqukan.com" + child.a.get('href')
            download_html = requests.get(download_url, headers=head).content.decode('gbk', 'ignore')
            download_name = child.string
            soup_texts = BeautifulSoup(download_html, features="html.parser")
            texts = soup_texts.find_all(id='content', class_='showtxt')
            print(texts)  # debug output: the raw chapter body
            soup_text = BeautifulSoup(str(texts), features="html.parser")
            write_flag = True
            file.write(download_name + '\n\n')
            # Report once per chapter, not once per character
            print('Writing section {0}'.format(index))
            # Write the scraped content to the file, one character at a time
            for each in soup_text.div.text.replace('\xa0', ''):
                # Stop writing at the first 'h' (start of the trailing URL)
                if each == 'h':
                    write_flag = False
                if write_flag == True and each != ' ':
                    file.write(each)
                if write_flag == True and each == '\r':
                    file.write('\n')
            file.write('\n\n')
            # Print download progress as a percentage
            sys.stdout.write("Downloaded: %.3f%%" % (index / numbers * 100) + '\r')
            sys.stdout.flush()
            # Increment once per chapter so the percentage stays meaningful
            index += 1
file.close()
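
The two parts of the loop that are easiest to get wrong, the text cleanup and the progress percentage, can be checked without touching the network. Below is a minimal sketch, not part of the original answer: `clean_chapter_text` and `progress_percent` are hypothetical helper names, and the cleanup assumes (as the `each == 'h'` check in the loop suggests) that everything from the first `h` onward is a trailing URL to discard.

```python
def clean_chapter_text(raw):
    """Mirror the per-character loop: drop '\xa0' and spaces, cut the text
    at the first 'h', and follow every '\r' with '\n'."""
    text = raw.replace('\xa0', '')
    cut = text.find('h')           # assumed start of the trailing URL
    if cut != -1:
        text = text[:cut]
    return text.replace(' ', '').replace('\r', '\r\n')


def progress_percent(index, total):
    """Return progress as a percentage; 0.0 if the chapter count is not positive."""
    if total <= 0:
        return 0.0
    return index / total * 100


if __name__ == "__main__":
    sample = '\xa0\xa0第一 章\r正文http://example.invalid/'
    print(repr(clean_chapter_text(sample)))
    print("Downloaded: %.3f%%" % progress_percent(5, 20))
```

Note that cutting at the first `h` also truncates any chapter whose body happens to contain that letter, so a stricter marker such as `http` may be a safer choice if the page layout allows it.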

This answer was selected by the asker as the best answer.

Question events

Closed by the system: November 3
Answer accepted: October 26
Question created: October 26
