Scraping Numbers from HTML using BeautifulSoup

Scraping Numbers from HTML using BeautifulSoup， python代码哪里错了？？是只sum了最后一行吗救命啊！

问题相关代码，请勿粘贴截图：

import re
import urllib.request, urllib.parse, urllib.error
from bs4 import BeautifulSoup
import ssl
ctx = ssl.create_default_context()
ctx.check_hostname = False
ctx.verify_mode = ssl.CERT_NONE

url = input('Enter - ')
html = urllib.request.urlopen(url, context=ctx).read()
soup = BeautifulSoup(html, "html.parser")

tags = soup('span')
#lis = list ()
nums = list()
sums = 0
for tag in tags:
#y=str(tag)
#lis.append(y)
#print (lis)
nums = re.findall('[0-9]+',str(tag))
#nums.append(num)
for num in nums:
#y = ''.join(nums)
#num = int(num)
sums = sums +int(num)
#numbers = [ int(x) for x in nums ]
print (sums)

运行结果及报错内容，代码运行结果结果是2

我想要达到的结果：(Sum ends with 28)

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除
收藏举报

3条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
kakaccys 2022-08-14 22:42
关注
需要改一下，还有一个点就是你的nums每次tags这个for循环里，都重新赋值了一次，所以你获得的是2，你可以看下你输入的那个网站，最后一个行的值刚好是2，还有你发的这个网页的最终结果应该是2553，你input写错网址了：

sums = 0 for tag in tags: k = float(tag.contents[0]) sums += k print(sums)
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(2条)

报告相同问题？

关注问题

Scraping Numbers from HTML using BeautifulSoup python
2022-08-14 21:52

回答 3 已采纳需要改一下，还有一个点就是你的nums每次tags这个for循环里，都重新赋值了一次，所以你获得的是2，你可以看下你输入的那个网站，最后一个行的值刚好是2，还有你发的这个网页的最终结果应该是2553，
爬虫显示成功，但是保存的json文件里都是none，如何解决呢？ html python 爬虫
2022-06-24 16:12

回答 3 已采纳看你自己输出的日志2022-06-24 16:02:42,409 - INFO: get detail data {'cover': None, 'name': None, 'categories':
等同于Go中Python的HTML解析功能/模块？ html xml
2013-09-03 03:45

回答 1 已采纳 From the Go http.Get Example: package main import ( "fmt" "io/ioutil" "log" "net
Scraping HTML Data with BeautifulSoup
2023-01-11 12:50

*OASIS*的博客 Scraping Numbers from HTML using BeautifulSoupIn this assignment you will write a Python program similar tohttp://www.py4e.com/code3/urllink2.py. The program will useurllibto read the HTML from the ...
正则表达式在html中找到元素的每个实例[重复] html php
2014-02-12 00:53

回答 1 已采纳 Whatever you do, don't use a regex! HE COMES Instead, use a parser: $dom = new DOMDocument(); $d
限制gocolly一次处理有限数量的网址
2018-06-29 03:02

回答 1 已采纳 OnRequest is done before the request is actually sent to the server. Your debug statement is misle
每个已删除的html标记值都插入到mysql中的多个新行中 html mysql php
2014-12-14 14:00

回答 1 已采纳 Because you have insert query inside your for loop foreach ($describtion as $desc) { $conten
【题解】Scraping HTML Data with BeautifulSoup (Using Python to Access Web Data)
2020-02-08 12:36

暂时不用了的id的博客吐槽：因为不能听最新版的课，只好听...题目：Scraping Numbers from HTML using BeautifulSoup In this assignment you will write a Python program similar tohttp://www.py4e.com/code3/urllink2.py. The prog...
运行.exe时保持提示窗口打开（Windows）
2017-06-10 02:38

回答 1 已采纳 One trick is by waiting user input at the end of your application. Once user press any key, exit t
如何下载网页数据库提供的内容？ django html javascript php python
2017-04-06 08:29

回答 1 已采纳 Inspect for the API calls made from your browser on the Network tab on Chrome Developer tools when
PHP如何获取字符串中的第二个数字 php
2016-03-17 09:56

回答 3 已采纳 Try this: <?php $string = '1 PLN = 0.07 Gold'; $pattern = '/\d+\.\d+/'; $matches = array();
【Python for Everybody】所有assignment课后作业代码
2022-07-22 23:05

wnuow的博客 When you find a line that starts with 'From ’ like the following line: From stephen.marquard@uct.ac.za Sat Jan 5 09:14:16 2008 You will parse the From line using split() and print out the second ...
Facebook Scraping返回错误500 php
2012-05-30 17:45

回答 1 已采纳 I've changed the technique of retrieving the image, now I do it like this: $access_token=$faceboo
可视化编程语言_可视化编程语言影响图
2020-08-07 11:12

cumian8165的博客可视化编程语言 Gephi和Sigma.js的网络可视化教程 (A network visualization tutorial with Gephi and Sigma.js) Here’s a preview of what we’ll be making today: the programming languages influence graph. ...
python3 beautifulsoup 表格指定行,BeautifulSoup按数字指定表格列？
2021-01-14 20:24

weixin_39819393的博客 Using Python 2.7 and BeautifulSoup 4, I'm scraping song names from a table.Right now the script finds links in the row of a table; how can I specify I want the first column?Ideally I'd be able to swit...
python操作html的object,使用Selenium Python解析HTML并读取HTML表
2021-02-21 01:22

写材料宝库的博客 I am converting some of my web-scraping code from R to Python (I can't get geckodriver to work with R, but it's working with Python). Anyways, I am trying to understand how to parse and read HTML tabl...
selenium html table,Parse HTML and Read HTML Table with Selenium Python
2021-06-19 09:08

kotlit的博客 I am converting some of my web-scraping code from R to Python (I can't get geckodriver to work with R, but it's working with Python). Anyways, I am trying to understand how to parse and read HTML tabl...
Using Python to Access Web Data
2020-04-21 10:04

Loucas99的博客题目：Scraping Numbers from HTML using BeautifulSoup In this assignment you will write a Python program similar to http://www.py4e.com/code3/urllink2.py. The program will use urllib to read the HTML ...
python开发中级_针对中级Python开发人员的13个项目构想
2020-07-13 22:25

cumei1658的博客 After scraping content from various sites, you’ll need to save it somewhere. So, you’ll use a database to save the scraped content. 从各个站点抓取内容后，您需要将其保存在某处。因此，您将使用数据库...
文本预处理方法_生产中的自然语言处理27种快速文本预处理方法
2020-10-15 09:53

weixin_26729375的博客新的深度学习语言模型(变压器)已引起行业应用的爆炸式增长[5,6.11] 。 This blog is not an article introducing you to Natural Language Processing. Instead, it assumes you are familiar with noise reduction...
没有解决我的问题, 去提问

问题事件

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
系统已结题 8月31日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
已采纳回答 8月23日
关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
创建了问题 8月14日

悬赏问题

¥15 2024-五一综合模拟赛
¥15 下图接收小电路，谁知道原理
¥15 装 pytorch 的时候出了好多问题，遇到这种情况怎么处理？
¥20 IOS游览器某宝手机网页版自动立即购买JavaScript脚本
¥15 手机接入宽带网线，如何释放宽带全部速度
¥30 关于#r语言#的问题：如何对R语言中mfgarch包中构建的garch-midas模型进行样本内长期波动率预测和样本外长期波动率预测
¥15 ETLCloud 处理json多层级问题
¥15 matlab中使用gurobi时报错
¥15 这个主板怎么能扩出一两个sata口
¥15 不是，这到底错哪儿了😭