Scraping Douban movies into a database fails with TypeError: %d format: a number is required, not str

import requests
from lxml import etree
import pymysql
import re
import time
conn=pymysql.connect(host='localhost',user='root',passwd='123456',db='mydb',port='3306',charset='utf8')
cursor=conn.cursor()  # connect to the database and get a cursor
headers={'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/69.0.3497.100 Safari/537.36'}
def get_movie_url(url):
    html=requests.get(url,headers=headers)
    selector=etree.HTML(html.text)
    movie_hrefs=selector.xpath('//div[@class="hd"]/a/@href')
    for movie_href in movie_hrefs:
        get_movie_info(movie_href)

def get_movie_info(url):
    html = requests.get(url, headers=headers)
    selector = etree.HTML(html.text)
    try:
        name=selector.xpath('//div[@id="content"]/h1/span/text()')[0]
        director=selector.xpath('//div[@id="info"]/span[1]/span[2]/a/text()')[0]
        # select the element itself (not its text) so string(.) can be evaluated on it
        actors=selector.xpath('//div[@id="info"]/span[3]/span[2]')[0]
        actor=actors.xpath('string(.)')
        style=re.findall('<span property="v:genre">(.*?)</span>',html.text,re.S)[0]
        country=re.findall('<span class="pl">制片国家/地区:</span>(.*?)<br>',html.text,re.S)[0]
        release_time=re.findall('上映日期:</span>.*?>(.*?)</span>',html.text,re.S)[0]
        time=re.findall('片长:</span>.*?>(.*?)</span>',html.text,re.S)[0]
        score=selector.xpath('//*[@id="interest_sectl"]/div[1]/div[2]/strong/text()')[0]
        cursor.execute(
            "insert into doubanmovie (name,director,actor,style,country,release_time,time,score) values(%s,%s,%s,%s,%s,%s,%s,%s)",
            (str(name),str(director),str(actor),str(style),str(country),str(release_time),str(time),str(score)))


    except IndexError:
        pass

if __name__=='__main__':
    urls=['https://movie.douban.com/top250?start={}'.format(str(i)) for i in range(0,250,25)]
    for url in urls:
        get_movie_url(url)
        time.sleep(2)
    conn.commit()


3 answers

threenewbee 2018-12-01 16:06
urls=['https://movie.douban.com/top250?start={}'.format(str(i)) for i in range(0,250,25)]
->
urls=['https://movie.douban.com/top250?start={}'.format(i) for i in range(0,250,25)]

If this solves your problem, please click "Accept" at the top left of my answer. Thanks.
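
A note on this fix: `str.format` with a `{}` placeholder accepts either a string or an integer, so both spellings of the URL list build the same URLs, and that change alone may not remove the error. A more likely source, assuming the traceback points into PyMySQL rather than into the scraper code, is `port='3306'` in the `pymysql.connect` call: some PyMySQL versions format the port with `%d`, which fails when the port is a string. The sketch below reproduces the message without a database. `build_urls` and `host_info` are hypothetical helpers written for this illustration; `host_info` only imitates that formatting step and is not PyMySQL's actual code.

```python
# Sketch only: reproduces the error message without needing a database.

def build_urls(as_str):
    """Build the Top 250 page URLs, passing the offset as str or as int."""
    return [
        'https://movie.douban.com/top250?start={}'.format(str(i) if as_str else i)
        for i in range(0, 250, 25)
    ]

# "{}" accepts either type, so both spellings give identical URLs.
same_urls = build_urls(True) == build_urls(False)


def host_info(host, port):
    """Hypothetical stand-in for a driver that formats its port with %d."""
    return "socket %s:%d" % (host, port)


try:
    host_info('localhost', '3306')  # port passed as a string, as in the question
    error_message = None
except TypeError as exc:
    error_message = str(exc)  # "%d format: ... is required, not str"

# Connection settings from the question, with the port as an integer.
db_config = dict(host='localhost', user='root', passwd='123456',
                 db='mydb', port=3306, charset='utf8')
# conn = pymysql.connect(**db_config)  # needs pymysql and a running MySQL server
```

If the error disappears once the port is passed as `port=3306`, the connection call was the culprit. Newer Python versions word the same message as "a real number is required, not str".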
