weixin_45726477 2021-02-08 17:26 Acceptance rate: 100%
Views: 65
Accepted

I scraped this data with a Python crawler but don't know how to clean it up. Please help me extract the maximum value, thanks!

{'size': 3782, 'mimeType': 'text/xml;charset=utf-8', 'text': "<root><keyIdInfo value='系统总值'/><time>20210208000000#20210209000000</time><flag>increment</flag><serv_date>08</serv_date><value><data>00:00:00#751.08307|00:05:00#741.41266|00:10:00#736.6002|00:15:00#731.436|00:20:00#728.5123|00:25:00#720.8494|00:30:00#714.84937|00:35:00#712.7811|00:40:00#708.42725|00:45:00#701.85803|00:50:00#697.9517|00:55:00#696.88196|01:00:00#690.7043|01:05:00#683.42126|01:10:00#682.9514|01:15:00#675.9378|01:20:00#672.8906|01:25:00#668.8239|01:30:00#664.5707|01:35:00#661.9885|01:40:00#656.99945|01:45:00#653.6169|01:50:00#653.60376|01:55:00#646.2455|02:00:00#642.6717|02:05:00#640.80865|02:10:00#641.80457|02:15:00#640.1921|02:20:00#641.86505|02:25:00#638.2216|02:30:00#640.5401|02:35:00#636.0871|02:40:00#633.96216|02:45:00#633.5946|02:50:00#630.5995|02:55:00#628.2829|03:00:00#628.0964|03:05:00#627.0724|03:10:00#625.2632|03:15:00#624.2384|03:20:00#621.66345|03:25:00#621.8926|03:30:00#622.3168|03:35:00#618.6344|03:40:00#621.3522|03:45:00#622.687|03:50:00#618.2967|03:55:00#622.1722|04:00:00#624.0422|04:05:00#619.6375|04:10:00#619.5008|04:15:00#619.8316|04:20:00#617.0117|04:25:00#618.7152|04:30:00#616.8513|04:35:00#616.9521|04:40:00#619.7799|04:45:00#620.54456|04:50:00#621.25616|04:55:00#619.22284|05:00:00#619.4201|05:05:00#623.513|05:10:00#625.0094|05:15:00#626.5777|05:20:00#629.3571|05:25:00#629.65625|05:30:00#631.114|05:35:00#632.51166|05:40:00#632.54456|05:45:00#634.16437|05:50:00#641.59515|05:55:00#645.8057|06:00:00#641.1918|06:05:00#649.31256|06:10:00#656.4364|06:15:00#664.671|06:20:00#671.5155|06:25:00#677.17065|06:30:00#681.54443|06:35:00#692.7332|06:40:00#698.6563|06:45:00#710.1721|06:50:00#716.949|06:55:00#725.0946|07:00:00#737.63947|07:05:00#752.26697|07:10:00#769.0588|07:15:00#787.99347|07:20:00#806.53925|07:25:00#822.4117|07:30:00#839.4705|07:35:00#859.8165|07:40:00#871.9757|07:45:00#884.2808|07:50:00#897.7839|07:55:00#905.7737|08:00:00#910.9376|08:05:00#915.5463|08:10:00#923.7726|
08:15:00#928.39923|08:20:00#932.8077|08:25:00#93


3 answers

  • 放风喽 2021-02-08 18:31
    import re
    
    
    shuju = "00:00:00#751.08307|00:05:00#741.41266|00:10:00#736.6002|00:15:00#731.436|00:20:00#728.5123|00:25:00#720.8494|00:30:00#714.84937|00:35:00#712.7811|00:40:00#708.42725|00:45:00#701.85803|00:50:00#697.9517|00:55:00#696.88196|01:00:00#690.7043|01:05:00#683.42126|01:10:00#682.9514|01:15:00#675.9378|01:20:00#672.8906|01:25:00#668.8239|01:30:00#664.5707|01:35:00#661.9885|01:40:00#656.99945|01:45:00#653.6169|01:50:00#653.60376|01:55:00#646.2455|02:00:00#642.6717|02:05:00#640.80865|02:10:00#641.80457|02:15:00#640.1921|02:20:00#641.86505|02:25:00#638.2216|02:30:00#640.5401|02:35:00#636.0871|02:40:00#633.96216|02:45:00#633.5946|02:50:00#630.5995|02:55:00#628.2829|03:00:00#628.0964|03:05:00#627.0724|03:10:00#625.2632|03:15:00#624.2384|03:20:00#621.66345|03:25:00#621.8926|03:30:00#622.3168|03:35:00#618.6344|03:40:00#621.3522|03:45:00#622.687|03:50:00#618.2967|03:55:00#622.1722|04:00:00#624.0422|04:05:00#619.6375|04:10:00#619.5008|04:15:00#619.8316|04:20:00#617.0117|04:25:00#618.7152|04:30:00#616.8513|04:35:00#616.9521|04:40:00#619.7799|04:45:00#620.54456|04:50:00#621.25616|04:55:00#619.22284|05:00:00#619.4201|05:05:00#623.513|05:10:00#625.0094|05:15:00#626.5777|05:20:00#629.3571|05:25:00#629.65625|05:30:00#631.114|05:35:00#632.51166|05:40:00#632.54456|05:45:00#634.16437|05:50:00#641.59515|05:55:00#645.8057|06:00:00#641.1918|06:05:00#649.31256|06:10:00#656.4364|06:15:00#664.671|06:20:00#671.5155|06:25:00#677.17065|06:30:00#681.54443|06:35:00#692.7332|06:40:00#698.6563|06:45:00#710.1721|06:50:00#716.949|06:55:00#725.0946|07:00:00#737.63947|07:05:00#752.26697|07:10:00#769.0588|07:15:00#787.99347|07:20:00#806.53925|07:25:00#822.4117|07:30:00#839.4705|07:35:00#859.8165|07:40:00#871.9757|07:45:00#884.2808|07:50:00#897.7839|07:55:00#905.7737|08:00:00#910.9376|08:05:00#915.5463|08:10:00#923.7726|08:15:00#928.39923|08:20:00#932.8077|08:25:00#93"
    
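    result = []
    # Each record looks like HH:MM:SS#value. The dot is escaped so it only matches
    # a literal ".", and the trailing "|" is dropped because it made the pattern
    # match empty strings as well.
    pattern = re.compile(r"(\d+:\d+:\d+)#(\d+\.\d+)")
    jieguo = pattern.findall(shuju)
    print(jieguo)
    # The truncated last record (08:25:00#93) has no decimal part, so it is skipped.
    for time_str, value_str in jieguo:
        result.append({"time": time_str, "value": float(value_str)})
    print(result)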
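
The snippet above pastes the data string in by hand. A minimal sketch of pulling it out of the scraped response instead, assuming the response is a dict shaped like the one in the question and that the XML is complete (the question's copy is cut off, so the closing tags here are an assumption). `extract_data_string` and `sample` are hypothetical names used only for illustration:

```python
import xml.etree.ElementTree as ET


def extract_data_string(response):
    """Return the text of <root>/<value>/<data> from the response's XML payload."""
    root = ET.fromstring(response["text"])
    node = root.find("./value/data")
    if node is None or node.text is None:
        return ""  # element missing or empty
    return node.text.strip()


# Shortened, made-up sample in the same shape as the question's response.
sample = {
    "mimeType": "text/xml;charset=utf-8",
    "text": (
        "<root><keyIdInfo value='系统总值'/><flag>increment</flag>"
        "<value><data>00:00:00#751.08307|00:05:00#741.41266|</data></value></root>"
    ),
}

shuju = extract_data_string(sample)
print(shuju)  # 00:00:00#751.08307|00:05:00#741.41266|
```

Using an XML parser rather than string slicing keeps this step independent of the order of the other elements in the response.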

    I'll leave the sorting to you.
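
For the maximum the question asks about, one possible sketch is below. It is not the answerer's code: it swaps the regex for plain `str.split`, and `parse_records` and the shortened `sample` string are hypothetical names and data chosen for illustration.

```python
def parse_records(data):
    """Split 'HH:MM:SS#value|HH:MM:SS#value|...' into (time, value) tuples."""
    records = []
    for item in data.split("|"):
        time_str, sep, value_str = item.partition("#")
        if not sep:
            continue  # empty piece, e.g. after a trailing "|"
        try:
            records.append((time_str, float(value_str)))
        except ValueError:
            continue  # value is not a number, skip the record
    return records


# Shortened sample in the same format as the question's data.
sample = "00:00:00#751.08307|00:05:00#741.41266|08:20:00#932.8077|"

records = parse_records(sample)

# max() with a key picks the record whose value is largest.
peak_time, peak_value = max(records, key=lambda r: r[1])
print(peak_time, peak_value)  # 08:20:00 932.8077

# If a full ranking is wanted, sort by value in descending order.
ranked = sorted(records, key=lambda r: r[1], reverse=True)
```

One caveat with this approach: a record cut off mid-number, such as the `08:25:00#93` at the end of the question's data, still parses as `93.0`. That does not change the maximum here, but it would distort a minimum, so truncated input is worth checking for first.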

    This answer was selected by the asker as the best answer.