qq_52655339 2021-06-14 11:17 采纳率: 100%
浏览 14
已采纳

用不了正则表达式的group()

import re
import requests

url = "https://www.xbiquge.la/"
def getbook(url):

    dic = {
        "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.101 Safari/537.36 Edg/91.0.864.48"
    }
    get_url =requests.get(url,headers=dic)
    get_url.encoding='utf-8'
    page_content = get_url.text
    obj =re.compile(
        r'<div class="item">.*?<span>(?P<name>.*?)</span>'
        r'<a href="(?P<href>.*?)">'
        r'(?P<shuming>.*?)</a></dt>.*?'
        r'<dd>(?P<jieshao>.*?)</dd>',re.S)

    result =obj.findall(page_content)
    
    for it in result:
        print(it)
        print(it.group("name"))#这里用不了group
getbook(url)

 

  • 写回答

2条回答 默认 最新

  • 关注
    import re
    import requests
    url = "https://www.xbiquge.la/"
    def getbook(url):
        dic = {
            "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.101 Safari/537.36 Edg/91.0.864.48"
        }
        get_url =requests.get(url,headers=dic)
        get_url.encoding='utf-8'
        page_content = get_url.text
        obj =re.compile(
            r'<div class="item">.*?<span>(?P<name>.*?)</span>'
            r'<a href="(?P<href>.*?)">'
            r'(?P<shuming>.*?)</a></dt>.*?'
            r'<dd>(?P<jieshao>.*?)</dd>',re.S)
        result =obj.findall(page_content)
        for it in result:
            print(it)
        print(re.search(obj,page_content).group("name"))
    getbook(url)

    group不能对元组用

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 加热介质是液体,换热器壳侧导热系数和总的导热系数怎么算
  • ¥15 想问一下树莓派接上显示屏后出现如图所示画面,是什么问题导致的
  • ¥100 嵌入式系统基于PIC16F882和热敏电阻的数字温度计
  • ¥15 cmd cl 0x000007b
  • ¥20 BAPI_PR_CHANGE how to add account assignment information for service line
  • ¥500 火焰左右视图、视差(基于双目相机)
  • ¥100 set_link_state
  • ¥15 虚幻5 UE美术毛发渲染
  • ¥15 CVRP 图论 物流运输优化
  • ¥15 Tableau online 嵌入ppt失败