How can I use a regular expression to extract
the content that # stands for in <a href="#">?
Thanks!
For a single match, you can use the following code:
import re
html = '<a href="#">'
pattern = r'href="(.*?)"'
match = re.search(pattern, html)
if match:
    value = match.group(1)
    print(value)  # Output: #
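
The pattern above only matches double-quoted values with no spaces around the equals sign. If the input might also use single quotes, uppercase attribute names, or spaces around "=", a slightly more tolerant pattern could look like the sketch below. The names HREF_RE and first_href are made up for illustration and are not part of the original answer.

```python
import re

# Hypothetical pattern: the first group captures the opening quote (" or '),
# and \1 requires the same quote character to close the value.
HREF_RE = re.compile(r'href\s*=\s*(["\'])(.*?)\1', re.IGNORECASE)

def first_href(html):
    """Return the first href value found in html, or None if there is none."""
    match = HREF_RE.search(html)
    return match.group(2) if match else None

print(first_href('<a href="#">'))            # prints: #
print(first_href("<A HREF = 'page.html'>"))  # prints: page.html
```

Because the quote is captured in group 1, the href value itself moves to group 2.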
To get all matches, you can use the following code:
import re
html = '<a href="#">'
pattern = r'href="(.*?)"'
matches = re.findall(pattern, html)
for match in matches:
    print(match)
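
Regular expressions work for simple snippets like this, but they can trip over real-world HTML (different quoting, attributes in comments, and so on). As an alternative, here is a minimal sketch using html.parser from the Python standard library. The names HrefCollector and extract_hrefs are made up for illustration; this is not a regex solution, just another way to get the same values.

```python
from html.parser import HTMLParser

class HrefCollector(HTMLParser):
    """Hypothetical helper that collects href values from <a> start tags."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag and attribute names in lowercase.
        if tag == "a":
            for name, value in attrs:
                # value is None for a bare attribute such as <a href>.
                if name == "href" and value is not None:
                    self.hrefs.append(value)

def extract_hrefs(html):
    """Return a list of all href values found on <a> tags in html."""
    parser = HrefCollector()
    parser.feed(html)
    parser.close()
    return parser.hrefs

print(extract_hrefs('<a href="#">top</a> <a href=\'/next\'>next</a>'))
# prints: ['#', '/next']
```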