Relevant code (please do not paste screenshots)
import re
article = '''
<a href="http://www.test-test.com">http://www.snowwo-fasngoagn.com</a>
http://www.test-test.com
http://www.test-test.com/gsldno-sd/test'''
ss = re.sub(r'http://[a-z0-9/.-]*|https://[a-z0-9/.-]*', "", article, re.I)
print(ss)
print()
match = re.findall(r'http://[a-z0-9/.-]*|https://[a-z0-9/.-]*', article)
print(match)
Output and error messages
<a href=""></a>
http://www.test-test.com
http://www.test-test.com/gsldno-sd/test
['http://www.test-test.com', 'http://www.snowwo-fasngoagn.com', 'http://www.test-test.com', 'http://www.test-test.com/gsldno-sd/test']
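Expected result
I want to use a regular expression to remove every link. re.findall matches all of the links, but re.sub with the same pattern only replaces some of them.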
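
A likely cause, sketched below: the signature is re.sub(pattern, repl, string, count=0, flags=0), so the fourth positional argument is count, not flags. re.I has the integer value 2, which means the call above asks for at most two replacements. That matches the output, where only the two links inside the <a> tag are removed. Passing the flag by keyword should replace every match. The name URL_PATTERN and the shortened https?:// form are my own simplification, not part of the original code; the character class is unchanged.

```python
import re

article = '''
<a href="http://www.test-test.com">http://www.snowwo-fasngoagn.com</a>
http://www.test-test.com
http://www.test-test.com/gsldno-sd/test'''

# Hypothetical name; "https?" covers both http:// and https://
# with the same character class as the pattern in the question.
URL_PATTERN = r'https?://[a-z0-9/.-]*'

# What the original call effectively does: re.I lands in the `count`
# slot, and int(re.I) == 2, so only the first two matches are replaced.
partial = re.sub(URL_PATTERN, "", article, count=int(re.I))

# Passing the flag by keyword leaves count at 0 (replace all matches)
# and actually enables case-insensitive matching.
cleaned = re.sub(URL_PATTERN, "", article, flags=re.I)

print(repr(partial))
print(repr(cleaned))
```

re.findall did not show the problem because its third positional argument really is flags, and in the question it was called without any flag at all.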