查看网页代码发现数字被转码了,导致python输出结果全是乱的。代码如下:
# coding:utf-8
from selenium import webdriver
browser = webdriver.Chrome()
# chrome_options = webdriver.ChromeOptions()
# chrome_options.add_argument('--headless')
# browser = webdriver.Chrome(chrome_options=chrome_options)
# browser = webdriver.PhantomJS()
# browser.maximize_window() # 最大化窗口,可以选择设置
browser.get('http://data.eastmoney.com/kzz/default.html')
element = browser.find_element_by_css_selector('#dt_1')
# 提取表格内容td
td_content = element.find_elements_by_tag_name("td")
lst = []
for td in td_content:
lst.append(td.text)
print(lst)
网页乱码定位图片如下已上传!