XXXTEN6B 2022-10-09 23:57 Acceptance rate: 100%

Python question: the page source I fetch does not match the code shown in the developer tools

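I am a beginner at web scraping with Python. I fetched the source of an AliExpress page with the urlopen function, saved it as an HTML file, and opened it in a browser, but the page source still does not match what the developer tools show.
The code shown in the developer tools is an ordinary tree structure, and everything I need sits inside the corresponding tags.
In the page source, however, the names, prices, and other details of the products on sale in the category I want are all packed into a single dictionary, as shown below (partial code).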

<script type="text/javascript">
        window._isCrawler = false;
                        window.runParams = {};
                    window.runParams = {"mods":{"legalJeopardyInfo":{"tItemType":"nt_mainsearch_legal_jeopardy_info","content":{"isShowHowWeRank":true}},"itemList":{"content":[{"lunchTime":"2022-09-29 00:00:00","image":{"imgUrl":"//ae01.alicdn.com/kf/S41c1fd814a354ada853b4c6cff0b0dadF/Apple-AirPods-2nd-Generation-Wireless-Earbuds-with-Lightning-Charging-Case-Included-Bluetooth-Headphones-for-iPhone.jpg_220x220xz.jpg","imgWidth":220,"imgHeight":220,"imgType":"0"},"itemType":"productV3","trace":{"exposure":{"displayCategoryId":"","postCategoryId":"63705","selling_point":"885603359","algo_exp_id":"801fc9ba-caa1-45ba-a75f-41e519ccf485-0"},"custom":{},"detailPage":{"algo_pvid":"801fc9ba-caa1-45ba-a75f-41e519ccf485","algo_exp_id":"801fc9ba-caa1-45ba-a75f-41e519ccf485-0"},"pdpParams":{"pdp_cdi":"%7B%22traceId%22%3A%222100bdec16653078524525573ed331%22%2C%22itemId%22%3A%221005004801091733%22%2C%22fromPage%22%3A%22search%22%2C%22skuId%22%3A%2212000030537225703%22%2C%22shipFrom%22%3A%22CN%22%2C%22order%22%3A%225%22%2C%22star%22%3A%22%22%2C%22freeShip%22%3A%22true%22%7D","pdp_npi":"2%40dis%21EUR%21120.87%21120.87%21%21%21%21%21%402100bdec16653078524525573ed331%2112000030537225703%21sea","pdp_ext_f":"%7B%22sku_id%22%3A%2212000030537225703%22%7D"},"click":{"algo_pvid":"801fc9ba-caa1-45ba-a75f-41e519ccf485","haveSellingPoint":"true"},"utLogMap":{"original_price_type":"offer","formatted_price":"€ 120,87","csp":"120.87,1","x_object_type":"productV3","algo_pvid":"801fc9ba-caa1-45ba-a75f-41e519ccf485","hit_19_forbidden":false,"is_detail_next":"1","model_ctr":0.2603732645511627,"sku_id":"12000030537225703","mixrank_success":"false","sku_ic_tags":"[]","is_adult_certified":false,"mixrank_enable":"false","oip":"120.87,0","selling_point":"885603359","original_price_strategy":"default","bsp":"120.87,1","x_object_id":"1005004801091733"}},"trade":{"tradeDesc":"5 
vendu\u0028s\u0029"},"nativeCardType":"nt_srp_cell_g","productId":"1005004801091733","store":{"storeUrl":"//fr.aliexpress.com/store/1100085087","aliMemberId":240289074,"storeName":"Shop1100085087 Store","storeId":1100085087},"title":{"displayTitle":"Apple – écouteurs sans fil AirPods 2e génération, oreillettes avec étui de chargement Lightning inclus. Écouteurs Bluetooth pour iPhone"

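Why does the source of this static page (I am not sure it is actually static), once saved as an HTML file and reopened, still differ from what the developer tools show? And how can I get the multiple values that correspond to a given key in this dictionary?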

2 answers

  • 爱音斯坦牛 (top-rated full-stack creator) 2022-10-10 00:01

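    The page is rendered dynamically by JavaScript. For pages like this, open the Network tab in the developer tools, find the API endpoint that returns the data, and send your request to that endpoint directly.
    The other, simplest option is to scrape with selenium, which drives a real browser and can therefore handle dynamically loaded data.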
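
On the second part of the question (reading values out of the dictionary): the snippet the asker pasted suggests the product data is already present in the raw HTML as a JSON object assigned to window.runParams, so it may be possible to parse it without a browser. The following is only a minimal sketch under that assumption. The helper names extract_run_params and find_values are hypothetical, and sample_html is a heavily abbreviated stand-in for the real page, which could change its layout at any time.

```python
import json
import re


def extract_run_params(html):
    """Return the last non-empty object assigned to window.runParams, or {}."""
    # strict=False tolerates raw control characters (e.g. newlines) inside strings.
    decoder = json.JSONDecoder(strict=False)
    result = {}
    # The page assigns window.runParams more than once (first to an empty {}),
    # so try every assignment and keep the last non-empty dict.
    for match in re.finditer(r"window\.runParams\s*=\s*", html):
        try:
            # raw_decode parses one JSON value starting at the given index
            # and ignores whatever follows it (the trailing ";" and script).
            obj, _ = decoder.raw_decode(html, match.end())
        except json.JSONDecodeError:
            continue
        if isinstance(obj, dict) and obj:
            result = obj
    return result


def find_values(node, key):
    """Recursively collect every value stored under `key` in nested dicts/lists."""
    found = []
    if isinstance(node, dict):
        for k, v in node.items():
            if k == key:
                found.append(v)
            found.extend(find_values(v, key))
    elif isinstance(node, list):
        for item in node:
            found.extend(find_values(item, key))
    return found


# Abbreviated, made-up sample that mimics the structure shown in the question.
sample_html = """
<script type="text/javascript">
    window.runParams = {};
    window.runParams = {"mods": {"itemList": {"content": [
        {"productId": "1", "title": {"displayTitle": "Item A"}},
        {"productId": "2", "title": {"displayTitle": "Item B"}}
    ]}}};
</script>
"""

params = extract_run_params(sample_html)
print(find_values(params, "displayTitle"))  # prints ['Item A', 'Item B']
```

With a real page, the html string would be the decoded text returned by urlopen, and keys such as "displayTitle" or "formatted_price" (both visible in the pasted snippet) could be passed to find_values. If the object turns out to be truncated or not valid JSON, extract_run_params simply returns an empty dict.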

    The asker selected this as the best answer.

Question events

  • Closed by the system on October 18
  • Answer accepted on October 10
  • Question created on October 9
