龙哥 毛哼哼爸 2021-06-01 19:20 采纳率: 33.3%
浏览 304

请教如何使用python从word中依次提取想要的内容(文字和数字),存入CSV

我的word是这样的:

分组:2 打印结果依次为 i,b1,P1,breakpoint,b2,P2

和生物燃料的线性关系

 

IL6,IL10无断点

 

Version:1.0 StartHTML:0000000107 EndHTML:0000004359 StartFragment:0000000127 EndFragment:0000004341

] "此时变量y 为:"

[1] "IL1b"

[1] "IL1b"               "-0.107648809523819" "0.339014229656206"  "22.7457597598328"   "0.103020721767575"

[6] "0.361098065608572"

[1] "breakpoint:"

[1] 22.74576

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL5"

[1] "IL5"                 "0.0393546517271973"  "0.507875620155004"   "44.9984318923382"    "-0.0739995182265517"

[6] "0.25727508099701"   

[1] "breakpoint:"

[1] 44.99843

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL13"

[1] "IL13"               "0.376060723360758"  "0.0653383537180709" "49.9999861005274"   "-0.542856877790935"

[6] "0.0169611166504877"

[1] "breakpoint:"

[1] 49.99999

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL17"

[1] "IL17"               "0.0860326532682866" "0.803489195497008"  "53.3990288656935"   "-0.370486142873717"

[6] "0.357193768344205"

[1] "breakpoint:"

[1] 53.39903

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IFNr"

[1] "IFNr"              "-7.94244047618997" "0.325286936422967" "22.4305844194408"  "7.76908912355428"

[6] "0.337025369493169"

[1] "breakpoint:"

[1] 22.43058

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "EOTAXIN"

[1] "EOTAXIN"           "-3.19220024544197" "0.466336086251742" "24.99888058906"    "3.28965729656367"

[6] "0.454145441048398"

[1] "breakpoint:"

[1] 24.99888

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "MIP1a"

[1] "MIP1a"              "0.0582462737049946" "0.643667463861282"  "49.9993850823633"   "-0.135150111858338"

[6] "0.355761901203293"

[1] "breakpoint:"

[1] 49.99939

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "MIP1b"

[1] "MIP1b"             "1.18660626056846"  "0.594131897158957" "40.0035997039163"  "-1.90512501059916"

[6] "0.414291160749824"

[1] "breakpoint:"

[1] 40.0036

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "PDGFbb"

[1] "PDGFbb"            "1846.74539973546"  "0.180428569662787" "10.434272525514"   "-1852.54843445444"

[6] "0.17910971607734"

[1] "breakpoint:"

[1] 10.43427

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "VEGF"

[1] "VEGF"               "1.55264914223873"   "0.0539185505804264" "57.8416286807244"   "-2.40165569187811"

[6] "0.0118117208877093"

[1] "breakpoint:"

[1] 57.84163

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL4"

[1] "IL4"               "1.27999999998011"  "0.330057548990383" "10.0985173827728"  "-1.28961138277114"

[6] "0.326504655595597"

[1] "breakpoint:"

[1] 10.09852

[1] "-----------------------(end)-----------------------"

 

分组3和生物燃料的线性关系

Version:1.0 StartHTML:0000000107 EndHTML:0000002924 StartFragment:0000000127 EndFragment:0000002906

] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IP10"

[1] "IP10"              "12.4103946012244"  "0.277783987110855" "75.0002533890576"  "-22.3140527255365"

[6] "0.22861438928901"

[1] "breakpoint:"

[1] 75.00025

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL5"

[1] "IL5"                 "0.0193789625792632"  "0.902503822777654"   "37.5002033563521"    "-0.0428856630639781"

[6] "0.788687079613217"  

[1] "breakpoint:"

[1] 37.5002

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL13"

[1] "IL13"               "0.325190826123019"  "0.366478086988692"  "33.4755966365242"   "-0.428677652371574"

[6] "0.240316877278676"

[1] "breakpoint:"

[1] 33.4756

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL17"

[1] "IL17"               "-0.558499768286659" "0.20158773812542"   "56.5034911966074"   "0.612674327358895"

[6] "0.23718679716517"  

[1] "breakpoint:"

[1] 56.50349

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IFNr"

[1] "IFNr"               "0.208709345635782"  "0.901327773715502"  "39.9974562419409"   "-0.688939209353522"

[6] "0.687155159991812"

[1] "breakpoint:"

[1] 39.99746

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "EOTAXIN"

[1] "EOTAXIN"           "-1.15079157549727" "0.131659696708967" "60.0026382447805"  "1.15538882349681"

[6] "0.238787654104007"

[1] "breakpoint:"

[1] 60.00264

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] IL1b、"MIP1a"'MIP1b' 无断点

Version:1.0 StartHTML:0000000107 EndHTML:0000001870 StartFragment:0000000127 EndFragment:0000001852

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "VEGF"

[1] "VEGF"               "1.66670588555979"   "0.125015139247943"  "39.9999439538872"   "-2.39061297133361"

[6] "0.0372644819819421"

[1] "breakpoint:"

[1] 39.99994

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL4"

[1] "IL4"                 "-0.0254351161976871" "0.123794542106226"   "56.9905835060053"    "0.0279678266289953"

[6] "0.153137116770019"  

[1] "breakpoint:"

[1] 56.99058

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL6"

[1] "IL6"                "0.351828340746056"  "0.602891815008419"  "19.9998507920235"   "-0.372396178356298"

[6] "0.582588645807018"

[1] "breakpoint:"

[1] 19.99985

[1] "-----------------------(end)-----------------------"

 

分组4 和吸烟指数的线性关系

 

Version:1.0 StartHTML:0000000107 EndHTML:0000005081 StartFragment:0000000127 EndFragment:0000005063

 "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL1b"

[1] "IL1b"                 "-0.00079996410203338" "0.95494457079189"     "80.1870218402303"     "0.0295699318762593"  

[6] "0.692083125848704"   

[1] "breakpoint:"

[1] 80.18702

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL4"

[1] "IL4"                  "-0.00513413500011774" "0.415717909845008"    "80.0017931939949"     "0.0502544263802562"  

[6] "0.556949554676668"   

[1] "breakpoint:"

[1] 80.00179

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL6"

[1] "IL6"               "-2.0429726929538"  "0.798973280443047" "9.99607225859261"  "2.02716752333531"

[6] "0.800507284177805"

[1] "breakpoint:"

[1] 9.996072

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IP10"

[1] "IP10"              "-5.38433533275196" "0.574798396801004" "50.0000638177571"  "9.7987227897167"  

[6] "0.500823663422504"

[1] "breakpoint:"

[1] 50.00006

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL5"

[1] "IL5"                "-0.762677080425243" "0.909639326867313"  "4.99993547455178"   "0.768430807053706"

[6] "0.908961095448069"

[1] "breakpoint:"

[1] 4.999935

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL13"

[1] "IL13"                "-0.0570261292187141" "0.67560959640963"    "44.0997828389191"    "0.257266403828407"  

[6] "0.179917685869111"  

[1] "breakpoint:"

[1] 44.09978

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IL17"

[1] "IL17"                "-0.0176581511638577" "0.948335914945211"   "87.3242868029467"    "1.8457679536987"    

[6] "0.618481022579001"  

[1] "breakpoint:"

[1] 87.32429

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "IFNr"

[1] "IFNr"               "-0.241346975676868" "0.0964784334948085" "83.6077155777258"   "1.17134697567687"  

[6] "0.54784163741068"  

[1] "breakpoint:"

[1] 83.60772

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "EOTAXIN"

[1] "EOTAXIN"           "-14.7167078403157" "0.904324624541991" "4.99984053647891"  "15.0703979984718"

[6] "0.902037476290304"

[1] "breakpoint:"

[1] 4.999841

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "MIP1a"

[1] "MIP1a"             "0.624281631043076" "0.443600272415238" "39.9995462652503"  "-0.83490747532721"

[6] "0.363105386563193"

[1] "breakpoint:"

[1] 39.99955

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "MIP1b"

[1] "MIP1b"             "0.734522601747885" "0.692925217886953" "86.2751310735082"  "7.35286258416322"

[6] "0.212823119524333"

[1] "breakpoint:"

[1] 86.27513

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "PDGFbb"

[1] "PDGFbb"            "26.5169631381258"  "0.384695417111139" "20.9992465758182"  "-22.2639148568416"

[6] "0.475822007586229"

[1] "breakpoint:"

[1] 20.99925

[1] "-----------------------(end)-----------------------"

[1] "-----------------------(start)-----------------?------"

[1] "此时变量y 为:"

[1] "VEGF"

[1] "VEGF"              "13.8462735097656"  "0.733116172348654" "7.56578155700165"  "-13.580946796839"

[6] "0.738051494868929"

[1] "breakpoint:"

[1] 7.565782

[1] "-----------------------(end)-----------------------"

我想依次提取出分组2(3,4)和生物燃料的线性关系

然后把  "IL1b"               "-0.107648809523819" "0.339014229656206"  "22.7457597598328"   "0.103020721767575"  这行中的字符串内容提取出来,还有下面那行[6]后面的 "0.361098065608572"提取出来,以及后面的"IL5" 、"IL13"后面的字符串也提出来,

按下面的这个顺序,写入CSV文件。

像这样:

分组:2   和生物燃料的线性关系

IL1b   -0.10764881      0.339014229656206    22.7457597598328  0.103020721767575 0.361098065608572

IL5

IL13

....

分组:3   和生物燃料的线性关系

IL1b   

IL5

IL13









 

  • 写回答

1条回答 默认 最新

  • CSDN-Ada助手 CSDN-AI 官方账号 2022-09-07 17:06
    关注
    不知道你这个问题是否已经解决, 如果还没有解决的话:

    如果你已经解决了该问题, 非常希望你能够分享一下解决方案, 以帮助更多的人 ^-^
    评论

报告相同问题?

悬赏问题

  • ¥15 校内二手商品转让网站
  • ¥20 高德地图聚合图层MarkerCluster聚合多个点,但是ClusterData只有其中部分数据,原因应该是有经纬度重合的地方点,现在我想让ClusterData显示所有点的信息,如何实现?
  • ¥100 求Web版SPC控制图程序包调式
  • ¥20 指导如何跑通以下两个Github代码
  • ¥15 大家知道这个后备文件怎么删吗,为啥这些文件我只看到一份,没有后备呀
  • ¥15 C++为什么这个代码没报错运行不出来啊
  • ¥15 一道ban了很多东西的pyjail题
  • ¥15 关于#r语言#的问题:如何将生成的四幅图排在一起,且对变量的赋值进行更改,让组合的图漂亮、美观@(相关搜索:森林图)
  • ¥15 C++识别堆叠物体异常
  • ¥15 微软硬件驱动认证账号申请