dongqi19827 2017-12-23 20:53
浏览 244
已采纳

如何使用正则表达式添加空格和标点符号来捕获第一组? 如何在LibreOffice中停止分成两列的某些标签?

Anyone help me out. Been trying to get this regex working, and it’s nearly there. They all seem to be correct, but the first one should be:

word: el, la
gender: art
word_en: the (+m, f)

The first test string is:

1

el, la art the (+m, f)
• el diccionario tenía también frases útiles – the dictionary also had
useful phrases
2055835 | 201481381

The other issue is that I’ve been trying to simply copy info. from the ‘Substitution’ section into LibreOffice. All I want to do is create 6 columns for the data. The Problem is that the 6th column (sent_en) can sometimes divide between columns ‘G’ and ‘A’, instead of all the data for sent_en being in column ‘G’. If you copy the data below ‘Substitution’ into LibreOffice Calc, you’ll get a better idea of what I mean. I just can’t figure this out, and if someone can help me out I’d really appreciate it. Thanks.

Here’s the link https://regex101.com/r/m3yySN/2/

^

(?<frequency>[0-9]+) \W+
(?<word>\pL+\W?) \h+
(?<gender> [\pL()]+ (?:, \h* [\pL()]+)* ) \h+
(?<word_en> [^•]*[^•\s]) \h* \R

• \h*
(?<sent_esp> [^–]*[^\s–] ) \s*–\s*
(?<sent_en> .* (?:\R .*)*? ) \h* \R

(?<num1> [0-9]+) \h* \| \h*
(?<num2> .*\S)

\1\t\2\t\3\t\4\t\5\t\6\t
  • 写回答

1条回答 默认 最新

  • dongpi3237 2017-12-23 22:29
    关注

    This one was a bit hairy, but after all, just a small adjustment was needed:

    ^
    (?<frequency>[0-9]+) \W+
    (?<word>\pL+(?:,\h\pL+|\W)*) \h+
    (?<gender> [\pL()]+ (?:, \h* [\pL()]+)* ) \h+
    (?<word_en> [^•]*[^•\s]) \h* \R
    • \h*
    (?<sent_esp> [^–]*[^\s–] ) \s*–\s*
    (?<sent_en> .* (?:\R .*)*? ) \h* \R
    (?<num1> [0-9]+) \h* \| \h*
    (?<num2> .*\S)
    

    Results look good to me now.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 metadata提取的PDF元数据,如何转换为一个Excel
  • ¥15 关于arduino编程toCharArray()函数的使用
  • ¥100 vc++混合CEF采用CLR方式编译报错
  • ¥15 coze 的插件输入飞书多维表格 app_token 后一直显示错误,如何解决?
  • ¥15 vite+vue3+plyr播放本地public文件夹下视频无法加载
  • ¥15 c#逐行读取txt文本,但是每一行里面数据之间空格数量不同
  • ¥50 如何openEuler 22.03上安装配置drbd
  • ¥20 ING91680C BLE5.3 芯片怎么实现串口收发数据
  • ¥15 无线连接树莓派,无法执行update,如何解决?(相关搜索:软件下载)
  • ¥15 Windows11, backspace, enter, space键失灵