如何使用正则表达式添加空格和标点符号来捕获第一组？如何在LibreOffice中停止分成两列的某些标签？

Anyone help me out. Been trying to get this regex working, and it’s nearly there. They all seem to be correct, but the first one should be:

word: el, la
gender: art
word_en: the (+m, f)

The first test string is:

1

el, la art the (+m, f)
• el diccionario tenía también frases útiles – the dictionary also had
useful phrases
2055835 | 201481381

The other issue is that I’ve been trying to simply copy info. from the ‘Substitution’ section into LibreOffice. All I want to do is create 6 columns for the data. The Problem is that the 6th column (sent_en) can sometimes divide between columns ‘G’ and ‘A’, instead of all the data for sent_en being in column ‘G’. If you copy the data below ‘Substitution’ into LibreOffice Calc, you’ll get a better idea of what I mean. I just can’t figure this out, and if someone can help me out I’d really appreciate it. Thanks.

Here’s the link https://regex101.com/r/m3yySN/2/

^

(?<frequency>[0-9]+) \W+
(?<word>\pL+\W?) \h+
(?<gender> [\pL()]+ (?:, \h* [\pL()]+)* ) \h+
(?<word_en> [^•]*[^•\s]) \h* \R

• \h*
(?<sent_esp> [^–]*[^\s–] ) \s*–\s*
(?<sent_en> .* (?:\R .*)*? ) \h* \R

(?<num1> [0-9]+) \h* \| \h*
(?<num2> .*\S)

\1\t\2\t\3\t\4\t\5\t\6\t

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dongpi3237 2017-12-23 22:29
关注
This one was a bit hairy, but after all, just a small adjustment was needed:

^ (?<frequency>[0-9]+) \W+ (?<word>\pL+(?:,\h\pL+|\W)*) \h+ (?<gender> [\pL()]+ (?:, \h* [\pL()]+)* ) \h+ (?<word_en> [^•]*[^•\s]) \h* \R • \h* (?<sent_esp> [^–]*[^\s–] ) \s*–\s* (?<sent_en> .* (?:\R .*)*? ) \h* \R (?<num1> [0-9]+) \h* \| \h* (?<num2> .*\S)

Results look good to me now.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

如何使用正则表达式添加空格和标点符号来捕获第一组？如何在LibreOffice中停止分成两列的某些标签？ php
2017-12-23 20:53

回答 1 已采纳 This one was a bit hairy, but after all, just a small adjustment was needed: ^ (?<frequency&gt
PHP+LibreOffice+Centos实现Word转PDF页面样式设置 centos php
2022-04-21 10:06

回答 1 已采纳 LibreOffice 没有配置样式代码的？
LibreOffice将PDF转换为Word作为文本框而不是普通文档 php
2018-12-13 13:46

回答 1 已采纳 Your problem lies with the software used to create the PDF; output in the form of textboxes in a P
gDriveOOo:您的Google云端硬盘数据终于可以在LibreOffice OpenOffice中使用
2021-05-27 22:23

使用HsqlDB要求在LibreOffice / OpenOffice中安装和配置JRE最低版本为1.8 （即Java版本8）。我建议将作为Java安装源。如果您在Linux上使用LibreOffice ，则可能会。要变通解决此问题，请卸载软件包： libreof
PHP：使用utf8_encode时在csv中错误编码的字符 mysql php
2016-06-23 14:31

回答 2 已采纳 The var_dump shows that the string is already encoded in UTF-8. Using utf8_encode on it will garbl
software安装软件libreoffice出现cannot perform the following tasks怎么解决？ linux ubuntu
2019-06-28 20:44

回答 1 已采纳然而下面的错误信息看不到但是一般来说，是网络的问题，ubuntu英文版，默认的软件源在国外，检查是不是网络问题要么就是权限问题
在WIN7 64位环境下，LibreOffice_6.1.5_Win_x64.msi 安装后，提示安装不成功，自动回滚是怎么回事？ java java-ee
2019-04-12 13:06

回答 1 已采纳资源管理器选择 LibreOffice_6.1.5_Win_x64.msi ，右键。属性，看下数字签名，是否有效，如果无效，重新下载。如果正确，检查系统是否有杀毒软件，如果有，关闭。检查你的win
libreoffice在liunx和uos国产化上面的部署.docx
2021-10-22 14:41

LibreOffice是一款开源的办公软件套件，支持多种操作系统，包括Linux和国产化的UOS（统一操作系统）。在Linux系统上部署LibreOffice涉及以下几个关键步骤： 1. **下载LibreOffice**：首先，你需要访问LibreOffice的...
arm编译libreoffice遇到的问题 java linux ubuntu
2023-02-02 09:23

回答 1 已采纳 “该回答引用ChatGPT”可参考下面的解决方案：看起来是缺失了中文翻译文件，导致编译失败。 1、从 LibreOffice 的官方站点下载相应语言的翻译文件并且把它们放到对应的目录。2、使用不需要
PHP str_getcsv不会分隔索引1和2中的元素 php
2016-08-22 14:57

回答 1 已采纳 So here's your problem: $db = array_map('str_getcsv', file($dbLocation), $paramStrGetCsv); Firs
PHP和XML之间有什么关系？ php xml
2012-08-28 00:03

回答 3 已采纳 XML is not a grammar (that's another thing entirely). XML (as the name suggests) is a markup langu
linux openEuler aarch64架构libreoffice安装包，支持中文字体
2024-06-25 21:37

此版本libreoffice在openEuler aarch64架构的服务器上成功安装，完美兼容。配合对应的中文字体，能够解决转换过程中的中文乱码问题。文档转换命令示例：libreoffice word转pdf(可以替换为html等其他格式) cd /opt/...
Golang Docker多阶段构建无法运行：exec：“执行”：在$ PATH中找不到可执行文件 docker
2019-01-29 23:31

回答 1 已采纳 Your actual build line: RUN CGO_ENABLED=0 GOOS=linux GOARCH=amd64 go build -installsuffix cgo -ld
serverless-libreoffice：在AWS Lambda中运行LibreOffice以创建PDF并转换文档
2021-02-03 13:57

无服务器LibreOffice 给我看代码此回购包含用于运行代码。 ├── compile.sh <-- commands used to compile LibreOffice for Lambda├── infra <-- terraform config to deploy example Lambda│ ├── ...
fc-libreoffice：84 MB LibreOffice可以放入使用Brotli压缩的Aliyun Function Compute中
2021-02-03 18:58

fc-libreoffice是一个开箱即用的word转pdf NPM包。在 docker提供的runtime-nodejs8环境下编译，并且进行了精简，采用压缩比最高的Brotli工具进行打包，最终压缩包大小为84M。这个大小仍然超过了FC 50M的代码包限制...
libreoffice7.1.8 安装教程和启动kkfielview
2022-01-21 09:09

LibreOffice 7.1.8 安装教程和启动 kkFileView LibreOffice 是一个自由及开放源代码的办公套件，提供了 word 处理、电子表格、演示文稿、数据库管理、绘图等功能。kkFileView 是一个文件预览组件，支持多种文件格式...
自由设计办公室：将使用LibreOffice设计的书籍翻译成繁体中文的工作
2021-02-04 08:45

标题中的“自由设计办公室”可能是指一个开源社区或者项目，专注于使用开源工具进行设计工作，而这个特定的任务是将使用LibreOffice设计的书籍翻译成繁体中文。LibreOffice是一款免费且开源的办公软件套件，包含了...
48个常用中文字体，可导入onlyoffice、libreoffice等
2024-05-24 13:12

48个常用中文字体，可导入onlyoffice、libreoffice等
windows下LibreOffice SDK配置
2024-01-19 14:42

windows下，通过官网下载LibreOffice7.4.2.2及SDK包，配置环境变量及生成头文件(C++),可直接使用，文件解压后有使用说明。
LibreOffice7.4.7.2 centos编译arm64版本
2024-03-14 15:36

LibreOffice是一款开源的办公...总的来说，编译LibreOffice 7.4.7.2的arm64版本是为了解决在基于鲲鹏910这类ARM架构设备上使用办公软件的需求，提供与x86架构同样功能丰富的体验，同时充分利用硬件的性能和能效优势。
没有解决我的问题, 去提问

悬赏问题

¥15 metadata提取的PDF元数据，如何转换为一个Excel
¥15 关于arduino编程toCharArray()函数的使用
¥100 vc++混合CEF采用CLR方式编译报错
¥15 coze 的插件输入飞书多维表格 app_token 后一直显示错误，如何解决？
¥15 vite+vue3+plyr播放本地public文件夹下视频无法加载
¥15 c#逐行读取txt文本，但是每一行里面数据之间空格数量不同
¥50 如何openEuler 22.03上安装配置drbd
¥20 ING91680C BLE5.3 芯片怎么实现串口收发数据
¥15 无线连接树莓派，无法执行update，如何解决？（相关搜索：软件下载）
¥15 Windows11, backspace, enter, space键失灵

如何使用正则表达式添加空格和标点符号来捕获第一组？ 如何在LibreOffice中停止分成两列的某些标签？

1条回答 默认 最新

悬赏问题

如何使用正则表达式添加空格和标点符号来捕获第一组？如何在LibreOffice中停止分成两列的某些标签？

1条回答默认最新