dsfsad089111 2018-04-05 16:13
浏览 39
已采纳

正则表达式将测试Go中的拉丁字母

I'm trying to write a regex in Go to test for Latin letters only.

I know that \p{Latin} matches with any Latin script characters, but it also matches things such as Roman Numerals (e.g. "ⅻ"). That leads me to \p{L} which matches Unicode letters, but it matches any script, not just Latin.

Best I've been able to come with so far is two regexes with an &&:

latinRe := regexp.MustCompile(`\p{Latin}`)
letterRe := regexp.MustCompile(`\p{L}`)
if latinRe.Matches(testString) && letterRe.Matches(testString) {...}

I'm not happy that I can't test this as easily using something like regex101.com. Is there a better way? More succinct? Performant?

  • 写回答

1条回答 默认 最新

  • duanbei3704 2018-04-05 16:20
    关注

    You can use a range like the following to specify all the characters you want to match. Depending on the regex engine, one of the following should work:

    See regex in use here: Adapted from this link

    [A-Za-z\u00C0-\u00D6\u00D8-\u00f6\u00f8-\u00ff]
    [A-Za-z\xC0-\xD6\xD8-\xf6\xf8-\xff]
    

    Another option is to negate specific characters from a Unicode character class:

    See regex in use here

    [^\P{Latin}\p{N}]
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥15 微信会员卡接入微信支付商户号收款
  • ¥15 如何获取烟草零售终端数据
  • ¥15 数学建模招标中位数问题
  • ¥15 phython路径名过长报错 不知道什么问题
  • ¥15 深度学习中模型转换该怎么实现
  • ¥15 HLs设计手写数字识别程序编译通不过
  • ¥15 Stata外部命令安装问题求帮助!
  • ¥15 从键盘随机输入A-H中的一串字符串,用七段数码管方法进行绘制。提交代码及运行截图。
  • ¥15 TYPCE母转母,插入认方向
  • ¥15 如何用python向钉钉机器人发送可以放大的图片?