duangelin7513 2016-08-13 01:01
浏览 795
已采纳

golang如何使用正则表达式将字符串拆分为包含分界符的切片

I want to split a string to a slice but want to keep the delimiters. Is there a way to do that in golang?

For example

Input:

"Hello! It's, a? beautiful$ day* (today and tomorrow).

Output:

[Hello | ! | It's | , | a | ? | beautiful | $ | day | * | ( | today | and | tomorrow | ) | . ]

where | represents the separation of elements.

Can anyone help?

  • 写回答

1条回答 默认 最新

  • duanlachu7344 2016-08-13 03:08
    关注

    You can do that by creating a regular expression that matches either a word or one of your special characters. I don't know exactly what your rules are, but given the input and desired output, this works:

    [A-Za-z']+|[*?()$.,!]
    

    You can then use FindAllString to return the individual strings.

    See https://play.golang.org/p/se6B74G_Fv.

    The idea of the regular expression is to match either a word, here defined as a sequence of one or more upper or lower case letters or apostrophes, or a "special character", defined as any of the non-word characters in your sample input. The regular expression contains two parts separated by a |, meaning that either alternative can match. A set of characters contained within square brackets is called a character class, which means that any of the characters can match (and a-z means all characters from a to z). Since the left alternative is a character class followed by a +, one or more of the characters in the class will match. The portion on the right only matches one character. You may need to adjust this to behave exactly the way you want. For more information on Go regular expressions, see https://github.com/google/re2/wiki/Syntax.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?

悬赏问题

  • ¥20 蓝牙耳机怎么查看日志
  • ¥15 Fluent齿轮搅油
  • ¥15 八爪鱼爬数据为什么自己停了
  • ¥15 交替优化波束形成和ris反射角使保密速率最大化
  • ¥15 树莓派与pix飞控通信
  • ¥15 自动转发微信群信息到另外一个微信群
  • ¥15 outlook无法配置成功
  • ¥30 这是哪个作者做的宝宝起名网站
  • ¥60 版本过低apk如何修改可以兼容新的安卓系统
  • ¥25 由IPR导致的DRIVER_POWER_STATE_FAILURE蓝屏