I need to split on characters that are neither \p{L}
nor the -
. I am a bit confused. \P{L}|[^-]
will obviously not work as everything will match [^-]
. I do not know how to put a Unicode class inside []
. Lookahead / lookbehind will latch on the previous / following character.
In other words, I need to split foo-bar;dásh
into ['foo-bar', 'dásh']
.