duanpin2034 2019-06-21 15:28
浏览 68
已采纳

复杂的正则表达式,用于获取字符串中花括号的数字

I need to get the number in curly braces in a complex string. Basically I am parsing some data, and I need to extract the ids that are represented in a curly braces.

Example string:

{1|{1078} {*|{1079}-}test{1|{4829}, test2 {4457}}} {*|{1078} {*|{1079}-}test3{1|{4829}, test4 {23232}}}

What I exactly need is to extract the number in curly braces that is near the pipe (|{4829}, |{1079}, |{1078}), and not any others numbers, so my end result would be something like:

4829,1079,1078

or array of this numbers, it does not matter. It should be unique values but that is not problem for me. My problem is create regex that will just extract those numbers. I have tried a lot of stuff during this day, latest one what I have tried is this:

public static function getAllAttributeIDsFromTheRule($attributeValues)
{

    preg_match_all('/{(.*?)}/', $attributeValues, $matches);
    preg_match_all('\|{\d*}', $attributeValues, $matches[1]);

    $attributeIDsWithPipe = (implode('', self::clean($matches[1])));

    $attributeIDs = explode('|', $attributeIDsWithPipe);

    var_dump($attributeIDs);

}

public static function clean($string) 
{
    $string = str_replace(' ', '-', $string);

    return preg_replace('/[^A-Za-z0-9|\-]/', '', $string);
}   

But I am always stuck with one other character in the result.In some result I get number extra or something like that. Now, it is time to ask for help if someone knows the better approach. Much appreciated.

  • 写回答

2条回答 默认 最新

  • duanping6698 2019-06-21 15:30
    关注

    You may use a regex to match |{, some 1+ digits, and } and capture the digits inside into a capturing group, and then just access the values from the group using $matches[1]:

    if (preg_match_all('~\|\{(\d+)}~', $s, $matches)) {
        print_r($matches[1]);
    }
    

    See the regex demo and the regex graph:

    enter image description here

    PHP demo:

    $s = '{1|{1078} {*|{1079}-}test{1|{4829}, test2 {4457}}} {*|{1078} {*|{1079}-}test3{1|{4829}, test4 {23232}}}';
    if (preg_match_all('~\|\{(\d+)}~', $s, $matches)) {
        print_r(array_unique($matches[1]));
    }
    // => Array ( [0] => 1078 [1] => 1079  [2] => 4829 )
    

    NOTE: array_unique will keep unique values only in the results.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 metadata提取的PDF元数据,如何转换为一个Excel
  • ¥15 关于arduino编程toCharArray()函数的使用
  • ¥100 vc++混合CEF采用CLR方式编译报错
  • ¥15 coze 的插件输入飞书多维表格 app_token 后一直显示错误,如何解决?
  • ¥15 vite+vue3+plyr播放本地public文件夹下视频无法加载
  • ¥15 c#逐行读取txt文本,但是每一行里面数据之间空格数量不同
  • ¥50 如何openEuler 22.03上安装配置drbd
  • ¥20 ING91680C BLE5.3 芯片怎么实现串口收发数据
  • ¥15 无线连接树莓派,无法执行update,如何解决?(相关搜索:软件下载)
  • ¥15 Windows11, backspace, enter, space键失灵