dongxinm279890 2012-12-28 15:51
浏览 37
已采纳

使用php将文本行解析为不同的变量

I am very new to php so I apologize for the seemingly simple question. I need to parse a line of text into different variables. More specifically, I need to parse many lines of text in different arrays. The line of text would resemble the following

timeStamp UserName* garbage text Number x item*
timeStamp UserName* garbage text Number x item*
timeStamp UserName* garbage text Number x item*

both userName and item could contain spaces. I would assume the best way to go about this would be 4 different arrays?

actual data would look like the following

03:12:34 mhopkins321 has acquired 5 x bottles of water
09:38:01 Nick Smith has acquired 100 x pennies
23:22:59 Fancy Frank has acquired 15684 x artichoke hearts

So I would assume the arrays would be

$timeStamp         $userName        $amount     $items
03:12:34           mhopkins321      5           bottles of water
09:38:01           Nick Smith       100         pennies
23:22:59           Fancy Frank      15684       artichoke hearts
  • 写回答

2条回答 默认 最新

  • duanqiu2064 2012-12-28 16:24
    关注

    This is a very bad format for machine parsing. Especially problematic is that names may have spaces but are not delimited.

    The only foolproof way to parse this is to know all the "garbage text" strings that may appear between the name and the amount. Unless you have a complete list, you may mess up your user names.

    It's possible to parse this using explode() to split a line into an array and then extracting parts. However, I think you should just use a regular expression.

    $sample = "
    03:12:34 mhopkins321 has acquired 5 x bottles of water
    09:38:01 Nick Smith has acquired 100 x pennies
    23:22:59 Fancy Frank has acquired 15684 x artichoke hearts
    ";
    
    $re = '/^(?<timeStamp>[0-9]{2}:[0-9]{2}:[0-9]{2}) # timestamp 
             \s+
             (?<userName>[\w\s]+)        # user name
             \s+(?:has\s+acquired)\s+    # garbage text between name and amount
             (?<amount>\d+)              # amount
             \s+x\s+                     # multiplication symbol
             (?<items>.*)\s*$            # item name (to end of line)
           /xmu';
    
    preg_match_all($re, $sample, $matches, PREG_SET_ORDER);
    
    var_export($matches);
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 cgictest.cgi文件无法访问
  • ¥20 删除和修改功能无法调用
  • ¥15 kafka topic 所有分副本数修改
  • ¥15 小程序中fit格式等运动数据文件怎样实现可视化?(包含心率信息))
  • ¥15 如何利用mmdetection3d中的get_flops.py文件计算fcos3d方法的flops?
  • ¥40 串口调试助手打开串口后,keil5的代码就停止了
  • ¥15 电脑最近经常蓝屏,求大家看看哪的问题
  • ¥60 高价有偿求java辅导。工程量较大,价格你定,联系确定辅导后将采纳你的答案。希望能给出完整详细代码,并能解释回答我关于代码的疑问疑问,代码要求如下,联系我会发文档
  • ¥50 C++五子棋AI程序编写
  • ¥30 求安卓设备利用一个typeC接口,同时实现向pc一边投屏一边上传数据的解决方案。