dongxinm279890 2012-12-28 15:51
浏览 37
已采纳

使用php将文本行解析为不同的变量

I am very new to php so I apologize for the seemingly simple question. I need to parse a line of text into different variables. More specifically, I need to parse many lines of text in different arrays. The line of text would resemble the following

timeStamp UserName* garbage text Number x item*
timeStamp UserName* garbage text Number x item*
timeStamp UserName* garbage text Number x item*

both userName and item could contain spaces. I would assume the best way to go about this would be 4 different arrays?

actual data would look like the following

03:12:34 mhopkins321 has acquired 5 x bottles of water
09:38:01 Nick Smith has acquired 100 x pennies
23:22:59 Fancy Frank has acquired 15684 x artichoke hearts

So I would assume the arrays would be

$timeStamp         $userName        $amount     $items
03:12:34           mhopkins321      5           bottles of water
09:38:01           Nick Smith       100         pennies
23:22:59           Fancy Frank      15684       artichoke hearts
  • 写回答

2条回答 默认 最新

  • duanqiu2064 2012-12-28 16:24
    关注

    This is a very bad format for machine parsing. Especially problematic is that names may have spaces but are not delimited.

    The only foolproof way to parse this is to know all the "garbage text" strings that may appear between the name and the amount. Unless you have a complete list, you may mess up your user names.

    It's possible to parse this using explode() to split a line into an array and then extracting parts. However, I think you should just use a regular expression.

    $sample = "
    03:12:34 mhopkins321 has acquired 5 x bottles of water
    09:38:01 Nick Smith has acquired 100 x pennies
    23:22:59 Fancy Frank has acquired 15684 x artichoke hearts
    ";
    
    $re = '/^(?<timeStamp>[0-9]{2}:[0-9]{2}:[0-9]{2}) # timestamp 
             \s+
             (?<userName>[\w\s]+)        # user name
             \s+(?:has\s+acquired)\s+    # garbage text between name and amount
             (?<amount>\d+)              # amount
             \s+x\s+                     # multiplication symbol
             (?<items>.*)\s*$            # item name (to end of line)
           /xmu';
    
    preg_match_all($re, $sample, $matches, PREG_SET_ORDER);
    
    var_export($matches);
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥35 MIMO天线稀疏阵列排布问题
  • ¥60 用visual studio编写程序,利用间接平差求解水准网
  • ¥15 Llama如何调用shell或者Python
  • ¥20 谁能帮我挨个解读这个php语言编的代码什么意思?
  • ¥15 win10权限管理,限制普通用户使用删除功能
  • ¥15 minnio内存占用过大,内存没被回收(Windows环境)
  • ¥65 抖音咸鱼付款链接转码支付宝
  • ¥15 ubuntu22.04上安装ursim-3.15.8.106339遇到的问题
  • ¥15 blast算法(相关搜索:数据库)
  • ¥15 请问有人会紧聚焦相关的matlab知识嘛?