doutong4088 2013-07-05 12:37
浏览 50
已采纳

优化正则表达式以捕获电子邮件签名

I have a PHP script that mirror my mailing list to web-based forum, in order to make the forum import look as nice as possible, I use regular expression to catch email signatures & style them appropriately. The signature formats that I'm catching with the regex are:

This is my message...
--
My signature
TheDude.

And

This is my message...
---------------
My signature
TheDude.

Right now I'm using this regex:

$message = preg_replace('/\s*(.+)(\s*[
]-{2,}\s+.*)/s', '$1<span class="msg_footer">$2</span>', $message);

It works, my but after some quick tests, I realized that this regex is really slow.

I'm not that good in regex, can someone please take a look at the regex & tell me how to optimize it & make it fast?

  • 写回答

2条回答 默认 最新

  • duanganleng0577 2013-07-05 12:47
    关注

    You are using regular expressions to handle the whole message, which is bound to be slow. A better alternative would be to use proper programming logic to process the message. For instance, go through the message line by line and test for each line whether it matches your "start of signature" regex. If not, add it to the array or string holding the actual message. If it does match, add the rest of the message to the footer.

    You might also want to start from the bottom instead of the top, if you think that your users will use lines matching your regex in the middle of the message.

    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论
查看更多回答(1条)

报告相同问题?

悬赏问题

  • ¥15 在不同的执行界面调用同一个页面
  • ¥20 基于51单片机的数字频率计
  • ¥50 M3T长焦相机如何标定以及正射影像拼接问题
  • ¥15 keepalived的虚拟VIP地址 ping -s 发包测试,只能通过1472字节以下的数据包(相关搜索:静态路由)
  • ¥20 关于#stm32#的问题:STM32串口发送问题,偶校验(even),发送5A 41 FB 20.烧录程序后发现串口助手读到的是5A 41 7B A0
  • ¥15 C++map释放不掉
  • ¥15 Mabatis查询数据
  • ¥15 想知道lingo目标函数中求和公式上标是变量情况如何求解
  • ¥15 关于E22-400T22S的LORA模块的通信问题
  • ¥15 求用二阶有源低通滤波将3khz方波转为正弦波的电路