PHP regular expression to match all-caps lines with an occasional hyphen

I'm trying to convert an existing PHP regular expression so that it applies to a slightly different style of document.

Here's the original style of the document:

**FOODS - TYPE A** 
___________________________________ 
**PRODUCT** 
1) Mi Pueblito Queso Fresco Authentic Mexican Style Fresh Cheese; 
2) La Fe String Cheese 
**CODE** 
Sell by date going back to February 1, 2009

And here is the working PHP regex code. It matches only if the line is surrounded by asterisks, and stores the text on each side of the "-" in $m[1] and $m[2], respectively.

    if ( preg_match('#^\*\*([^-]+)(?:-(.*))?\*\*$#', $line, $m) ) {
        // only for **header - subheader** $m[2] is set.
        if ( isset($m[2]) ) {
            return array(TYPE_HEADER, array(trim($m[1]), trim($m[2])));
        }
        else {
            return array(TYPE_KEY, array($m[1]));
        }
    }

So, for line 1, $m[1] = "FOODS" and $m[2] = "TYPE A" (after trimming); line 2 would be skipped; for line 3, $m[1] = "PRODUCT"; and so on.

The question: how would I rewrite the above regex if the headers did not have the asterisks, but were still all-caps and at least 4 characters long? For example:

FOODS - TYPE A 
___________________________________ 
PRODUCT
1) Mi Pueblito Queso Fresco Authentic Mexican Style Fresh Cheese; 
2) La Fe String Cheese 
CODE
Sell by date going back to February 1, 2009

Thank you.

dqydp44800 2010-04-20 13:14

Along the lines of (don't forget the "u" flag for Unicode regexes):

^(?:\*\*)?(?=[^*]{4,})(\p{Lu}+)(?:\s*-\s*(\p{Lu}+))?(?:\*\*)?\s*$

^               # start of line
(?:\*\*)?       # two stars, optional
(?=[^*]{4,})    # followed by at least 4 non-star characters
(\p{Lu}+)       # group 1, Unicode upper case letters
(?:             # start no capture group
  \s*-\s*       #   space*, dash, space*
  (\p{Lu}+)     #   group 2, Unicode upper case letters
)?              # end no capture group, make optional
(?:\*\*)?       # two stars, optional
\s*             # optional trailing spaces
$               # end of line
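
As a rough illustration, the pattern above could be passed to `preg_match` as in the sketch below. The function name `matchUnicodeHeader` and the sample lines are made up for this example, and the source file is assumed to be saved as UTF-8. Note that `\p{Lu}+` does not allow spaces inside a group, so a header such as "FOODS - TYPE A" is not matched by this version.

```php
<?php
// Hypothetical helper (not part of the original code): returns the captured
// groups for a header line, or null when the line does not match.
function matchUnicodeHeader(string $line): ?array
{
    // Same pattern as above; "~" is used as the delimiter and the "u" flag
    // turns on UTF-8 mode so \p{Lu} sees multi-byte upper case letters.
    $pattern = '~^(?:\*\*)?(?=[^*]{4,})(\p{Lu}+)(?:\s*-\s*(\p{Lu}+))?(?:\*\*)?\s*$~u';

    if (preg_match($pattern, $line, $m) !== 1) {
        return null;
    }

    // Drop $m[0] (the whole match) and keep only the capture groups.
    return array_slice($m, 1);
}

var_dump(matchUnicodeHeader('PRODUCT'));          // one group: "PRODUCT"
var_dump(matchUnicodeHeader('**ÉCOLE - ÉTÉ**'));  // two groups: "ÉCOLE", "ÉTÉ"
var_dump(matchUnicodeHeader('FOODS - TYPE A'));   // NULL: space inside group 2
var_dump(matchUnicodeHeader('ABC'));              // NULL: fewer than 4 characters
```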

EDIT: Simplified, as per the comments:

^(?=[A-Z -]{4,})([A-Z ]+)(?:-([A-Z ]+))?\s*$

^               # start of line
(?=[A-Z -]{4,}) # followed by at least 4 upper case characters, spaces or dashes
([A-Z ]+)       # group 1, upper case letters or space
(?:             # start no capture group
  -             #   a dash
  ([A-Z ]+)     #   group 2, upper case letters or space
)?              # end no capture group, make optional
\s*             # optional trailing spaces
$               # end of line

Contents of groups 1 and 2 must be trimmed before use.
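
For illustration, here is one way the simplified pattern might be dropped into the structure of the question's code, with the trimming applied. It is only a sketch: the function name `classifyLine` and the values given to `TYPE_HEADER` and `TYPE_KEY` are placeholders, since the question does not show how those constants are defined. The lookahead uses the `[A-Z -]` class from the commented breakdown.

```php
<?php
// Placeholder values; the question uses these constants without defining them.
const TYPE_HEADER = 'header';
const TYPE_KEY = 'key';

// Hypothetical wrapper: classifies one line, or returns null for non-headers.
function classifyLine(string $line): ?array
{
    $pattern = '#^(?=[A-Z -]{4,})([A-Z ]+)(?:-([A-Z ]+))?\s*$#';

    if (!preg_match($pattern, $line, $m)) {
        return null; // not an all-caps header line
    }

    // $m[2] is only set for "HEADER - SUBHEADER" lines.
    if (isset($m[2])) {
        return array(TYPE_HEADER, array(trim($m[1]), trim($m[2])));
    }

    return array(TYPE_KEY, array(trim($m[1])));
}

var_dump(classifyLine('FOODS - TYPE A '));        // header: "FOODS", "TYPE A"
var_dump(classifyLine('PRODUCT'));                // key: "PRODUCT"
var_dump(classifyLine('2) La Fe String Cheese ')); // NULL
```

A line made up of four or more spaces would also match this pattern, so it may be worth skipping blank lines before calling the function.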

This answer was selected by the asker as the best answer.
