正则表达式价格<p>标签块与PHP的重复[重复]

This question already has an answer here:

How do you parse and process HTML/XML in PHP? 30 answers

I am trying to scrape the prices block out of a webpage and I want to match the contents between the opening and closing paragraph tags which have the prices in. However the problem is in the html output source this is spit onto multiple lines with multiple white spaces. Here is a sample of the output http://pastebin.com/hfeuHqTN

I am trying to use:

$pricesClass = '/<p class="price-wrap">
(.*)/';

preg_match_all($pricesClass, $page, $pricesMatches);

How can I match the whole of the paragraph with the class of price-wrap until the closing paragraph tag?

At the moment it just matches the first two lines up to:

<p class="price-wrap"><strong class="product-price" itemprop="price">

I would like to match the whole thing e.g.

 <p class="price-wrap"><strong class="product-price" itemprop="price"> £120</strong> was&nbsp;<del>£186.00</del></p>

</div>

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dpwh11290 2016-04-30 12:40
关注
Use a proper HTML parser like DOMDocument and preg_replace (\s+) only to remove the “whitespace characters” (any Unicode separator, tab, line feed, carriage return, vertical tab, form feed)

$dom = new DOMDocument(); $dom->loadHTML(file_get_contents("http://thesite.com"); $xpath = new DOMXpath($dom); foreach ($xpath->query("//p[@class='price-wrap']") as $pText){ echo preg_replace("/\s+/", "", $pText->textContent); }

Ideone Demo
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

报告相同问题？

关注问题

正则表达式价格<p>标签块与PHP的重复[重复] php
2016-04-30 12:29

回答 1 已采纳 Use a proper HTML parser like DOMDocument and preg_replace (\s+) only to remove the “whitespace
正则表达式 匹配 <script></script> 正则表达式
2017-07-14 02:38

回答 4 已采纳 ``` ```
PHP正则表达式替换<>及中间内容 php 正则表达式
2015-08-25 12:03

回答 2 已采纳自己找到了，好象是因为有换行.匹配不了\n，先用\s\s+把空白格去掉，再用上面的语句就可以了
Python 正则表达式详解
2021-12-26 20:16

yggcwhat的博客 search findall re.s sub split 贪婪与非贪婪案例匹配手机号提取网页源码中所有的文字提取图片地址 正则表达式是对字符串提取的一套规则，我们把这个规则用正则里面的特定语法表达出来，去匹配满足这个规则的...
js 求一正则表达式是去掉<p> 这个标签的正则表达式
2015-07-01 09:58

回答 3 已采纳 ``` var s='afef'; s=s.replace(/]*>/gi,'') alert(s) ```
使用DOM或正则表达式删除<p>＆nbsp; </ p> html php
2011-07-23 17:15

回答 3 已采纳 If you want to remove a string that is exactly, always, '<p> </p>', the simplest
正则表达式条<a>包含特定href值的标签 php
2012-06-07 13:01

回答 2 已采纳 Here's a more reliable, DOM-based approach: <?php $a = 'Lorem ipsum <a href="http://mysite
php正则表达式工具,正则表达式语法教程（含在线测试工具）
2021-03-26 10:34

lewis青的博客什么是正则表达式?正则表达式是一组由字母和符号组成的特殊文本, 它可以用来从文本中找出满足你想要的格式的句子.一个正则表达式是在一个主体字符串中从左到右匹配字符串时的一种样式."Regular expression"这个词...
PHP正则表达式返回<option>值 php
2012-03-07 01:38

回答 4 已采纳 preg_match will return an array containing only the first match. The first index of the array wil
正则表达式：如果内部没有数据，则将<div>标签的内容替换为<br>标签 html php
2016-04-29 16:11

回答 2 已采纳 I would highly suggest using DOM Manipulation to accomplish this. You can use regular expressions
Python正则表达式怎么匹配<>里面的内容 python
2022-06-27 11:15

回答 3 已采纳 import re a = "a=submit from host <sn01.xz>,CWD</home/export>" result = re.findall(r".*?
php 两个单词 正则表达式字符前_PHP正则表达式详解（二）
2020-12-30 20:18

李晓舟的博客前言：在本文中讲述了正则表达式中的组与向后引用，先前向后查看，条件测试，单词边界，选择符等表达式及例子，并分析了正则引擎在执行匹配时的内部机理。本文是Jan Goyvaerts为RegexBuddy写的教程的译文，版权归原...
PHP正则表达式 - 删除所有<br>标签 php
2017-01-04 10:53

回答 1 已采纳 Try something like this^ $text = preg_replace_callback( '~\[tex\].*?\[/tex\]~s', function
php正则表达式判断形如,PHP正则表达式教程(转载)
2021-04-29 08:31

weixin_39637457的博客 1、入门简介简单的说，正则表达式是一种可以用于模式匹配和替换的强有力的工具，主要用于字符串的模式分割、匹配、查找及替换操作。我们可以在几乎所有的基于UNIX系统的工具中找到正则表达式的身影，例如，vi编辑器...
php正则表达式除什么之外,正则表达式：匹配除特定模式以外的所有内容
2021-04-07 08:22

王司图的博客我需要一个能够匹配除以特定模式(特别是index.php及其后的内容，例如...正则表达式可能重复，以匹配不包含单词的行？正则表达式：匹配所有内容，但：以特定模式开头的字符串(例如，any-也为空-不是以foo开头的字符串...
没有解决我的问题, 去提问

悬赏问题

¥15 unity第一人称射击小游戏，有demo，在原脚本的基础上进行修改以达到要求
¥15 买了个传感器，根据商家发的代码和步骤使用但是代码报错了不会改，有没有人可以看看
¥15 关于#Java#的问题，如何解决？
¥15 加热介质是液体，换热器壳侧导热系数和总的导热系数怎么算
¥100 嵌入式系统基于PIC16F882和热敏电阻的数字温度计
¥15 cmd cl 0x000007b
¥20 BAPI_PR_CHANGE how to add account assignment information for service line
¥500 火焰左右视图、视差（基于双目相机）
¥100 set_link_state
¥15 虚幻5 UE美术毛发渲染

正则表达式价格<p>标签块与PHP的重复[重复]

1条回答 默认 最新

悬赏问题

1条回答默认最新