如何忽略span标签dom html

Hi i am trying to scrape Brand New Apple iPhone 8 64GB or 256GB - Sealed - GSM Unlocked in this code but it also scrape span with it, how do i ignore span text.

<h1 class="it-ttl" itemprop="name" id="itemTitle"><span class="g-hdn">Details about  &nbsp;</span>Brand New Apple iPhone 8 64GB or 256GB - Sealed - GSM Unlocked</h1>

This is the code :

$productname = $html->find("h1[class='it-ttl']",0)->plaintext;

echo $productname;

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

1条回答默认最新

dpca4790 2019-05-01 20:22

关注

strip_tags_content is a function which is written in PHP Strip Tags and owner of the function is explains with these words. You can find more examples inside the link.

Output is: Brand New Apple iPhone 8 64GB or 256GB - Sealed - GSM Unlocked

"Hi. I made a function that removes the HTML tags along with their contents "

 function strip_tags_content($text, $tags = '', $invert = FALSE) {

        preg_match_all('/<(.+?)[\s]*\/?[\s]*>/si', trim($tags), $tags);
        $tags = array_unique($tags[1]);

        if(is_array($tags) AND count($tags) > 0) {
            if($invert == FALSE) {
                return preg_replace('@<(?!(?:'. implode('|', $tags) .')\b)(\w+)\b.*?>.*?</\1>@si', '', $text);
            }
            else {
                return preg_replace('@<('. implode('|', $tags) .')\b.*?>.*?</\1>@si', '', $text);
            }
        }
        elseif($invert == FALSE) {
            return preg_replace('@<(\w+)\b.*?>.*?</\1>@si', '', $text);
        }
        return $text;
    }


    $string = '<h1 class="it-ttl" itemprop="name" id="itemTitle"><span class="g-hdn">Details about  &nbsp;</span>Brand New Apple iPhone 8 64GB or 256GB - Sealed - GSM Unlocked</h1>';
    $string = strip_tags_content($string,'<span>',true);
    $string = strip_tags($string);

    echo $string;

For your problem after defining this function just call

$productname = $html->find("h1[class='it-ttl']",0)->plaintext; 
$productname = strip_tags_content($productname ,'<span>',true); 
$productname = strip_tags($string);

报告相同问题？

关注问题

将html标签转换为php数组 html javascript php
2016-05-30 13:19

回答 1 已采纳 $xml = new SimpleXMLElement($html); $result = $xml->xpath('//ul/li'); $up = array(); //array o
PHP - DomXPath空标签 php
2016-09-02 20:18

回答 1 已采纳 foreach($elements as $index => $element) { $dom = new DOMDocument(); $dom->appendChi
PHP提取html标签和内容[重复] html php
2015-04-05 22:15

回答 1 已采纳 This should work for PHP version 5.3.6+. Just pass the node to the DOMDocument::saveHTML function.
php 删除span标签,PHP使用DOMXPath剥离标签并删除节点
2021-04-16 14:48

hfcorriez的博客我正在尝试使用DOMDocument,但是遇到一些问题.我有一个像这样的字符串：Some Content to keepThis content should remain, but span around it should be strippedKeep this content tooThis whole node should be ...
无法解析成<code>标签 - PHP - 简单的html dom html php
2014-03-12 23:49

回答 2 已采纳 simplehtmldom among others strips out pre formatted tags. If you want code tag to be recognized de
如何从用户输入中删除不需要的HTML标记，但使用DOMDocument将文本保留在PHP中的标记内 html php
2016-09-13 10:41

回答 2 已采纳 It seems this problem needs to be broken down into two smaller steps in order to generalize the so
将“Image”标记替换为“a”标记PHP DOMDocument html php
2019-02-26 06:41

回答 3 已采纳 This is a case of when you alter the content of the document your iterating over a (your list of t
php html dom,HTML DOM操作的详细介绍
2021-04-15 15:06

章其琢的博客 HTML DOM文档节点 (document，唯一)元素节点 (那些个标签div，p之类)属性节点(class，src这种)文本节点(插入在p，div内的文本)document中的open()定义和用法open() 方法可打开一个新文档，并擦除当前文档的内容。...
Html Dom解析器得到第一个元素 php
2014-03-25 10:18

回答 2 已采纳 You can actually get at that one with: $html->find('h1 text', 0);
PHP - 标签之间的文本 html php
2014-09-12 15:19

回答 2 已采纳 Don't use a regexp. Use an HTML parser! Here's an example with PHP Simple HTML DOM Parser, but yo
将php变量内容转换为html字符串 html php
2013-03-04 18:09

回答 1 已采纳 $e is an array or an object. Casting it to an array will ensure you can implode on it even if its
php怎么获取html span标签的值_如何获取PHP中所有html元素的列表？
2021-03-22 23:43

我是一只萤火虫呀的博客但是你的代码在第一次迭代时用一个文本节点代替标签(包括所有子节点)。迭代的节点，并修改只有nodeType属性等于XML_TEXT_NODE节点：$nodes = $dom->getElementsByTagName('*');foreach ($nodes as $node)...
过滤子标签值 html php
2013-08-06 09:05

回答 1 已采纳 This is possibly not the cleanest solution, but it works: $dateDiv = $html->find('.date', 0);
php 正则匹配span标签,正则表达式 – 正则表达式获取span标签
2021-04-20 14:24

ZackRen的博客 span [^>] *>)>是你有一个小错字.你看,该表达式试图匹配两个结束>：仔细看看结束>)>.例如,它匹配< span hey there>>但不是&span; span hey there>要匹配开头范围,请确保只有一个...
php得到dom标签名,使用PHP简单HTML dom解析器搜索元素名称
2021-04-08 11:13

weixin_39517859的博客我正在成功使用PHP Simple HTML DOM解析器(http://simplehtmldom.sourceforge.net/manual.htm),但是现在我试图基于某个名称来查找元素.例如,在获取的HTML中,可能会有诸如以下的标记：Matt FacerMatt JonesDaveS ...
php 正则取span,正则表达式 – 正则表达式获取span标签
2021-04-20 12:43

ASC2050的博客 span [^>] *>)>是你有一个小错字.你看,该表达式试图匹配两个结束>：仔细看看结束>)>.例如,它匹配< span hey there>>但不是&span; span hey there>要匹配开头范围,请确保只有一个...
html span box shadow,Shadow DOM的简单实现
2021-06-14 06:23

叫我三叔就行的博客紧接Shadow DOM的简介，介绍关于Shadow DOM的使用方法。一、例子helloworld像这段HTML，在浏览器中被解析成DOM时，每一个元素就是一个节点，整体构成了一个节点树。而Shadow DOM可以让我们自己创建节点树，依旧是...
没有解决我的问题, 去提问

悬赏问题

¥35 lstm时间序列共享单车预测，loss值优化，参数优化算法
¥15 基于卷积神经网络的声纹识别
¥15 Python中的request，如何使用ssr节点，通过代理requests网页。本人在泰国，需要用大陆ip才能玩网页游戏，合法合规。
¥100 为什么这个恒流源电路不能恒流？
¥15 有偿求跨组件数据流路径图
¥15 写一个方法checkPerson，入参实体类Person，出参布尔值
¥15 我想咨询一下路面纹理三维点云数据处理的一些问题，上传的坐标文件里是怎么对无序点进行编号的，以及xy坐标在处理的时候是进行整体模型分片处理的吗
¥15 CSAPPattacklab
¥15 一直显示正在等待HID—ISP
¥15 Python turtle 画图

码龄粉丝数原力等级 --

如何忽略span标签dom html

1条回答默认最新

码龄粉丝数原力等级 --

悬赏问题

如何忽略span标签dom html

1条回答 默认 最新

悬赏问题

1条回答默认最新