如何用php DOMDocument输出纯文本？

I'm using this code (thank you Lawrence) to parse HTML table:

<?php
$html = file_get_contents('http://www.example.com');
$dom = new DOMDocument();
@$dom->loadHTML($html);

//TUE 1 1 4.37 6.39 1.08 5.35 9.18 6.00 1.30 6.30 7.42 9.40                 
echo '
<table>
    <tr>';
foreach($dom->getElementsByTagName('table') as $table) {
    echo innerHTML($table->getElementsByTagName('tr')->item(9));
}
echo '
    </tr>
</table>';

function innerHTML($current){
    $ret = "";
    $nodes = @$current->childNodes;
    if(!empty($nodes)){
        foreach($nodes as $v){
            $tmp = new DOMDocument();
            $tmp->appendChild($tmp->importNode($v, true));
            $ret .= $tmp->saveHTML();
        }
        return $ret;
    }
    return;
}
?>

The problem is that it outputs original HTML code, so how can I output plain text?

I have tried these changes, but it didn't work:

return $ret->textContent;
return $ret->nodeValue;
return $ret->plaintext;

echo innerHTML($table->getElementsByTagName('tr')->item(9)->textContent);
echo innerHTML($table->getElementsByTagName('tr')->item(9)->nodeValue);
echo innerHTML($table->getElementsByTagName('tr')->item(9)->plaintext);

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
duanqiang5722 2015-10-13 17:12
关注
The solution is actually very simple - strip_tags function.

echo strip_tags(innerHTML($table->getElementsByTagName('tr')->item(9)));

It takes the value and removes all of the HTML code, which results in plain text value.
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

使用PHP DomDocument提取文本和图像src php
2014-08-22 15:18

回答 1 已采纳 You're using $t->nodeValue to obtain the content of a node. An <img> tag is empty, thus h
使用php DOMDocument从网页中提取文本 html php
2012-03-18 12:57

回答 1 已采纳 I've always used this http://simplehtmldom.sourceforge.net/ and every time with success.
如何从用户输入中删除不需要的HTML标记，但使用DOMDocument将文本保留在PHP中的标记内 html php
2016-09-13 10:41

回答 2 已采纳 It seems this problem needs to be broken down into two smaller steps in order to generalize the so
PHP基于DOMDocument解析和生成xml的方法分析
2020-10-19 12:40

首先，使用DOMDocument类生成XML文档的基本步骤包括创建DOMDocument实例、定义XML文档结构（根节点、子节点等）、设置编码类型、保存XML到字符串或文件。以下是一个简单的示例代码： ```php $doc = new DOMDocument...
如何在使用DOMDocument时将文本内容分隔为<BR> php
2017-01-12 05:00

回答 3 已采纳 In your example, $n contains 5 child nodes: "Name" "<br/>" " " "<span class='class2'&gt
使用PHP DOMDocument替换XML节点的非文本 php xml
2012-11-21 05:19

回答 1 已采纳 So, since you have created DOMDocument you can use DOMXpath. Or keep using getElementsByTagName()
使用php dom获取标题的文本 php
2014-01-01 09:55

回答 2 已采纳 Try this. echo $dom->getElementsByTagName('h1')->item(1)->nodeValue;
PHP读取XML文件的方法实例总结【DOMDocument及simplexml方法】
2020-10-16 10:39

在使用DOMDocument类之前，需要创建一个DOMDocument对象，通过实例化DOMDocument类即可完成。创建对象之后，可以调用load方法来载入XML文件，之后就可以通过一系列方法来访问XML文档的节点了。例如，我们要读取一...
RSS feed在PHP中返回纯文本而不是HTML？ php
2015-11-06 02:55

回答 1 已采纳 Using DOMDocument::saveHTML will preserve the html formatting of the node. This will give you what
PHP删除元素前后的文本 php
2017-12-11 10:40

回答 2 已采纳 As your trying to get 2 nodes, the way I've done it is to use 2 XPath expressions... $d = new DOM
PHP：DOMDocument：从嵌套元素中删除不需要的文本 php
2013-05-21 16:56

回答 2 已采纳 Assuming your XML actually parses, you could use XPath to make your queries a lot easier: $xp = n
PHP使用DOMDocument类生成HTML实例（包含常见标签元素）
2020-10-25 17:54

本知识点将详细介绍如何使用PHP中的DOMDocument类来生成包含常见HTML标签元素的HTML文档实例，包括表单、表格、CSS样式等，并通过示例演示如何编写代码来实现这一过程。 DOMDocument类是PHP中DOM扩展的一部分，它...
PHP XML操作类DOMDocument
2020-10-29 12:01

PHP中的DOMDocument类是用于处理XML文档的核心工具，它遵循DOM（Document Object Model）标准，允许程序员以结构化的方式访问和操作XML数据。DOMDocument类提供了丰富的属性和方法，使得XML文档的创建、读取、修改和...
PHP创建XML的方法示例【基于DOMDocument类及SimpleXMLElement类】
2020-10-16 10:39

在PHP中创建XML文件是一种基本的技能，本篇将介绍如何使用PHP的DOMDocument类和SimpleXMLElement类来创建XML文件。首先，我们来谈谈DOMDocument类。DOMDocument类是PHP中用于操作DOM（文档对象模型）的一个标准类...
php domelement domdocument,初识php– DOMDocument类
2021-04-22 13:23

是一个亿呀的博客今天遇到需要解析xml，用到了DOMDocument类文件1、xml的解析$doc=new DOMDocument();//如果是解析xml字符串则使用loadXML//$encryptMsg = file_get_contents(‘php://input’);//$xml_tree->loadXML($encryptMsg)...
没有解决我的问题, 去提问

悬赏问题

¥15 javaweb项目无法正常跳转
¥15 VMBox虚拟机无法访问
¥15 skd显示找不到头文件
¥15 机器视觉中图片中长度与真实长度的关系
¥15 fastreport table 怎么只让每页的最下面和最顶部有横线
¥15 R语言卸载之后无法重装，显示电脑存在下载某些较大二进制文件行为，怎么办
¥15 java 的protected权限，问题在注释里
¥15 这个是哪里有问题啊？
¥15 关于#vue.js#的问题：修改用户信息功能图片无法回显，数据库中只存了一张图片（相关搜索：字符串）
¥15 texstudio的问题，

如何用php DOMDocument输出纯文本？

2条回答 默认 最新

悬赏问题

2条回答默认最新