在Go中将xpath节点转换回html-markup

import (
    "fmt"
    "gopkg.in/xmlpath.v2"
    "log"
)

...

path := xmlpath.MustCompile("//div[@id='23']")
tree, err := xmlpath.ParseHTML(reader)
if err != nil {
    log.Fatal("HTML parsing error, maybe not wellformed", err)
}

iter := path.Iter(tree)
for iter.Next() {
    fmt.Println(iter.Node().String()) // returns only the values of the text-node
}

...

Is there a way to convert iter.Node() back to html markup like <div>...</div>? iter.Node().String() returns only the values of all inner text nodes. As far as I see the documentation of the xmlpath-package does not offer such function.

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
dpda53918 2016-04-08 20:35
关注
You are right - gopkg.in/xmlpath.v2 functions are limited to read content of nodes. And there is not many alternatives in Go to work with DOM.

From native Go libraries I can mention only goquery. It works only with HTML and does not support XPath but support CSS selectors. Maybe that would be enough in your case.

If you really need to work with both HTML and XML via XPath there is libxml wrapper for Go called gokogiri. It supports all features of libxml so you can get nodes, inner/outerHTML, attributes and other things. I used it to extract text content in one service which currently is in production state. It's a bit faster than PHP's DOMDocument. Only one limitation is fact that I'm not sure if it supports Go versions higher than 1.4.*. Oh and installation on Windows is a bit tricky.

本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

在Go中将xpath节点转换回html-markup
2016-04-08 14:27

回答 2 已采纳 You are right - gopkg.in/xmlpath.v2 functions are limited to read content of nodes. And there is n
XPATH在python selenium中的定位当前节点的子点的问题 html5 python selenium
2020-09-18 10:38

回答 1 已采纳 https://blog.csdn.net/sun_977759/article/details/100989829
使用xpath从background-image样式属性中提取值 php
2017-11-01 05:47

回答 1 已采纳 1) You lost quotes wrapping xpath - it's string. 2) with dom xpath, query returns set of nodes w
Ruby XML, XSLT 和 XPath
2019-09-23 10:04

江洗河的博客 XML 指可扩展标记语言（eXtensible Markup Language）。可扩展标记语言，标准通用标记语言的子集，一种用于标记电子文件使其具有结构性的标记语言。它可以用来标记数据、定义数据类型，是一种允许用户对自己的标记...
C#使用HtmlAgilityPack 获取xpath节点时出错 c# html5
2018-11-07 07:35

回答 1 已采纳 https://blog.csdn.net/heyangyi_19940703/article/details/78352378
xpath 获取当前节点标签名 python 全文检索数据挖掘
2021-03-25 13:22

回答 4 已采纳 from lxml import etreehtml = "world"a = etree.HTML(html)print(a.xpath("local-name(//a[@id='1'])"))pr
DOMXPath / DOMDocument - 在注释块中获取div html php
2014-10-29 04:18

回答 1 已采纳 You already can target the comment containing the links, just follow thru that and make another qu
Ruby学习之XML, XSLT 和 XPath使用方法
2019-01-04 13:15

luyaran的博客 XML就是指可扩展标记语言（eXtensible Markup Language），标准通用标记语言的子集，一种用于标记电子文件使其具有结构性的标记语言。它可以用来标记数据、定义数据类型，是一种允许用户对自己的标记语言进行定义的...
如何在执行后从xpath节点中删除空间？ php xml
2012-12-11 15:11

回答 1 已采纳 I will assume you want the price then. You don't get the same results in the browser and your scr
没有文本节点后代的文档中所有元素的Xpath？ html php xml
2016-12-15 21:52

回答 1 已采纳 This XPath, //*[not(.//text())] will select all elements in the document without text node desc
XPATH - 当内部节点名称空间不同时，在一个节点下返回整个对象 php xml
2013-11-21 10:16

回答 1 已采纳 To get entry nodes use only the first part of your expression: /atom:feed/atom:entry To get all
XML 路径语言（XPath）版本 1.0
2019-10-07 17:47

aili2460的博客 XML 路径语言（XPath）版本 1.0 万维网协会 (W3C) 建议 1999November16 本版本： http://www.w3.org/TR/1999/REC-xpath-19991116 （其它文件格式： XML [英文] HTML [英文] ）最新版本： ...
PHP + XPath在指定日期之间获取节点值 php xml
2016-06-23 16:55

回答 3 已采纳 You can parse data and add to an array, as a stdClass or whatever you like most: <?php $xml =
Java笔记整理九-javaweb（html，CSS，JavaScript，BOM，事件，XML）
2021-03-02 16:13

Dev晚风的博客 * html:html文档的根标签 * head：头标签。用于指定html文档的一些属性。引入外部的资源 * title：标题标签。 * body：体标签 * <!DOCTYPE html>：html5中定义该文档是html文档文本标签：和...
Xpath很全的学习地点，不看后悔
2019-09-24 22:50

dengguxinghe4335的博客 XPath 是一种用于对 XML 文档的元件寻址的一语言，设计为 XSLT 和 XPointer 使用。本文档的地位本文档已由万维网协会 (W3C) 组织成员和其他感兴趣的各方审阅，并已被组织理事批准为万维网协会 (W3C)建议。这...
没有解决我的问题, 去提问

悬赏问题

¥15 安装svn网络有问题怎么办
¥15 Python爬取指定微博话题下的内容，保存为txt
¥15 vue2登录调用后端接口如何实现
¥65 永磁型步进电机PID算法
¥15 sqlite 附加（attach database）加密数据库时，返回26是什么原因呢？
¥88 找成都本地经验丰富懂小程序开发的技术大咖
¥15 如何处理复杂数据表格的除法运算
¥15 如何用stc8h1k08的片子做485数据透传的功能？(关键词-串口)
¥15 有兄弟姐妹会用word插图功能制作类似citespace的图片吗？
¥15 latex怎么处理论文引理引用参考文献

在Go中将xpath节点转换回html-markup

2条回答 默认 最新

悬赏问题

2条回答默认最新