dsgw3315 2018-02-04 09:48

Retrieving raw data from html.Node

I want to get the contents of an html.Node as a string.

Example:

<div id="my-node">
  <p>First paragraph</p>
  <p>Second paragraph</p>
</div>

Given myNode := html.Node("#my-node") (pseudocode), I want to retrieve the entire HTML above as a string. Indentation does not matter.

I couldn't find anything online except iterating over the node's contents via myNode.NextSibling, but that is overcomplicated and I'm pretty sure there has to be an easier way.

Update: I'm referring to the golang.org/x/net/html package.

1 answer

  • du9757 2018-02-05 09:14

    I get what you mean; I use this a lot in tests.

    What you need is already in the same x/net/html package. You can Render the Node into a bytes.Buffer and then get a string out of it:

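    // needs "bytes" and "golang.org/x/net/html"; node is a *html.Node
    var b bytes.Buffer
    if err := html.Render(&b, node); err != nil {
        return "", err // do not silently drop a write/render error
    }
    return b.String(), nil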
    

    Please read the documentation: rendering is done on a best-effort basis, but it will probably suit your needs.

    PS. You can see how it's used in a real project of mine: https://github.com/wkhere/htmlx/blob/master/finder.go#L32-L39 https://github.com/wkhere/htmlx/blob/master/finder_test.go#L73

    This answer was selected as the best answer by the asker.