dsgw3315 2018-02-04 17:48

Retrieving raw data from html.Node

I want to get the contents of an html.Node as a string.

Example:

<div id="my-node">
  <p>First paragraph</p>
  <p>Second paragraph</p>
</div>

Given myNode := html.Node("#my-node") (pseudocode), I want to retrieve the entire HTML above as a string. Indentation does not matter.

I couldn't find anything on the internet except iterating over the contents of the node via myNode.NextSibling, but that is overcomplicated and I'm pretty sure there has to be an easier way.

Update: I'm referring to the golang.org/x/net/html package.

1 answer

  • du9757 2018-02-05 17:14

    I get what you mean; I use this a lot in tests.

    What you need is already in the same x/net/html package. You can Render the node into a bytes.Buffer and then get a string out of it:

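    // Render writes the node and its whole subtree as HTML into the buffer.
    var b bytes.Buffer
    if err := html.Render(&b, node); err != nil {
        return "", err
    }
    return b.String(), nil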
    

    Please read the docs for html.Render: rendering is done on a best-effort basis, but it will probably suit your needs.

    PS. You can see how it's used in a real project of mine: https://github.com/wkhere/htmlx/blob/master/finder.go#L32-L39 and https://github.com/wkhere/htmlx/blob/master/finder_test.go#L73

    This answer was selected by the asker as the best answer.