dshu1235 2018-10-05 10:08
浏览 514
已采纳

如何使用XPath解析HTML字符串

How to select an image (or another HTML tag) with XPath in Go?

resp, _ := http.Get(url)
bytes, _ := ioutil.ReadAll(resp.Body)

s := string(bytes))

how to parse s with XPath?

like this code:

list := libxxxx.Find(s, "//a@href")

I get HTML code with http.Get but when I want to parse it I have a problem.

  • 写回答

1条回答 默认 最新

  • douxie0824 2018-10-05 10:54
    关注

    you can use htmlquery:

    doc, err := htmlquery.LoadURL("http://example.com/")
    

    or use string:

    s := `<html>....</html>`
    doc, err := htmlquery.Parse(strings.NewReader(s))
    

    then find everything:

    list := htmlquery.Find(doc, "//a")
    list := range htmlquery.Find(doc, "//a[@href]") 
    
    本回答被题主选为最佳回答 , 对您是否有帮助呢?
    评论

报告相同问题?