duanqiu2064 2017-08-21 18:34
Accepted

How do I decode flexible XML in Go?

I have the following xml:

    ...
    <solution>
      <ContainerBlockElement>
        <Paragraph>
          <Paragraph>
            Foo
          </Paragraph>
          <Paragraph>
            bar
          </Paragraph>
        </Paragraph>
      </ContainerBlockElement>
    </solution>
    ...

I want to extract the text content, but the server can also send me a second, flatter structure:

    ...
    <solution>
      <ContainerBlockElement>
        <Paragraph>
          baz
        </Paragraph>
      </ContainerBlockElement>
    </solution>
    ...

I tried to decode it with this Go struct, but it doesn't work:

    type Blah struct {
        ...
        Solutions           []string `xml:"solution>ContainerBlockElement>Paragraph"`
        Solutions2Paragraph []string `xml:"solution>ContainerBlockElement>Paragraph>Paragraph"`
    }

How can I decode this?

1 answer

  • dqh1984 2017-08-21 19:41

    With unpredictable structures, deserializing into a fixed struct is not going to work. Instead, you'll be better off with the XML parser's streaming mode: call xml.Decoder.Token to read the elements in order and handle each one as necessary.

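    // Needs the imports encoding/xml, io, log and strings.
    decoder := xml.NewDecoder(xmlFile)
    var solutions []string
    depth := 0 // number of Paragraph elements currently open

    for {
        // Token returns (token, error); io.EOF marks the end of the input.
        t, err := decoder.Token()
        if err == io.EOF {
            break
        }
        if err != nil {
            log.Fatal(err)
        }
        switch tok := t.(type) {
        case xml.StartElement:
            if tok.Name.Local == "Paragraph" {
                depth++
            }
        case xml.EndElement:
            if tok.Name.Local == "Paragraph" {
                depth--
            }
        case xml.CharData:
            // Keep text found inside a Paragraph; skip the whitespace-only
            // runs that sit between nested tags.
            if text := strings.TrimSpace(string(tok)); depth > 0 && text != "" {
                solutions = append(solutions, text)
            }
        }
    }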
    

    This code is untested but should provide a decent starting point.

    The asker selected this answer as the best answer.
