Unmarshalling with encoding/xml on dynamically structured elements

I'm working with EPUBs in Go, and I need to fetch the cover image from the cover.xhtml file (or whichever file is referenced in the .opf file).

My problem is the dynamic structure of the elements in the cover.xhtml files.

Each EPUB has a different structure in its cover.xhtml file. For example:

<body>
    <figure id="cover-image">
        <img src="covers/9781449328030_lrg.jpg" alt="First Edition" />
    </figure>
</body>

Another EPUB's cover.xhtml file:

<body>
    <div>
        <img src="@public@vhost@g@gutenberg@html@files@54869@54869-h@images@cover.jpg" alt="Cover" />
    </div>
</body>

I need to fetch the img tag's src attribute from this file, but I haven't been able to.

Here is the part of my code that deals with unmarshalling the cover.xhtml file:

type CPSRCS struct {
    Src string `xml:"src,attr"`
}

type CPIMGS struct {
    Image CPSRCS `xml:"img"`
}

XMLContent, err = ioutil.ReadFile("./uploads/moby-dick/OPS/cover.xhtml")
CheckError(err)

coverFile := CPIMGS{}
err = xml.Unmarshal(XMLContent, &coverFile)
CheckError(err)
fmt.Println(coverFile)

The output is:

{{}}

The output I'm expecting is:

{{covers/9781449328030_lrg.jpg}}

Thanks in advance!

1 answer

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doubianxian6557 2017-06-15 12:35
This pulls the img element out of the file contents and then unmarshals the src attribute from that element. It assumes that you only ever need the first img element in the file and that the tag is self-closed with "/>".

XMLContent, err = ioutil.ReadFile("./uploads/moby-dick/OPS/cover.xhtml")
CheckError(err)

//Parse the XMLContent to grab just the img element
strContent := string(XMLContent)
imgLoc := strings.Index(strContent, "<img")
prefixRem := strContent[imgLoc:]
endImgLoc := strings.Index(prefixRem, "/>")
//Move over by 2 to recover the '/>'
trimmed := prefixRem[:endImgLoc+2]

var coverFile CPSRCS
err = xml.Unmarshal([]byte(trimmed), &coverFile)
CheckError(err)
fmt.Println(coverFile)

This produces {covers/9781449328030_lrg.jpg} for the first input file and {@public@vhost@g@gutenberg@html@files@54869@54869-h@images@cover.jpg} for the second input file you provided.
This answer was selected by the asker as the best answer.