基本的DOM XML解析器需要什么？

I've started programming in Google's Go Language, and the package I'm attempting to write is an API for processing and creating DOCX files (I'm familiar with this topic and thought it would be a good way to learn Go). As DOCX files are primarly a ZIP file with various XML files inside them, I rather need a DOM XML parser. However, I was unable to find any native Go DOM XML Parsers, as the only ones I saw seemed to be very limited, and probably SAX parsers (anyone who uses Go, correct me if I'm wrong).

So this past weekend I wrote a very basic DOM XML parser that was able to parse one of the simpler XML files within the DOCX package and output it back intact. At the moment I'm not going to bother with Namespace, XSLT, or schema validation support, as those aren't useful for manipulating DOCX files. My question is, what other XML standards and functionality would be important to incorporate into the parser?

At the moment, it only really just creates a tree of elements and attributes, which I can modify and save. I'm not current handling CDATA elements or XML escape characters (though those would be easy to do and I'll get to that this weekend).

写回答
好问题 0 提建议
追加酬金
关注问题
分享
邀请回答
编辑收藏删除结题
收藏举报

2条回答默认最新

关注

码龄粉丝数原力等级 --

被采纳

被点赞

采纳率
doujingjiao0015 2010-09-15 00:29
关注
First of all: if you specifically want to do DOM parser, you need to implement DOM API. But I am not sure if you actually mean that; perhaps you just mean an XML parser that produces XML tree model ("dom"); or just an XML parser? DOM is hardly the only way. Also note that implementing DOM tree model using SAX parser is the most common way; few if any DOM packages have embedded parsers, commonly parser is exposed separately.

As to XML parser features, some of things that are MUSTs in my opinion are:

Handling of character entities (ampersand and number), pre-defined general entities (lt, gt, apos, quot)

Handling of xml declaration ()

Handling of various input encodings; declared by xml declaration or externally -- too many parsers skimp on this, but is very imporant since xml documents can reliably detect encoding internally.

Checking for uniqueness of attribute values

Checking for proper nesting of elements

Skipping of comments

Skippping (if not handling) of processing instructions

CDATA handling -- it's simple to do

Keeping track of line numbers for error reporting

Other eventually useful things are:

Namespace handling

Checking of character validity, both content and names

Normalization of lineefeds as per xml specification
本回答被题主选为最佳回答 , 对您是否有帮助呢?

解决无用
评论打赏
分享
举报

评论

按下Enter换行，Ctrl+Enter发表内容

查看更多回答(1条)

报告相同问题？

关注问题

基本的DOM XML解析器需要什么？ xml
2010-09-15 00:09

回答 2 已采纳 First of all: if you specifically want to do DOM parser, you need to implement DOM API. But I am n
dom4j中saxwriter跟xmlwriter有什么分别？ java xml
2017-03-07 13:31

回答 1 已采纳 dom4j提供了XML文档的多种输出形式。在org.dom4j.io包中，DOMWriter类可以将dom4j树输出为W3C DOM的Document对象，SAXWriter类可以将dom4j树作为S
PHP中最快的XML解析器是什么？ php xml
2010-06-15 19:59

回答 4 已采纳 The fastest parser will be SAX -- it doesn't have to create a dom, and it can be done with partial
Android中XML的三种解析器分析、实战
2020-05-28 16:13

卜大爷的博客本文分析了Android中，可使用的三种XML解析器，并对它们的实现逻辑及优缺点进行了分析和对比。我们在实战部分，分别用三种解析器实现了Demo中XML文件的解析，代码注释详细的介绍了整个过程。...
使用DOM解析器解析Android XML问题 android xml
2013-03-04 09:45

回答 1 已采纳 `&` 是 xml 中一个预先定义的实体，代表的一种特殊的方式。在 URL 中,如果你把所有的 `&`变为`&`程序就会正常运行了。在 convertStreamToString 方法使
如何将phpdom保存到xml中？ php xml
2018-12-23 16:53

回答 3 已采纳 You're using the DOM API to read the XML. The same API can be used to create and modify XML docume
使用DOM4J 怎么解析这种XML？ xml
2015-07-13 06:51

回答 2 已采纳你这个xml格式不是很标准，节点里面的内容不一样。解析这种xml需要首先确定节点到底有多少中结构，从你贴出来的xml看来，只有两中结构，一种是下面没有子节点，一种是有子节点。这时候使用程序去解析，必须
Qt Xml文件的创建和解析[xml和dom方式]
2022-11-15 17:12

Qt历险记的博客【5】Qt XML解析方式比较【6】QXmlStreamReader类说明【7】QXmlStreamWriter类说明【8】DOM说明【9】XML常用函数【10】DOM常用函数【11】XML和DOM源码 XML.pro mainwindow.h mainwindow.cpp 【12】XML和DOM...
PHP DOMDocument：如何使用COLONS解析自定义XML / RSS标记名称？ php xml
2016-06-29 09:14

回答 2 已采纳 Use getElementsByTagNameNS(): $node->getElementsByTagNameNS("urn:ietf:params:xml:ns:xcal", "de
PHP DomDocument xml解析器 php xml
2011-09-15 09:14

回答 1 已采纳 This is expected behavior. When you load a formatted XML document with DOM any whitespace, e.g. in
需要一个php解析的xml格式的类 php xml
2023-02-21 10:14

回答 2 已采纳回答不易求求您采纳哦可以使用PHP内置的SimpleXML库来解析XML数据。以下是一个示例代码，用于解析你提供的XML格式数据： $xml = simplexml_load_string($
Android XML数据的三种解析方式（Dom解析）
2019-06-11 21:36

qinxuexiang_blog的博客 android提供三种类型的XML解析器，分别是dom、sax和xmlpullparser。其中android推荐使用xmlpullparser，因为它既高效又易于使用。所以我们将使用xmlpullparser来解析XML。首先先在Assets目录下创...
java 使用dom3c解析xml文件问题，多级节点解析 eclipse java
2019-06-05 19:38

回答 4 已采纳直接上代码 ``` package cn.next; import java.io.IOException; import javax.xml.parsers.DocumentBuilder;
C++QT开发——Xml、Json解析
2022-11-14 20:07

程序员老舅的博客 C++QT开发——Xml、Json解析
Android之PULL、SAX、DOM解析XML
2020-03-17 09:54

阿宁呀的博客背景：解析天气预报的xml文件，在模拟器显示解析前准备 layout目录下weather.xml <?xml version="1.0" encoding="utf-8"?> <RelativeLayout xmlns:android="http://schemas.android.com/apk/res/android" ...
没有解决我的问题, 去提问

悬赏问题

¥15 如何在scanpy上做差异基因和通路富集？
¥20 关于#硬件工程#的问题，请各位专家解答！
¥15 关于#matlab#的问题：期望的系统闭环传递函数为G(s)=wn^2/s^2+2¢wn+wn^2阻尼系数¢=0.707，使系统具有较小的超调量
¥15 FLUENT如何实现在堆积颗粒的上表面加载高斯热源
¥30 截图中的mathematics程序转换成matlab
¥15 动力学代码报错，维度不匹配
¥15 Power query添加列问题
¥50 Kubernetes&Fission&Eleasticsearch
¥15 報錯：Person is not mapped，如何解決？
¥15 c++头文件不能识别CDialog

基本的DOM XML解析器需要什么？

2条回答 默认 最新

悬赏问题

2条回答默认最新