1. 相关概念
(1). ENCODING
The Unicode character set can be encoded into bytes for storage or transmission in a variety of different ways, called "encodings".
(2). SAX
(3). DOM
Tree-traversal APIs accessible from a programming language, for example DOM.
2. XML C/C++ Parese Libraries
文章[1]总结的非常好, 根据易用行和性能, 最终选择了pugixml
文章[2] 对各种XML的选择非常好