實例解析Ruby程序中調用REXML來解析XML格式數據的用法

2019-10-26 19:29:02

字體：大中小

來源：轉載

供稿：網友

REXML 是由 Sean Russell 編寫的庫。它不是 Ruby 的唯一 XML 庫，但它是很受歡迎的一個，并且是用純 Ruby 編寫（ NQXML 也是用 Ruby 編寫的，但 XMLParser 封裝了用 C 編寫的 Jade 庫）。在他的 REXML 概述中，Russell 評論道：
我有這樣的問題：我不喜歡令人困惑的 API。有幾種用于 Java 實現的 XML 解析器 API。其中大多數都遵循 DOM 或 SAX，并且在基本原理上與不斷出現的眾多 Java API 非常相似。也就是說，它們看上去象是由從未使用過他們自己的 API 的理論家設計出來的。通常，現有的 XML API 都很令人討厭。他們采用一種被明確設計成非常簡單、一流且功能強大的標記語言，然后用討厭的、過多的和大型 API 對它進行封裝。甚至是為了進行最基本的 XML 樹操作，我總是不得不參考 API 文檔；沒有任何東西是憑直覺的，而且幾乎每個操作都很復雜。
雖然我并不認為它有多么令人心煩，但我同意 Russell 的觀點：XML API 對于大多數使用它們的人來說無疑帶來了過多的工作量。

示例
看下面的book.xml:

引用

<library shelf="Recent Acquisitions">  <section name="Ruby">   <book isbn="0672328844">   <title>The Ruby Way</title>   <author>Hal Fulton</author>   <description>    Second edition. The book you are now reading.    Ain't recursion grand?   </description>   </book>  </section>  <section name="Space">   <book isbn="0684835509">    <title>The Case for Mars</title>    <author>Robert Zubrin</author>    <description>Pushing toward a second home for the human     race.    </description>   </book>   <book isbn="074325631X">    <title>First Man: The Life of Neil A. Armstrong</title>    <author>James R. Hansen</author>    <description>Definitive biography of the first man on     the moon.    </description>   </book>  </section> </library>

1 Tree Parsing(也就是DOM-like)

我們需要require rexml/document 庫，并且include REXML :

require 'rexml/document' include REXML  input = File.new("books.xml") doc = Document.new(input)  root = doc.root puts root.attributes["shelf"]  # Recent Acquisitions  doc.elements.each("library/section") { |e| puts e.attributes["name"] } # Output: # Ruby # Space  doc.elements.each("*/section/book") { |e| puts e.attributes["isbn"] } # Output: # 0672328844 # 0321445619 # 0684835509 # 074325631X  sec2 = root.elements[2] author = sec2.elements[1].elements["author"].text  # Robert Zubrin

這里要注意的是xml中的屬性和值被表示為一個hash，因此我們能夠通過attributes[]來提取我們需要的值，元素的值還能通過類似于path的字符串或者整數來取得.其中用整數取的話，是1-based而不是0-based.

上一篇：Ruby程序中發送基于HTTP協議的請求的簡單示例

下一篇：Ruby程序中正則表達式的基本使用教程