Added headings + examples to the changelog.
This commit is contained in:
parent
fb560429aa
commit
a2e5def263
|
@ -2,37 +2,103 @@
|
||||||
|
|
||||||
## 0.2.0 - Unreleased
|
## 0.2.0 - Unreleased
|
||||||
|
|
||||||
|
### SAX API
|
||||||
|
|
||||||
A SAX parser/API has been added. This API is useful when even the overhead of
|
A SAX parser/API has been added. This API is useful when even the overhead of
|
||||||
the pull-parser is too much memory wise.
|
the pull-parser is too much memory wise. Example:
|
||||||
|
|
||||||
|
class ElementNames
|
||||||
|
attr_reader :names
|
||||||
|
|
||||||
|
def initialize
|
||||||
|
@names = []
|
||||||
|
end
|
||||||
|
|
||||||
|
def on_element(namespace, name, attrs = {})
|
||||||
|
@names << name
|
||||||
|
end
|
||||||
|
end
|
||||||
|
|
||||||
|
handler = ElementNames.new
|
||||||
|
|
||||||
|
Oga.sax_parse_xml(handler, '<foo><bar></bar></foo>')
|
||||||
|
|
||||||
|
handler.names # => ["foo", "bar"]
|
||||||
|
|
||||||
|
### Racc Gem
|
||||||
|
|
||||||
Oga will now always use the Racc gem instead of the version shipped with the
|
Oga will now always use the Racc gem instead of the version shipped with the
|
||||||
Ruby standard library.
|
Ruby standard library.
|
||||||
|
|
||||||
|
### Error Reporting
|
||||||
|
|
||||||
XML parser errors have been made a little bit more user friendly, though they
|
XML parser errors have been made a little bit more user friendly, though they
|
||||||
can still be quite cryptic.
|
can still be quite cryptic.
|
||||||
|
|
||||||
|
### Serializing Elements
|
||||||
|
|
||||||
Elements serialized to XML/HTML will use self-closing tags whenever possible.
|
Elements serialized to XML/HTML will use self-closing tags whenever possible.
|
||||||
When parsing HTML documents only HTML void elements will use self-closing tags
|
When parsing HTML documents only HTML void elements will use self-closing tags
|
||||||
(e.g. `<link>` tags).
|
(e.g. `<link>` tags). Example:
|
||||||
|
|
||||||
|
Oga.parse_xml('<foo></foo>').to_xml # => "<foo />"
|
||||||
|
Oga.parse_html('<script></script>').to_xml # => "<script></script>"
|
||||||
|
|
||||||
|
### Default Namespaces
|
||||||
|
|
||||||
Namespaces are no longer removed from the attributes list when an element is
|
Namespaces are no longer removed from the attributes list when an element is
|
||||||
created.
|
created.
|
||||||
|
|
||||||
Default XML namespaces can now be registered using `xmlns="..."`. Previously
|
Default XML namespaces can now be registered using `xmlns="..."`. Previously
|
||||||
this would be ignored.
|
this would be ignored. Example:
|
||||||
|
|
||||||
Oga can now lex input such as `</` without entering an infinite loop.
|
document = Oga.parse_xml('<root xmlns="baz"></root>')
|
||||||
|
root = document.children[0]
|
||||||
|
|
||||||
|
root.namespace # => Namespace(name: "xmlns" uri: "baz")
|
||||||
|
|
||||||
|
### Lexing Incomplete Input
|
||||||
|
|
||||||
|
Oga can now lex input such as `</` without entering an infinite loop. Example:
|
||||||
|
|
||||||
|
Oga.parse_xml('</') # => Document(children: NodeSet(Text("</")))
|
||||||
|
|
||||||
|
### Absolute XPath Paths
|
||||||
|
|
||||||
Oga can now parse and evaluate the XPath expression "/" (that is, just "/").
|
Oga can now parse and evaluate the XPath expression "/" (that is, just "/").
|
||||||
This will return the root node (usually a Document instance).
|
This will return the root node (usually a Document instance). Example:
|
||||||
|
|
||||||
|
document = Oga.parse_xml('<root></root>')
|
||||||
|
|
||||||
|
document.xpath('/') # => NodeSet(Document(children: NodeSet(Element(name: "root"))))
|
||||||
|
|
||||||
|
### Namespace Ordering
|
||||||
|
|
||||||
Namespaces available to an element are now returned in the correct order.
|
Namespaces available to an element are now returned in the correct order.
|
||||||
Previously outer namespaces would take precedence over inner namespaces, instead
|
Previously outer namespaces would take precedence over inner namespaces, instead
|
||||||
of it being the other way around.
|
of it being the other way around. Example:
|
||||||
|
|
||||||
|
document = Oga.parse_xml <<-EOF
|
||||||
|
<root xmlns:foo="bar">
|
||||||
|
<container xmlns:foo="baz">
|
||||||
|
<foo:text>Text!</foo:text>
|
||||||
|
</container>
|
||||||
|
</root>
|
||||||
|
EOF
|
||||||
|
|
||||||
|
foo = document.at_xpath('root/container/foo:text')
|
||||||
|
|
||||||
|
foo.namespace # => Namespace(name: "foo" uri: "baz")
|
||||||
|
|
||||||
|
### Parsing Capitalized HTML Void Elements
|
||||||
|
|
||||||
Oga is now capable of parsing capitalized HTML void elements (e.g. `<BR>`).
|
Oga is now capable of parsing capitalized HTML void elements (e.g. `<BR>`).
|
||||||
Previously it could only parse lower-cased void elements. Thanks to Tero Tasanen
|
Previously it could only parse lower-cased void elements. Thanks to Tero Tasanen
|
||||||
for fixing this.
|
for fixing this. Example:
|
||||||
|
|
||||||
|
Oga.parse_html('<BR>') # => Document(children: NodeSet(Element(name: "BR")))
|
||||||
|
|
||||||
|
### Node Type Method Removed
|
||||||
|
|
||||||
The `node_type` method has been removed and its purpose has been moved into
|
The `node_type` method has been removed and its purpose has been moved into
|
||||||
the `XML::PullParser` class itself. This method was solely used by the pull
|
the `XML::PullParser` class itself. This method was solely used by the pull
|
||||||
|
|
Loading…
Reference in New Issue