diff --git a/doc/changelog.md b/doc/changelog.md index 98b41a3..81e9292 100644 --- a/doc/changelog.md +++ b/doc/changelog.md @@ -2,37 +2,103 @@ ## 0.2.0 - Unreleased +### SAX API + A SAX parser/API has been added. This API is useful when even the overhead of -the pull-parser is too much memory wise. +the pull-parser is too much memory wise. Example: + + class ElementNames + attr_reader :names + + def initialize + @names = [] + end + + def on_element(namespace, name, attrs = {}) + @names << name + end + end + + handler = ElementNames.new + + Oga.sax_parse_xml(handler, '') + + handler.names # => ["foo", "bar"] + +### Racc Gem Oga will now always use the Racc gem instead of the version shipped with the Ruby standard library. +### Error Reporting + XML parser errors have been made a little bit more user friendly, though they can still be quite cryptic. +### Serializing Elements + Elements serialized to XML/HTML will use self-closing tags whenever possible. When parsing HTML documents only HTML void elements will use self-closing tags -(e.g. `` tags). +(e.g. `` tags). Example: + + Oga.parse_xml('').to_xml # => "" + Oga.parse_html('').to_xml # => "" + +### Default Namespaces Namespaces are no longer removed from the attributes list when an element is created. Default XML namespaces can now be registered using `xmlns="..."`. Previously -this would be ignored. +this would be ignored. Example: -Oga can now lex input such as `') + root = document.children[0] + + root.namespace # => Namespace(name: "xmlns" uri: "baz") + +### Lexing Incomplete Input + +Oga can now lex input such as ` Document(children: NodeSet(Text("') + + document.xpath('/') # => NodeSet(Document(children: NodeSet(Element(name: "root")))) + +### Namespace Ordering Namespaces available to an element are now returned in the correct order. Previously outer namespaces would take precedence over inner namespaces, instead -of it being the other way around. +of it being the other way around. Example: + + document = Oga.parse_xml <<-EOF + + + Text! + + + EOF + + foo = document.at_xpath('root/container/foo:text') + + foo.namespace # => Namespace(name: "foo" uri: "baz") + +### Parsing Capitalized HTML Void Elements Oga is now capable of parsing capitalized HTML void elements (e.g. `
`). Previously it could only parse lower-cased void elements. Thanks to Tero Tasanen -for fixing this. +for fixing this. Example: + + Oga.parse_html('
') # => Document(children: NodeSet(Element(name: "BR"))) + +### Node Type Method Removed The `node_type` method has been removed and its purpose has been moved into the `XML::PullParser` class itself. This method was solely used by the pull