oga/spec/oga
Yorick Peterse 6b779d7883 Handle lexing of stray quotes in element heads
This adds lexing support for HTML/XML such as:

    <foo bar="""></foo>

While technically invalid, HTML just like this appears on some websites
(e.g. yahoo.com).

The lexer handles this as follows:

1. When we're in the "element_head" machine, do business as usual until
   we bump into a "=".

2. Call (using Ragel's "fcall") the machine to use for processing the
   attribute value (if any).

3. In this machine quoted strings are processed. The moment a string has
   been processed the lexer jumps right back into the "element_head"
   machine. This ensures that any stray quotes are ignored instead of
   being processed as extra attribute values (eventually leading to
   parsing errors due to unbalanced quotes).
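
The three steps above can be sketched in plain Ruby. This is only a rough, hand-written stand-in for the Ragel machines, not the actual lexer: the method names `lex_element_head` and `lex_attribute_value`, the token names and the regular expressions are all assumptions made for illustration.

```ruby
require 'strscan'

# Hypothetical stand-in for the "element_head" machine. It lexes the part of
# an element head that follows the tag name, e.g. ' bar="""'.
def lex_element_head(input)
  scanner = StringScanner.new(input)
  tokens  = []

  until scanner.eos?
    if scanner.skip(/\s+/)
      # Whitespace between attributes is ignored.
    elsif (name = scanner.scan(/[^\s="']+/))
      # Step 1: business as usual until we bump into a "=".
      tokens << [:attr_name, name]
    elsif scanner.skip(/=/)
      # Step 2: hand over to the value machine (Ragel would use "fcall").
      value = lex_attribute_value(scanner)
      tokens << [:attr_value, value] if value
    else
      # Back in the element head: a stray quote is skipped instead of
      # being treated as the start of another attribute value.
      scanner.getch
    end
  end

  tokens
end

# Hypothetical stand-in for the attribute value machine. It consumes at most
# one value and then returns to the caller (step 3).
def lex_attribute_value(scanner)
  if scanner.scan(/"([^"]*)"/) || scanner.scan(/'([^']*)'/)
    scanner[1] # contents of the quoted string, without the quotes
  else
    scanner.scan(/[^\s"'>]+/) # unquoted value, or nil when there is none
  end
end
```

With this sketch, `lex_element_head(' bar="""')` yields one attribute name and one empty attribute value; the third quote is dropped once control is back in the element head.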
2015-04-15 22:33:53 +02:00
..
css Added CSS/XPath Parser.parse_with_cache 2015-03-23 00:22:59 +01:00
html Lazy decoding of XML/HTML entities. 2015-03-05 23:00:43 +01:00
xml Handle lexing of stray quotes in element heads 2015-04-15 22:33:53 +02:00
xpath Ignore declared namespaces in HTML documents 2015-03-26 22:38:39 +01:00
entity_decoder_spec.rb Added Oga::EntityDecoder 2015-04-07 21:18:15 +02:00
lru_spec.rb Added LRU#maximum=/maximum 2015-03-23 00:26:48 +01:00
oga_spec.rb Use sax_parse_html for HTML documents. 2015-03-21 01:22:59 +01:00