core/oga - oga

Commit Graph

Author	SHA1	Message	Date
Yorick Peterse	9d6afb7d6f	Tweaked inline code blocks for YARD docs.	2014-11-02 23:30:28 +01:00
Yorick Peterse	bc8be9f725	Fixed various incorrect YARD tags.	2014-11-02 21:23:29 +01:00
Yorick Peterse	2e1320c2dc	Explicit return in step_modulo_value.	2014-11-02 19:32:02 +01:00
Yorick Peterse	ab8b451dc3	Parsing support for :nth-of-type()	2014-11-02 19:29:39 +01:00
Yorick Peterse	0faceffacb	Parsing support for nth-child(n+X)	2014-11-02 19:29:09 +01:00
Yorick Peterse	8d8d74ec41	Drop nth-child support of all negative sequences This removes parsing support for selectors such as :nth-child(-n-6). According to the CSS spec this isn't valid anyway (confirmed by testing it in Chromium). As a result there's no point in supporting it in any way.	2014-11-02 19:19:05 +01:00
Yorick Peterse	b31288b7d2	Use correct modulo for nth-child and negatives.	2014-11-02 18:53:23 +01:00
Yorick Peterse	9cce93fc4a	Parsing support for :nth-last-child.	2014-11-01 20:58:28 +01:00
Yorick Peterse	64f9c570fa	Use separate spec files for each pseudo class.	2014-11-01 20:20:33 +01:00
Yorick Peterse	03f897c2b7	Support for all possible nth-child arguments. That is, as far as I can tell based on Nokogiri's behaviour (which Oga now matches).	2014-10-30 23:03:46 +01:00
Yorick Peterse	0e6aefb727	Fixed parsing of nth-child(n) and nth-child(-n)	2014-10-30 00:24:31 +01:00
Yorick Peterse	87f6b9c723	Basic support for :nth-child() This already includes support for formulas such as 2n, odd, even and 2n+1. Negative formulas, just "n" and others are not yet supported.	2014-10-28 00:21:11 +01:00
Yorick Peterse	46646e2ace	Support for custom grouping of XPath expressions. This allows the use of expressions such as "(A or B) and C". This fixes #59.	2014-10-26 22:38:05 +01:00
Yorick Peterse	39c0f7147c	Support for parsing the :root pseudo class.	2014-10-26 22:20:23 +01:00
Yorick Peterse	32764c9a14	Proper parsing support for all CSS operators.	2014-10-26 12:45:43 +01:00
Yorick Peterse	24ae791f00	Better support for lexing multi-line strings. When lexing multi-line strings everything used to work fine as long as the input were to be read as a whole. However, when using an IO instance all hell would break loose. Due to the lexer reading IO instances on a per line basis, sometimes Ragel would end up setting "ts" to NULL. For example, the following input would break the lexer: <foo class="\nbar" /> Due to the input being read per line, the following data would be sent to the lexer: <foo class="\n bar" /> This would result in different (or NULL) pointers being used for building a string, in turn resulting in memory allocation errors. To work around this the string lexing setup has been broken into separate machines for single and double quoted strings. The tokens used have also been changed so that instead of just "T_STRING" there are now the following tokens: * T_STRING_SQUOTE * T_STRING_DQUOTE * T_STRING_BODY A string can have multiple T_STRING_BODY tokens (= multi-line strings, only the case for IO inputs). These strings are stitched back together by the parser. This fixes #58.	2014-10-26 11:39:56 +01:00
Yorick Peterse	fca88a69d1	Track Ragel call stacks in the Java lexer. This will be needed for the upcoming string lexing changes.	2014-10-26 11:39:19 +01:00
Yorick Peterse	d951a8cc87	Track XML C lexer state in C only. Instead of storing "act" and "cs" as an instance variable they (along with some other variables) are now stored in a struct. This struct is attached to a lexer instance using the (crappy) Data_Get_Struct/Data_Wrap_Struct API.	2014-10-26 11:38:06 +01:00
Yorick Peterse	b304b8b077	Fixed descendant-or-self with a predicate. Processing of this axis along with a predicate wouldn't quite work out. Even if the predicate returned false the node would still be matched (which should not be the case).	2014-10-23 01:12:10 +02:00
Yorick Peterse	47e4a3aa49	Added benchmark for descendant-or-self	2014-10-23 01:11:19 +02:00
Yorick Peterse	7ee7f25239	Support for parsing CSS axes.	2014-10-23 00:42:45 +02:00
Yorick Peterse	9955f61bcb	Renamed CSS axis tokens. These have been renamed as following: T_CHILD => T_GREATER T_FOLLOWING => T_TILDE T_FOLLOWING_DIRECT => T_PLUS	2014-10-21 23:25:11 +02:00
Yorick Peterse	823f2f1bad	Clean generated CSS lexer/parser files.	2014-10-21 23:21:23 +02:00
Yorick Peterse	851e7d6d0b	First pass of rewriting the CSS parser. The new parser uses way less confusing rule names, is a bit more strict and in general much less of a pain to deal with.	2014-10-21 23:19:31 +02:00
Yorick Peterse	e3de65a258	Lex whitespace preceding CSS axes separately. Previously input such as "x > y" would result in the following token sequences: T_IDENT, T_CHILD, T_IDENT This commit changes this to the following: T_IDENT, T_SPACE, T_CHILD, T_IDENT This allows the parser to use T_SPACE as a terminal token, this in turn prevents around 16 shift/reduce conflicts from arising. This does mean that input such as " > y" or " x > y" is now invalid. This however can be solved by simply _not_ adding leading/trailing whitespace to CSS queries.	2014-10-21 23:18:46 +02:00
Yorick Peterse	e2b4f51e64	Updated part of the CSS axis specs.	2014-10-20 19:07:06 +02:00
Yorick Peterse	21c27bf48e	Surround class values with spaces. When using a CSS class selector the resulting XPath string passed to contains() should be surrounded by spaces.	2014-10-20 09:29:42 +02:00
Yorick Peterse	15ebdb7de4	Fixed parsing of CSS class selectors. When a class selector is used it should be checked as one of the possible values, not as _the_ only value (unlike ID selectors).	2014-10-20 00:45:41 +02:00
Yorick Peterse	174d33c597	Re-enabled parsing of CSS predicates.	2014-10-20 00:39:12 +02:00
Yorick Peterse	d4150fd0f5	First step at rewriting the CSS parser. The new setup will not involve a separate transformation stage, instead the CSS parser will directly emit an XPath AST. This reduces the overhead needed for parsing/evaluating CSS selectors while also simplifying the code. The downside is that I basically have to re-write 80% of the parser.	2014-10-20 00:30:16 +02:00
Yorick Peterse	ea2baa2020	Swap child node order for CSS pseudo classes.	2014-10-16 23:18:14 +02:00
Yorick Peterse	63d27fa709	Swap child order of CSS class and id nodes. This makes it easier to transform the AST at a later stage.	2014-10-16 23:13:54 +02:00
Yorick Peterse	7ccd685acb	Use a helper method for transforming CSS ASTs.	2014-10-16 23:01:56 +02:00
Yorick Peterse	a85cd7cbd1	Trimmed CSS class transformer specs a bit.	2014-10-16 22:51:55 +02:00
Yorick Peterse	5fde2f9092	Basic tests for the CSS transformer.	2014-10-16 10:25:30 +02:00
Yorick Peterse	073e8fbe5b	Basic boilerplate for converting CSS to XPath.	2014-10-16 00:25:31 +02:00
Yorick Peterse	48eb4f83df	Lexing/parsing of CSS pseudos with ident arguments This allows the lexing/parsing of expressions such as "html:lang(en)".	2014-10-15 09:42:26 +02:00
Yorick Peterse	d9a4221a0a	Remove :axis CSS node types. The various axes are now simply their own node types.	2014-10-12 18:08:35 +02:00
Yorick Peterse	ed0cd7826e	Fixed precedence of ID/class CSS selectors	2014-10-07 23:05:34 +02:00
Yorick Peterse	91f9cc984b	Parsing of pseudo classes without node tests.	2014-10-07 23:01:58 +02:00
Yorick Peterse	a6b0bd96c8	Support for parsing CSS class/ID selectors.	2014-10-07 22:57:23 +02:00
Yorick Peterse	6792127600	Reworked CSS parser rules. This includes better rules for parsing separate path members, pseudo class arguments and some changes to remove all remaining parsing conflicts.	2014-10-07 22:47:47 +02:00
Yorick Peterse	b40c0243ce	Tighten up lexing of CSS predicates. Operators can now only occur inside predicates and any whitespcae in these predicates is ignored.	2014-10-07 22:17:04 +02:00
Yorick Peterse	625b9eeffd	Lexing of CSS axes with surrounding whitespace.	2014-10-07 22:06:45 +02:00
Yorick Peterse	619c0bbc14	Emit tokens for whitespace in the CSS lexer.	2014-10-07 21:55:41 +02:00
Yorick Peterse	6e18287a1d	Initial specs for parsing CSS IDs/classes.	2014-10-07 19:01:04 +02:00
Yorick Peterse	09315ea478	Test for operators inside CSS predicates.	2014-10-07 09:32:34 +02:00
Yorick Peterse	d960eb7cd5	Removed CSS lexer code that was commented out.	2014-10-07 09:29:11 +02:00
Yorick Peterse	16d66a7eb6	Better parsing for the nth-child pseudo class. This uses stricter (and more correct) rules in both the lexer and the parser. The resulting AST has also received a small rework to make it more compact and less confusing.	2014-10-06 23:52:46 +02:00
Yorick Peterse	60da2bdd3a	Use RSpec.shared_example vs just shared_example.	2014-10-05 23:52:12 +02:00

1 2 3 4 5 ...

769 Commits All Branches Search

769 Commits

All Branches