Removing indirect left recursion from this grammar

Question

I'm trying to figure out how to remove the indirect left recursion from the logical keyword expressions within my Rust port of a Ruby parser (https://github.com/kenaniah/ruby-parser/blob/master/ruby-parser/src/parsers/expression/logical.rs). The grammar looks like:

E --> N | A | O | t
N --> n E
A --> E a E
O --> E o E

E = expression
A = keyword_and_expression
O = keyword_or_expression
N = keyword_not_expression

How would I go about transforming this to remove the recursion in A and O?

As the source code shows I have already removed the left recursion from those (those were cases of direct left recursion, which I know how to handle). It's the indirect recursion that's tripping me up as I'm not sure how to transform it into an equivalent grammar. — Kenaniah

Kenaniah Kenaniah · Accepted Answer · 2020-09-01T19:52:02

According to this factorization tool, the resulting grammar would be:

E  -> N
    | A
    | O
    | t
N  -> n E
A  -> n E a E A'
    | O a E A'
    | t a E A'
O  -> n E o E O'
    | n E a E A' o E O'
    | t a E A' o E O'
    | t o E O'
A' -> a E A'
    | ϵ
O' -> a E A' o E O'
    | o E O'
    | ϵ

Looks like the factorizations for A and O ended up being rather complex thanks to the multiple productions of E.

Removing indirect left recursion from this grammar

2 Answers