Improving ANTLR DSL parse-error messages

Question

I'm working on a domain-specific language (DSL) for non-programmers. Non-programmers make a lot of grammar mistakes: they misspell keywords, they don't close parentheses, they don't terminate blocks, etc.

I'm using ANTLR to generate my parser; it provides a nifty mechanism for handling RecognitionExceptions to improve error handling. But I'm finding it pretty hard to develop good error-handling code for my DSL.

At this point, I'm considering ways to simplify the language to make it easier for me to provide users with high-quality error messages, but I'm not really sure how to go about this. I think I want to reduce the ambiguity of errors somehow, but I'm not sure how to implement that idea in a grammar.

In what ways can I simplify my language to improve parse-error messages for my users?

EDIT: Updated to clarify that I'm interested in ways to simplify my language, not just ANTLR error-handling tips in general. (Though, thanks for those!)

Can you give us some more information about the grammar as it is right now? What is your DSL good for and what is it capable of? — Thomas Schaub
choiceofgames.com/blog/choicescript-intro I'll point out right off the bat that it's not the least bit context-free... — Dan Fabulich
If your users make common errors, you can have grammar tokens rule that match the common error and then make those rules output a error message. — Ian Ringrose

Alex Miller Alex Miller · Accepted Answer · 2010-02-15T14:39:15

I wrote an article on recovering line and column numbers in ANTLR errors a couple years ago that might be helpful.

http://tech.puredanger.com/2007/02/01/recovering-line-and-column-numbers-in-your-antlr-ast/

Improving ANTLR DSL parse-error messages

4 Answers