The paper describes in detail the recovery effort of one of the official MediaWiki grammars. Over two hundred grammar transformation steps are reported and annotated, leading to delivery of a level 2 grammar, semi-automatically extracted from a community created semi-formal text using at least five different syntactic notations, several non-enforced naming conventions, multiple misspellings, obsolete parsing technology idiosyncrasies and other problems commonly encountered in grammars that were not engineered properly. Having a quality grammar will allow to test and validate it further, without alienating the community with a separately developed grammar.

Additional Metadata
Keywords grammar recovery, grammarware, grammar extraction, wiki, mediawiki, parsing
ACM Grammars and Other Rewriting Systems (acm F.4.2), Formal Languages (acm F.4.3)
MSC Grammars and rewriting systems (msc 68Q42)
THEME Software (theme 1)
Publisher Cornell University Library
Series arXiv.org e-Print archive
Project GrammarLab: Foundations of a Grammar Laboratory , GrammarLab: Foundations of a Grammar Laboratory
Grant This work was funded by the The Netherlands Organisation for Scientific Research (NWO); grant id nwo/612.001.007 - GrammarLab: Foundations of a Grammar Laboratory
Citation
Zaytsev, V. (2011). MediaWiki Grammar Recovery. arXiv.org e-Print archive. Cornell University Library.