MediaWiki Grammar Recovery
The paper describes in detail the recovery effort of one of the official MediaWiki grammars. Over two hundred grammar transformation steps are reported and annotated, leading to delivery of a level 2 grammar, semi-automatically extracted from a community created semi-formal text using at least five different syntactic notations, several non-enforced naming conventions, multiple misspellings, obsolete parsing technology idiosyncrasies and other problems commonly encountered in grammars that were not engineered properly. Having a quality grammar will allow to test and validate it further, without alienating the community with a separately developed grammar.
|Keywords||grammar recovery, grammarware, grammar extraction, wiki, mediawiki, parsing|
|ACM||Grammars and Other Rewriting Systems (acm F.4.2), Formal Languages (acm F.4.3)|
|MSC||Grammars and rewriting systems (msc 68Q42)|
|THEME||Software (theme 1)|
|Publisher||Cornell University Library|
|Series||arXiv.org e-Print archive|
|Project||GrammarLab: Foundations of a Grammar Laboratory , GrammarLab: Foundations of a Grammar Laboratory|
|Grant||This work was funded by the The Netherlands Organisation for Scientific Research (NWO); grant id nwo/612.001.007 - GrammarLab: Foundations of a Grammar Laboratory|
Zaytsev, V. (2011). MediaWiki Grammar Recovery. arXiv.org e-Print archive. Cornell University Library.