Skip to content

ReppPet

RebeccaDridan edited this page Aug 5, 2011 · 4 revisions

As of August 2011, PET now includes native support for the Regular Expression Pre-Processor (REPP; see ReppTop). Repp processing can be requested using the -repp option to cheap, optionally with the name of a settings file as an argument: -repp=wiki. Settings files will be looked for in the base grammar directory, or the general settings directory (pet/). A default settings file, repp.set, is currently included in common.set in the trunk version of the ERG. That file contains these settings (which can each be overridden as required):

;;
;; REPP tokenization builds on a collection of rule sets, each in a file of its
;; own.  these are called modules (or at times just REPPs), and all are loaded
;; into the processor.  a specific configuration is then obtained by picking
;; one REPP module as the top-level entry point, and determining which named
;; group calls (to other modules) should be allowed, if called.  the following
;; is the global set of available modules.
;;
repp-modules := tokenizer xml latex ascii wiki robustness quotes lkb.

;;
;; the REPP module to provide the top-level entry point.
;;
repp-tokenizer := tokenizer.

;;
;; REPP modules can be parameterized in terms of external named groups, which
;; conceptually resemble sub-routines and can be activated or deactivated; the
;; following is the default list of groups to activate (and may be overwritten
;; in indivudal REPP configurations).
;;
repp-calls := xml ascii quotes.
Clone this wiki locally