apertium-desmediawiki - MediaWiki format processor for Apertium
apertium-desmediawiki [input_file [output_file]]
apertium-desmediawiki is a processor for MediaWiki XML dumps (i.e., those
produced using Special:Export). Data should be passed through this processor
before being piped to lt-proc(1).
The program takes input in the form of a text file and produces output
suitable for processing with lt-proc(1). Format information (newlines, tabs,
etc.) is enclosed in brackets so that lt-proc(1) treats it as whitespace
between words.
-h, --help    Display this help.
You could write the following to show how the word “gener” is analysed:
echo "gener" | apertium-desmediawiki | lt-proc ca-es.automorf.bin
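The same kind of pipeline can read from a file by using the optional input_file argument from the synopsis. The following is a minimal sketch, not part of Apertium: the function name analyse_dump and the file name dump.xml are hypothetical, and it assumes the processor writes to standard output when no output_file is given.

```shell
# Sketch only: analyse_dump is a hypothetical helper, not an Apertium command.
# Assumes apertium-desmediawiki prints to standard output when output_file
# is omitted.
analyse_dump() {
    input_file=$1   # e.g. a dump saved from Special:Export
    dictionary=$2   # e.g. ca-es.automorf.bin

    # Fail early if either tool is missing from PATH.
    for tool in apertium-desmediawiki lt-proc; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "$tool not found in PATH" >&2
            return 127
        fi
    done

    # Deformat the dump, then pass the result to the analyser.
    apertium-desmediawiki "$input_file" | lt-proc "$dictionary"
}

# Example call (hypothetical file name):
# analyse_dump dump.xml ca-es.automorf.bin
```

Checking for both tools first keeps a missing installation from showing up as a confusing broken-pipe error.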
Copyright © 2005, 2006 Universitat d'Alacant / Universidad de Alicante.
This is free software. You may redistribute copies of it under the terms of
the GNU General Public License.
Complicated links, such as [[page|alternative text]] and [[link]]s, are not
supported.
The MediaWiki parser has special support for mixing literal apostrophes
with apostrophes used as formatting. This is not supported either.
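One possible workaround for the link limitation is to reduce links to their visible text before running the processor. The following is a rough sketch using sed. It is not part of Apertium, the function name simplify_links is hypothetical, and it handles only the two link forms mentioned above.

```shell
# Sketch only: simplify_links is a hypothetical helper, not an Apertium tool.
# It rewrites [[page|alternative text]] to "alternative text" and
# [[link]] to "link"; nested or otherwise unusual links are left untouched.
simplify_links() {
    sed -E \
        -e 's/\[\[[^]|]*\|([^]]*)\]\]/\1/g' \
        -e 's/\[\[([^]|]*)\]\]/\1/g'
}

# Example call:
# echo 'See [[page|alternative text]] and [[link]]s.' | simplify_links
```

The piped links are rewritten first so that the second expression only ever sees plain [[link]] forms.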