apertium-unformat
—
unformatted text extractor for Apertium
apertium-unformat |
[-f format]
[infile [outfile]] |
apertium
is the application that extract unformatted
text from documents.
-f
format
- Specifies the format of the input and output files which can have these
values:
txt
- (default value) Input and output files are in text format.
html
- Input and output files are in “html” format. This
“html” is the one acceptd by the vast majority of web
browsers.
rtf
- Input and output files are in “rtf” format. The accepted
“rtf” is the one generated by Microsoft WordPad and
Microsoft Office up to and including BOffice 97.
- infile
- Input file (stdin by default).
- outfile
- Output file (stdout by default).
Copyright © 2005, 2006 Universitat d'Alacant / Universidad de Alicante.
This is free software. You may redistribute copies of it under the terms of
the GNU General
Public License.
Many... lurking in the dark and waiting for you!