MFTRAINING(1)   MFTRAINING(1)

mftraining - feature training for Tesseract

mftraining -U unicharset -O lang.unicharset FILE...

mftraining takes a list of .tr files, from which it generates the files inttemp (the shape prototypes), shapetable, and pffmtable (the number of expected features for each character). (A fourth file called Microfeat is also written by this program, but it is not used.)

-U FILE
(Input) The unicharset generated by unicharset_extractor(1).

-F font_properties_file

(Input) Font properties file. Each line has the following form, where every field other than the font name is 0 or 1:

*font_name* *italic* *bold* *fixed_pitch* *serif* *fraktur*
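
As a rough illustration of the line format above, the following sketch parses font properties text and rejects any flag that is not 0 or 1. It is not part of Tesseract: the names parse_font_properties and FontProperties and the sample font names are hypothetical, and the sketch assumes font names contain no whitespace.

```python
from dataclasses import dataclass

# Field order taken from the line format documented for -F.
FLAG_NAMES = ("italic", "bold", "fixed_pitch", "serif", "fraktur")


@dataclass(frozen=True)
class FontProperties:
    italic: bool
    bold: bool
    fixed_pitch: bool
    serif: bool
    fraktur: bool


def parse_font_properties(text):
    """Parse font properties text into {font_name: FontProperties}.

    Assumption of this sketch: fields are whitespace-separated and the
    font name itself contains no whitespace.
    """
    fonts = {}
    for lineno, line in enumerate(text.splitlines(), start=1):
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 1 + len(FLAG_NAMES):
            raise ValueError(
                f"line {lineno}: expected 6 fields, got {len(fields)}")
        name, flags = fields[0], fields[1:]
        if any(flag not in ("0", "1") for flag in flags):
            raise ValueError(f"line {lineno}: every flag must be 0 or 1")
        fonts[name] = FontProperties(*(flag == "1" for flag in flags))
    return fonts


# Hypothetical sample content, for illustration only.
sample = "timesitalic 1 0 0 1 0\ncourier 0 0 1 1 0\n"
fonts = parse_font_properties(sample)
print(fonts["timesitalic"])
```

Checking the file this way before training catches a malformed line early, rather than leaving it to surface during the mftraining run.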

-X xheights_file

(Input) x heights file. Each line has the following form, where xheight is the x height, in pixels, of a character drawn at 32pt at 300 dpi. [ 32pt at 300 dpi is 32/72 x 300, or about 133 pixels. That is, if base x height + ascenders + descenders = 133, how much of that is the x height? ]

*font_name* *xheight*
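
The arithmetic behind the 133-pixel figure can be sketched as follows. One point is 1/72 inch, so a 32pt em box at 300 dpi is about 133 pixels tall. The x height is then some fraction of that, which depends on the font. The ratio 0.5 and the font name used here are purely illustrative assumptions, not measurements of any real font, and rounding to a whole number of pixels is also an assumption of this sketch.

```python
def em_height_pixels(point_size=32.0, dpi=300.0):
    """Height in pixels of the full em box (1 point = 1/72 inch)."""
    return point_size / 72.0 * dpi


def xheight_entry(font_name, xheight_ratio, point_size=32.0, dpi=300.0):
    """Format one 'font_name xheight' line.

    xheight_ratio is a hypothetical x height / em height fraction that
    would have to be measured for the font in question.
    """
    xheight = round(em_height_pixels(point_size, dpi) * xheight_ratio)
    return f"{font_name} {xheight}"


print(round(em_height_pixels()))          # 133
print(xheight_entry("examplefont", 0.5))  # examplefont 67
```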

-D dir

Directory to write output files to.

-O FILE

(Output) The output unicharset that will be given to combine_tessdata(1).
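
The options above can be assembled into a command line programmatically. The sketch below only builds the argument list and does not run anything. The helper name build_mftraining_argv and the file names are hypothetical, and treating -F, -X and -D as optional is an assumption of this sketch rather than something this page states.

```python
import shlex


def build_mftraining_argv(unicharset, output_unicharset, tr_files,
                          font_properties=None, xheights=None,
                          output_dir=None):
    """Build an argv list for mftraining from the documented options."""
    tr_files = list(tr_files)
    if not tr_files:
        raise ValueError("at least one .tr file is required")
    argv = ["mftraining"]
    if font_properties is not None:
        argv += ["-F", font_properties]   # font properties file (input)
    if xheights is not None:
        argv += ["-X", xheights]          # x heights file (input)
    if output_dir is not None:
        argv += ["-D", output_dir]        # directory for output files
    argv += ["-U", unicharset]            # unicharset (input)
    argv += ["-O", output_unicharset]     # unicharset (output)
    return argv + tr_files


# Hypothetical file names, for illustration only.
argv = build_mftraining_argv(
    "unicharset",
    "lang.unicharset",
    ["lang.examplefont.exp0.tr"],
    font_properties="font_properties",
)
print(shlex.join(argv))
```

The resulting list could be passed to subprocess.run(argv, check=True) on a system where mftraining is installed. Passing a list rather than a shell string avoids quoting problems with unusual file names.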

tesseract(1), cntraining(1), unicharset_extractor(1), combine_tessdata(1), shapeclustering(1), unicharset(5)

https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html

Copyright (C) Hewlett-Packard Company, 1988. Licensed under the Apache License, Version 2.0.

The Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995) and Google (2006-present).
06/07/2022  
