set_unicharset_properties - Man Page

set properties about the unichars

Synopsis

set_unicharset_properties --U input_unicharsetfile --script_dir /path/to/langdata --O output_unicharsetfile

Description

set_unicharset_properties(1) reads a unicharset file, puts the result in a UNICHARSET object, fills it with properties about the unichars it contains and writes the result back to another unicharset file.

Options

--script_dir /path/to/langdata

(Input) Specify the location of directory for universal script unicharsets and font xheights (type:string default:)

--U unicharsetfile

(Input) Specify the location of the unicharset to load as input.

--O unicharsetfile

(Output) Specify the location of the unicharset to be written with updated properties.

History

set_unicharset_properties(1) was first made available for tesseract version 3.03.

Resources

Main web site: https://github.com/tesseract-ocr Information on training: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract.html

See Also

tesseract(1)

Copying

Copyright (C) 2012 Google, Inc. Licensed under the Apache License, Version 2.0

Author

The Tesseract OCR engine was written by Ray Smith and his research groups at Hewlett Packard (1985-1995) and Google (2006-present).

Referenced By

combine_lang_model(1).

02/05/2024