CorrectOCR.correcter module¶
Correction Interface¶
The annotator will be presented with the tokens that match a heuristic bin that was marked for annotation.
They may then enter a command. The commands reflect the above settings,
with an additional defer
command to defer decision to a later time.
Prefixing the entered text with an exclamation point causes it to be
considered the corrected version of the token. For example, if the token
is “Wagor” and no suitable candidate is available, the annotator may
enter !Wagon
to correct the word.
Corrections are memoized, so the file need not be corrected fully in one
session. To finish a session and save corrections, use the quit
command.
A help
command is available in the interface.
See also the Server
for a HTTP backend.
- class CorrectOCR.correcter.CorrectionShell(tokens, dictionary, correctionTracking)[source]¶
Bases:
cmd.Cmd
Interactive shell for making corrections to a list of tokens. Assumes that the tokens are binned.
Instantiate a line-oriented interpreter framework.
The optional argument ‘completekey’ is the readline name of a completion key; it defaults to the Tab key. If completekey is not None and the readline module is available, command completion is done automatically. The optional arguments stdin and stdout specify alternate input and output file objects; if not specified, sys.stdin and sys.stdout are used.