Taku A. Tokuyasu and Philip A. Chou
We introduce an iterative approach to recognizing
two-dimensional grammatical structure within digital
images, which we term “turbo recognition.” Inspired
by the success of turbo decoding for channel
coding of one-dimensional sequences, we develop
a recognition scheme for images based on
two independent views of the same underlying message.
These correspond to two independent image
sources, one in the horizontal direction and the other
in the vertical direction, which are driven by a single
input message. The recognition process proceeds
iteratively, first along one direction and then
the other, applying the Forward/Backward algorithm
to derive a new prior probability distribution on the
input message for the orthogonal recognition step.
The scheme holds promise as a principled approach within
the Document Image Decoding (DID) framework
for the recognition of nontrivial 2D layout structure
such as tables.
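The alternation described above can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not specify the image source models, so the sketch assumes a toy setting in which each grid cell holds a hidden label, each row and each column is a simple Markov chain, and both directions share one per-cell likelihood table. The names `forward_backward`, `directional_pass`, and `turbo_recognize` are hypothetical. Each pass runs the Forward/Backward algorithm along one direction and hands the orthogonal pass only the "extrinsic" part of the posterior (the posterior divided by the local evidence it was given), so that evidence is not counted twice. Because the grid contains loops, the resulting marginals are approximate.

```python
def normalize(weights):
    """Scale non-negative weights so they sum to 1."""
    total = sum(weights)
    return [w / total for w in weights]


def forward_backward(evidence, trans):
    """Posterior state marginals for one chain.

    evidence[t][s]: local weight (prior times likelihood) of state s at position t.
    trans[r][s]:    probability of moving from state r to state s.
    A uniform initial state distribution is assumed.
    """
    n, k = len(evidence), len(trans)
    # Forward pass, normalized at every step to avoid underflow.
    alpha = [normalize(evidence[0])]
    for t in range(1, n):
        prev = alpha[-1]
        alpha.append(normalize([
            evidence[t][s] * sum(prev[r] * trans[r][s] for r in range(k))
            for s in range(k)
        ]))
    # Backward pass, also normalized.
    beta = [[1.0] * k for _ in range(n)]
    for t in range(n - 2, -1, -1):
        beta[t] = normalize([
            sum(trans[s][r] * evidence[t + 1][r] * beta[t + 1][r] for r in range(k))
            for s in range(k)
        ])
    return [normalize([a * b for a, b in zip(alpha[t], beta[t])]) for t in range(n)]


def directional_pass(like, prior, trans, horizontal):
    """Run Forward/Backward along every row (or every column) of the grid.

    Returns (posterior, extrinsic); the extrinsic grid serves as the prior
    for the orthogonal pass. All weights are assumed strictly positive.
    """
    rows, cols = len(like), len(like[0])
    post = [[None] * cols for _ in range(rows)]
    extrinsic = [[None] * cols for _ in range(rows)]
    if horizontal:
        lines = [[(r, c) for c in range(cols)] for r in range(rows)]
    else:
        lines = [[(r, c) for r in range(rows)] for c in range(cols)]
    for cells in lines:
        evidence = [[p * l for p, l in zip(prior[r][c], like[r][c])] for r, c in cells]
        marginals = forward_backward(evidence, trans)
        for (r, c), gamma, ev in zip(cells, marginals, evidence):
            post[r][c] = gamma
            # Divide out the local evidence so only neighbor information is passed on.
            extrinsic[r][c] = normalize([g / e for g, e in zip(gamma, ev)])
    return post, extrinsic


def turbo_recognize(like, trans_h, trans_v, iterations=3):
    """Alternate horizontal and vertical passes, starting from a uniform prior."""
    rows, cols, k = len(like), len(like[0]), len(like[0][0])
    prior = [[[1.0 / k] * k for _ in range(cols)] for _ in range(rows)]
    post = prior
    for _ in range(iterations):
        post, prior = directional_pass(like, prior, trans_h, horizontal=True)
        post, prior = directional_pass(like, prior, trans_v, horizontal=False)
    return post


if __name__ == "__main__":
    sticky = [[0.9, 0.1], [0.1, 0.9]]          # neighbors tend to share a label
    like = [[[0.3, 0.7] for _ in range(3)] for _ in range(3)]
    like[1][1] = [0.55, 0.45]                  # one noisy cell weakly favors label 0
    result = turbo_recognize(like, sticky, sticky)
    print(result[1][1])
```

In this toy grid the center cell's own observation weakly favors label 0, while its row and column neighbors favor label 1; the alternating passes let that context override the noisy local evidence.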
Published in: IAPR Workshop on Document Layout Interpretation and its Applications