Script-agnostic reflow of text in document images

Reading text from document images can be difficult on mobile devices due to the limited screen width available on them. While there exist solutions for reflowing Latin-script texts on such devices, these solutions do not work well for images of other scripts or combinations of scripts, since they rely on script-specific characteristics or OCR. We present a technique that reflows text in document images in a manner that is agnostic to the script used to compose them. Our technique achieved over 95% segmentation accuracy for a corpus of 139 images containing text in 4 genetically-distant languages—English, Hindi, Kannada and Arabic. A preliminary user study with a prototype implementation of the technique provided evidence of some of its usability benefits.

MobileHCI2011-Panjwani-reflow.pdf
PDF file

In  MobileHCI 2011: Proceedings of the 13th International Conference on Human Computer Interaction with Mobile Devices and Services

Publisher  ACM

Details

TypeInproceedings
URLhttp://doi.acm.org/10.1145/2037373.2037419
Pages299–302
SeriesMobileHCI '11
ISBN978-1-4503-0541-9
AddressNew York, NY, USA
> Publications > Script-agnostic reflow of text in document images