Home Publications Fast Perspective Recovery of Text in Natural Scenes

Skip to content. | Skip to navigation

Carlos Merino-Gracia, Majid Mirmehdi, José Sigut and José L González-Mora (2013)

Fast Perspective Recovery of Text in Natural Scenes

Image and Vision Computing, 31(10):714–724.


Cheap, ubiquitous, high-resolution digital cameras have led to opportunities that demand camera-based text understanding, such as wearable computing or assistive technology. Perspective distortion is one of the main challenges for text recognition in camera captured images since the camera may often not have a fronto-parallel view of the text. We present a method for perspective recovery of text in natural scenes, where text can appear as isolated words, short sentences or small paragraphs (as found on posters, billboards, shop and street signs etc.). It relies on the geometry of the characters themselves to estimate a rectifying homography for every line of text, irrespective of the view of the text over a large range of orientations. The horizontal perspective foreshortening is corrected by fitting two lines to the top and bottom of the text, while the vertical perspective foreshortening and shearing are estimated by performing a linear regression on the shear variation of the individual characters within the text line. The proposed method is efficient and fast. We present comparative results with improved recognition accuracy against the current state-of-the-art.

scene text extraction, perspective recovery, homography rectification

This work is part of our project to develop a text reading system for blind people.

Document Actions