Intelligent Historical Document Image Analysis (IHDIA)

Datasets

Given the large diversity in language, script and non-textual regional elements in historical Indic manuscripts, spatial layout parsing is crucial in enabling downstream applications such as OCR, word-spotting, style-and-content based retrieval and clustering. We take a significant step to address this gap and introduce Indiscapes, the first dataset with layout annotations for historical Indic manuscripts.

Access the dataset

Layout Estimation Networks

To succeed at layout parsing of manuscripts, we require a system which can accurately localize various types of regions (e.g. text lines, isolated character components, physical degradation, pictures, holes) and isolate individual instances of each region. To meet these requirements, we model our problem as one of semantic instance-level segmentation and introduce a deep-network based instance segmentation framework custom modified for fully automatic layout parsing.

Learn More

OCR

We introduce an OCR for Sanskrit texts printed in Devanagari and containing long, highly conjoined words. Our OCR achieves a word error rate of 15.97% and a character error rate of 3.71% on challenging Indic document texts.

Code and Documentation

Annotation and Analytics Systems

We propose a web-based layout annotation and analytics system. Our system, called Historic Indic Document Layout Analyzer (HInDoLA), features an intuitive annotation GUI, a graphical analytics dashboard and interfaces with machine-learning based intelligent modules on the backend. HInDoLA has successfully helped us create the first ever large-scale dataset for layout parsing of Indic palm-leaf manuscripts, which in turn has enabled us to train deep networks for fully automatic instance-level layout parsing.

Press

ETV Telangana (cable TV channel) covered our work on historical manuscript analysis in Yuva, a daily segment which covers young achievers. Watch the video below (in Telugu) and click on images for news articles.

Publications

December,2024

Contact

Dr. Ravi Kiran Sarvadevabhatla
Center for Visual Information Technologies
IIIT Hyderabad, Hyderabad 500032, INDIA

E-mail: ravi.kiran@iiit.ac.in

Datasets

Layout Estimation Networks

OCR

Annotation and Analytics Systems

Press

Publications

LineTR:Unified Text Line Segmentation for Challenging Palm Leaf Manuscripts

SeamFormer: High Precision Text Line Segmentation for Handwritten Documents

Deformable Deep Networks for Instance Segmentation of Overlapping Multi page Handwritten Documents

DocVisor: A Multi-purpose Web-Based Interactive Visualizer for Document Image Analytics

PALMIRA: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts

BoundaryNet - An Attentive Deep Network with Fast Marching Distance Maps for Semi-automatic Layout Annotation [ORAL]

An OCR for Classical Indic Documents Containing Arbitrarily Long Words

HInDoLA: A Cloud-based System for Large-Scale Annotation and Analysis of Indic Palm Leaf Manuscripts [ORAL]

Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts [ORAL]

Contact