Isolated Handwritten Telugu Character Dataset

This dataset contains approximately 270 samples of each of 166 Telugu "characters" written by native Telugu writers. The data was collected on Acecad Digimemo electronic clipboard devices with the Digimemo-DCT application, and is stored in the standard UNIPEN format.
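As a rough illustration of reading online data of this kind, the following sketch extracts pen-down strokes from UNIPEN-style text. It assumes the files use the UNIPEN keywords `.COORD`, `.PEN_DOWN` and `.PEN_UP` with one whitespace-separated point per line. The function name `parse_unipen_strokes` and the sample text are hypothetical, and the exact layout of the files in this dataset should be checked before relying on it.

```python
def parse_unipen_strokes(text):
    """Return a list of strokes; each stroke is a list of (x, y) tuples.

    Assumption: points follow a .PEN_DOWN line, one point per line, and the
    column order is given by a .COORD line (defaulting to X first, Y second).
    """
    x_idx, y_idx = 0, 1            # default column positions
    strokes, current = [], None    # current is None while the pen is up
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("."):
            # Any keyword line ends the stroke being collected, if there is one.
            if current:
                strokes.append(current)
            parts = line.split()
            if parts[0] == ".COORD" and "X" in parts and "Y" in parts:
                cols = parts[1:]
                x_idx, y_idx = cols.index("X"), cols.index("Y")
            current = [] if parts[0] == ".PEN_DOWN" else None
            continue
        if current is not None:
            fields = line.split()
            try:
                current.append((float(fields[x_idx]), float(fields[y_idx])))
            except (IndexError, ValueError):
                pass  # skip lines that are not numeric points
    if current:
        strokes.append(current)
    return strokes


# Hypothetical sample, not taken from the dataset.
SAMPLE = """\
.VERSION 1.0
.COORD X Y
.PEN_DOWN
10 20
11 22
.PEN_UP
.PEN_DOWN
30 40
.PEN_UP
"""

if __name__ == "__main__":
    for stroke in parse_unipen_strokes(SAMPLE):
        print(stroke)
```

Points outside a pen-down segment are ignored here, since only the written trace is normally needed for recognition.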

An offline version of the data is also available as bi-level TIFF images, generated from the online data by simple piecewise linear interpolation with a constant thickening factor.
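The conversion from online strokes to a bi-level image can be sketched as follows. This is only an illustration of piecewise linear interpolation with constant thickening, not the procedure actually used to produce the TIFF files. It assumes the stroke coordinates are already scaled to pixel units with the origin at the top-left. The names `render_strokes` and `radius` are hypothetical, and writing the result to a TIFF file (which would need an imaging library) is not shown.

```python
def render_strokes(strokes, width, height, radius=0):
    """Rasterize strokes into a bi-level image (a list of rows of 0/1).

    Consecutive points of a stroke are joined by straight segments, and
    every pixel on a segment is thickened into a square of side
    2 * radius + 1, clipped to the image bounds.
    """
    image = [[0] * width for _ in range(height)]

    def stamp(cx, cy):
        # Constant thickening: set a square neighbourhood around (cx, cy).
        for y in range(cy - radius, cy + radius + 1):
            for x in range(cx - radius, cx + radius + 1):
                if 0 <= x < width and 0 <= y < height:
                    image[y][x] = 1

    for stroke in strokes:
        if len(stroke) == 1:
            # A single point (a dot) still leaves a mark.
            stamp(round(stroke[0][0]), round(stroke[0][1]))
        for (x0, y0), (x1, y1) in zip(stroke, stroke[1:]):
            # Sample densely enough that neighbouring samples are
            # less than one pixel apart along each axis.
            steps = int(max(abs(x1 - x0), abs(y1 - y0))) + 1
            for i in range(steps + 1):
                t = i / steps
                stamp(round(x0 + t * (x1 - x0)), round(y0 + t * (y1 - y0)))
    return image


if __name__ == "__main__":
    # Hypothetical strokes: one diagonal line and one isolated dot.
    demo = render_strokes([[(1, 1), (6, 4)], [(8, 1)]], width=10, height=6, radius=0)
    for row in demo:
        print("".join("#" if v else "." for v in row))
```

A larger `radius` gives thicker strokes, which may matter when the rendered images are later downsampled.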

The data is available only for research use.

Related Links

Related Papers

  • Elastic Matching of Online Handwritten Tamil and Telugu Scripts Using Local Features, Prashanth L., Jagadeesh Babu V., Raghunath Sharma R., Prabhakara Rao G.V., Dinesh Mandalapu, 9th International Conference on Document Analysis and Recognition (ICDAR 2007), Curitiba, Brazil, Sept 23-26, 2007

Downloads

Downloading the dataset implies that you have understood and accepted the terms of the license agreement.

hpl-telugu-iso-char

Complete dataset containing approximately 270 samples per character.

hpl-telugu-iso-char-train

Subset of approximately 170 samples per character that may be used as the training set.

hpl-telugu-iso-char-test

Subset of approximately 60 samples per character that may be used as the test set.

 




 
Copyright © 2002-2013 Hewlett-Packard Company.
For problems or questions regarding this Web site contact liptk-dev AT lists.sourceforge.net.
Last updated: 06/21/13.