AHWR‑Net: offline handwritten amharic word recognition using convolutional recurrent neural network

Abstract

Amharic is the ofcial language of the Federal Government of Ethiopia, with more than 27 million speakers. It uses an Ethiopic script, which has 238 core and 27 labialized characters. It is a low-resourced language, and a few attempts have been made so far for its handwritten text recognition. However, Amharic handwritten text recognition is challenging due to the very high similarity between characters. This paper presents a convolutional recurrent neural networks based ofine handwritten Amharic word recognition system. The proposed framework comprises convolutional neural networks (CNNs) for feature extraction from input word images, recurrent neural network (RNNs) for sequence encoding, and connectionist temporal classifcation as a loss function. We designed a custom CNN model and compared its performance with three diferent state-of-the-art CNN models, including DenseNet-121, ResNet-50 and VGG-19 after modifying their architectures to ft our problem domain, for robust feature extraction from handwritten Amharic word images. We have conducted detailed experiments with diferent CNN and RNN architectures, input word image sizes, and applied data augmentation techniques to enhance performance of the proposed models. We have prepared a handwritten Amharic word dataset, HARD-I, which is available publicly for researchers. From the experiments on various recognition models using our dataset, a WER of 5.24 % and CER of 1.15 % were achieved using our best-performing recognition model. The proposed models achieve a competitive performance compared to existing models for ofine handwritten Amharic word recognition.

Publication
SN Applied Sciences
Date
Links