Domain Based Named Entity Recognition Using Naive Bayes Classification

Australian Journal of Basic and Applied Sciences 10(2), 2016

6 Pages Posted: 8 Jul 2017

See all articles by G. S. Mahalakshmi

G. S. Mahalakshmi

Anna University - Department of Computer Science and Engineering

Betina Antony J

Anna University - Department of Computer Science and Engineering

Bagawathi Roshini S

Anna University - Department of Computer Science and Engineering

Date Written: January 8, 2016

Abstract

In this paper we propose a methodology to perform named entity recognition on Tamil text which speaks about various temples in Tamil Nadu. The approach is to preprocess the text by tokenizing, parse the text to find the parts of speech and to perform named entity recognition as a classification problem with the help of Traditional Naïve Bayes algorithm. The statistical processing framework makes use of the dictionary created from the training data which belongs to predefined labels of named entities. Our research primarily focused on the domain specific named entities (Temple name, location swami name), temporal entities (date and time) and numbers. Our experiments on the presented system provided us with desirable results.

Keywords: Named Entity Recognition Syntactic parsing Temple domain Naïve Bayes

Suggested Citation

Mahalakshmi, G. S. and Antony J, Betina and Roshini S, Bagawathi, Domain Based Named Entity Recognition Using Naive Bayes Classification (January 8, 2016). Australian Journal of Basic and Applied Sciences 10(2), 2016, Available at SSRN: https://ssrn.com/abstract=2792117

G. S. Mahalakshmi

Anna University - Department of Computer Science and Engineering

Chennai, Tamil Nadu
India

Betina Antony J (Contact Author)

Anna University - Department of Computer Science and Engineering ( email )

Chennai, Tamil Nadu
India

Bagawathi Roshini S

Anna University - Department of Computer Science and Engineering

Chennai, Tamil Nadu
India

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
135
Abstract Views
542
Rank
386,565
PlumX Metrics