Part of Speech Tagging Using Hidden Markov Models

Adrian Bărbulescu; Daniel I. Morariu

Journal ArticleOPEN ACCESS

Part of Speech Tagging Using Hidden Markov Models

Bărbulescu A
Morariu D

International Journal of Advanced Statistics and IT&C for Economics and Life Sciences (2020) 10(1) 31-42

DOI: 10.2478/ijasitels-2020-0005

N/ACitations

9Readers

Abstract

In this paper, we present a wide range of models based on less adaptive and adaptive approaches for a PoS tagging system. These parameters for the adaptive approach are based on the n-gram of the Hidden Markov Model, evaluated for bigram and trigram, and based on three different types of decoding method, in this case forward, backward, and bidirectional. We used the Brown Corpus for the training and the testing phase. The bidirectional trigram model almost reaches state of the art accuracy but is disadvantaged by the decoding speed time while the backward trigram reaches almost the same results with a way better decoding speed time. By these results, we can conclude that the decoding procedure it’s way better when it evaluates the sentence from the last word to the first word and although the backward trigram model is very good, we still recommend the bidirectional trigram model when we want good precision on real data.

Cite

CITATION STYLE

APA

Bărbulescu, A., & Morariu, D. I. (2020). Part of Speech Tagging Using Hidden Markov Models. International Journal of Advanced Statistics and IT&C for Economics and Life Sciences, 10(1), 31–42. https://doi.org/10.2478/ijasitels-2020-0005

Part of Speech Tagging Using Hidden Markov Models

Abstract

Cite

Register to see more suggestions