Application of Logistic Regression in Natural Language Processing

  • Bhartendoo Vimal
N/ACitations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

Data Delicacy is one of the most important issue now a days, not only storing it requires a lot of storage space but even kills a lot of time. These generally happens in online discussion forums mostly, Quora is one of them. This paper gives an insight into handling the problem of actual duplication of questions. So, to overcome the problem Quora issued a public dataset in which users were asked to give a solution to their problem which should be time efficient and should categorize the dataset as duplicate or non-duplicate. NLP is the most important part to carry out this paper which helps in stemming of the question. Similarly, Tf-idf word vector helps in conversion of words and characters into computer understandable format. Since classifying characters by computers is not an easy task unless it's converted to binary. Then lastly applying Logistic Regression to train a model which will classify the next set of questions from itself. So, after this the forums will have cheaper data storage-storing less questions, Improved customer experience-faster responses to questions, Re-use content-if a question has been answered before it is very efficient to use the same answer for a similar question.

Cite

CITATION STYLE

APA

Bhartendoo Vimal. (2020). Application of Logistic Regression in Natural Language Processing. International Journal of Engineering Research And, V9(06). https://doi.org/10.17577/ijertv9is060095

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free