A Machine Learning Model to Identify Duplicate Questions in Social Media Forums


Abstract

In recent years, online question-and-answer forums have attracted a growing number of users. Many discussions on these forums are repetitive in nature. Quora released a dataset of such duplicate questions as a competition on Kaggle. The dataset requires considerable preprocessing before machine learning models can be trained on it with good accuracy. This preprocessing includes feature extraction, vectorization, and tokenization, after which the data is ready for training the desired models. Analyzing each model's predictions yields information about its efficiency and other factors. This information is then compared across models to help choose the best one. The models can later be combined and used as a single model with the best accuracy. In this paper, a machine learning model that predicts duplicate questions is proposed.
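
The preprocessing steps named in the abstract (tokenization, vectorization, feature extraction) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the choice of bag-of-words counts, the cosine and Jaccard features, and the 0.5 threshold are all assumptions made for illustration, and the threshold rule is only a stand-in for a trained model.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def vectorize(tokens):
    """Build a bag-of-words count vector (token -> count)."""
    return Counter(tokens)


def cosine_similarity(vec_a, vec_b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(vec_a[t] * vec_b[t] for t in vec_a.keys() & vec_b.keys())
    norm_a = math.sqrt(sum(v * v for v in vec_a.values()))
    norm_b = math.sqrt(sum(v * v for v in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def jaccard(tokens_a, tokens_b):
    """Share of distinct tokens that the two questions have in common."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def extract_features(q1, q2):
    """Turn a question pair into a small dictionary of numeric features."""
    t1, t2 = tokenize(q1), tokenize(q2)
    return {
        "cosine": cosine_similarity(vectorize(t1), vectorize(t2)),
        "jaccard": jaccard(t1, t2),
        "len_diff": abs(len(t1) - len(t2)),
    }


def predict_duplicate(q1, q2, threshold=0.5):
    """Hypothetical stand-in for a trained classifier: flag the pair as a
    duplicate when the cosine feature reaches an assumed threshold."""
    return extract_features(q1, q2)["cosine"] >= threshold
```

In practice, the feature dictionaries would be fed to one or more trained classifiers whose accuracies are then compared, as the abstract describes; the fixed threshold here only shows where such a model would plug in.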

Cite

Panda, S. K., Bhalerao, V., & AR*, S. (2020). A Machine Learning Model to Identify Duplicate Questions in Social Media Forums. International Journal of Innovative Technology and Exploring Engineering, 9(4), 370–373. https://doi.org/10.35940/ijitee.d1362.029420