Abstract
The system identifies a duplicate record from the database using the machine learning method. We must pass unstructured data. Data are prepared using any natural language processing technique such as text similarity. This prepared data is then fed into the latest machine learning method called Random Forest. After this data collection, using these files, the target file is compared to the source file. We make input and output files. This is carried out until accurate efficiency is generated.
Author supplied keywords
Cite
CITATION STYLE
Vikas, S., & Thimmaraju, S. N. (2019). Effective compatibility and reduction of data for bigdata applications. International Journal of Engineering and Advanced Technology, 9(1), 3781–3784. https://doi.org/10.35940/ijeat.A9821.109119
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.