Multimodal Deep Learning Framework for Book Recommendations: Harnessing Image Processing with VGG16 and Textual Analysis via LSTM-Enhanced Word2Vec

2Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

In the contemporary digital age, an intensified emphasis has been placed on the research of book recommendation systems. Historically, these systems predominantly focused on readers' past preferences, overlooking the inherent characteristics of the book's content and design. To address this gap, a novel algorithm, leveraging both multimodal image processing and deep learning, was designed. Features from book cover images were extracted using the VGG16 model, while textual attributes were discerned through a combination of the Word2Vec model and LSTM neural networks. The integration of the CBAM attention mechanism culminated in the creation of a modality-weighted feature fusion module, facilitating the dynamic allocation of feature weights. Furthermore, an objective function for this recommendation model was formulated, ensuring the enhancement of its performance during the training phase. This study not only presents a groundbreaking methodology to amplify the efficacy and resilience of book recommendation systems but also broadens understanding in the realm of multimodal information processing within deep learning-based recommendation platforms.

References Powered by Scopus

Automating readers' advisory to make book recommendations for K-12 readers

40Citations
N/AReaders
Get full text

Integrating image and textual information in human–robot interactions for children with autism spectrum disorder

25Citations
N/AReaders
Get full text

Can book covers help predict bestsellers using machine learning approaches?

14Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Exploiting diffusion-based structured learning for item interactions representations in multimodal recommender systems

0Citations
N/AReaders
Get full text

Leveraging Deep Learning for Personalized Book Recommendations: A Big Data Algorithm Combining Capsule Networks and Attention Mechanisms

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Li, Y., Li, X., & Zhao, Q. (2023). Multimodal Deep Learning Framework for Book Recommendations: Harnessing Image Processing with VGG16 and Textual Analysis via LSTM-Enhanced Word2Vec. Traitement Du Signal, 40(4), 1367–1376. https://doi.org/10.18280/ts.400406

Readers' Discipline

Tooltip

Engineering 1

100%

Save time finding and organizing research with Mendeley

Sign up for free