Enhancing Transformer-based Cooking Recipe Generation Models from Text Ingredients

2Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

Recipe generation is an important task in both research and real life. In this study, we explore several pretrained language models that generate recipes from a list of text-based ingredients. Our recipe-generation models use a standard self-attention mechanism in Transformer and integrate a re-attention mechanism in Vision Transformer. The models were trained using a common paradigm based on cross-entropy loss and the BRIO paradigm combining contrastive and cross-entropy losses to achieve the best performance faster and eliminate exposure bias. Specifically, we utilize a generation model to produce N recipe candidates from ingredients. These initial candidates are used to train a BRIO-based recipe-generation model to produce N new candidates, which are used for iteratively fine-tuning the model to enhance the recipe quality. We experimentally evaluated our models using the RecipeNLG and CookingVN-recipe datasets in English and Vietnamese, respectively. Our best model, which leverages BART with re-attention and is trained using BRIO, outperforms the existing models.

Cite

CITATION STYLE

APA

Lam, K. N., Nguyen, M. K. T., Nguyen, H. T., Huynh, V. T., Le, V. L., & Kalita, J. (2024). Enhancing Transformer-based Cooking Recipe Generation Models from Text Ingredients. Journal of Information and Communication Convergence Engineering, 22(4), 288–295. https://doi.org/10.56977/jicce.2024.22.4.288

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free