FLAME: Free-Form Language-Based Motion Synthesis & Editing

124Citations
Citations of this article
50Readers
Mendeley users who have this article in their library.

Abstract

Text-based motion generation models are drawing a surge of interest for their potential for automating the motion-making process in the game, animation, or robot industries. In this paper, we propose a diffusion-based motion synthesis and editing model named FLAME. Inspired by the recent successes in diffusion models, we integrate diffusion-based generative models into the motion domain. FLAME can generate highfdelity motions well aligned with the given text. Also, it can edit the parts of the motion, both frame-wise and joint-wise, without any fne-tuning. FLAME involves a new transformerbased architecture we devise to better handle motion data, which is found to be crucial to manage variable-length motions and well attend to free-form text. In experiments, we show that FLAME achieves state-of-the-art generation performances on three text-motion datasets: HumanML3D, BABEL, and KIT. We also demonstrate that FLAME’s editing capability can be extended to other tasks such as motion prediction or motion in-betweening, which have been previously covered by dedicated models.

Cite

CITATION STYLE

APA

Kim, J., Kim, J., & Choi, S. (2023). FLAME: Free-Form Language-Based Motion Synthesis & Editing. In Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023 (Vol. 37, pp. 8255–8263). AAAI Press. https://doi.org/10.1609/aaai.v37i7.25996

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free