Taming the chaos: Exploring graphical input vector manipulation user interfaces for GANs in a musical context

5Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Generative Adversarial Networks (GANs) are a widely used tool for generating highly realistic artificial data. As the output of these networks can show high diversity and novelty, GANs have the potential to be used as creative tools. However, using GANs in this context poses major challenges due to their unpredictability and lack of controllability, making it difficult for creative people to realize their artistic vision. To address this problem, we present two graphical user interfaces that visually order the (otherwise chaotic) latent input space of a GAN that was trained to generate drum samples. Further, these GUIs provide convergent search functions that allow users to fine-tune generated sounds. By doing so, we provide the ability to create sounds more purposefully to sound-affine users such as musicians or sound engineers. Additionally, we present the results of a user study that we conducted in order to explore our approach in accuracy-oriented and creative tasks. Our results indicate that usability and pragmatic qualities play a more important role for users than aesthetic-oriented aspects. Although not improving the accuracy within reproductive tasks, we observed that convergent search functions, if available, were used significantly more often than divergent/randomized search functions.

References Powered by Scopus

A style-based generator architecture for generative adversarial networks

6831Citations
N/AReaders
Get full text

Construction and evaluation of a user experience questionnaire

1404Citations
N/AReaders
Get full text

Learning to generate chairs with convolutional neural networks

449Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Flow with the Beat! Human-Centered Design of Virtual Environments for Musical Creativity Support in VR

11Citations
N/AReaders
Get full text

ASMRcade: Interactive Audio Triggers for an Autonomous Sensory Meridian Response

1Citations
N/AReaders
Get full text

Intercategorical Label Interpolation for Emotional Face Generation with Conditional Generative Adversarial Networks

1Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Schlagowski, R., Mertes, S., & André, E. (2021). Taming the chaos: Exploring graphical input vector manipulation user interfaces for GANs in a musical context. In ACM International Conference Proceeding Series (pp. 216–223). Association for Computing Machinery. https://doi.org/10.1145/3478384.3478411

Readers over time

‘22‘23‘2400.751.52.253

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 2

50%

Professor / Associate Prof. 1

25%

Lecturer / Post doc 1

25%

Readers' Discipline

Tooltip

Computer Science 2

50%

Design 1

25%

Arts and Humanities 1

25%

Save time finding and organizing research with Mendeley

Sign up for free
0