A Mapping Strategy for Interacting with Latent Audio Synthesis Using Artistic Materials
Abstract
This paper presents a mapping strategy for interacting with the latent spaces of generative AI models. Our approach uses unsupervised feature learning to encode a human control space and maps it to an audio synthesis model's latent space. To demonstrate how this mapping strategy can turn high-dimensional sensor data into control mechanisms for a deep generative model, we present a proof-of-concept system that uses visual sketches to control an audio synthesis model. We draw on emerging discourses in XAIxArts to discuss how this approach can contribute to XAI in artistic and creative contexts. We also discuss its current limitations and propose future research directions.
- Publication: arXiv e-prints
- Pub Date: July 2024
- DOI: 10.48550/arXiv.2407.04379
- arXiv: arXiv:2407.04379
- Bibcode: 2024arXiv240704379Z
- Keywords: Computer Science - Sound; Computer Science - Human-Computer Interaction; Electrical Engineering and Systems Science - Audio and Speech Processing