Voice Scroll: Real-time Voice to Scrolling Image for Performance

Real-time Generation of Panoramic scenes from Voice using a custom Stable Diffusion pipeline (2023)

We have extended our exploration of text to scrolling image to work with the latest version of Stable Diffusion and adapted it to our performance-based projects. Here is a sneak peek of a very casual interaction with the most recent version. This is the direct output, responding in real time to the spoken words: no editing and no cherry-picking.

In July, we used the most recent version of this system in a collaboration with American poet Nick Flynn. You can see some of the preliminary results here.
