• Skip to main content

BMO LAB

Creative Lab for the Arts, Performance, Emerging Technologies and AI

  • Home
  • About
    • A. I., Performance, and the Arts at BMO Lab
  • Highlights
  • Courses
    • Winter 2023
    • Winter 2022
    • Winter 2021
    • Fall 2019
    • 2018
    • Courses Summary
  • Research
  • Events & News
    • Can Stage BMO Lab Residency
    • Diagonal – Speaker Series and Reading Group
    • Radium AI and Art Residence Program: Discovering New Sites of Creativity
    • Events and News Summary
  • Lab
  • People
  • Info
  • Supporters & Partners

Mar 03 2023

Voice Scroll: Real-time Voice to Scrolling Image for Performance

Real-time Generation of Panoramic scenes from Voice using a custom Stable Diffusion pipeline (2023)

We have extended our exploration of text to scrolling image to work with the latest version of Stable Diffusion and adapting it to our particular performance-based projects. Here is a sneak peak of a very casual interaction with the most recent version. This is the direct output responding in real-time to the spoken words.. no editing and no cherry-picking.




In July, we worked with the most recent version of this system with American poet Nick Flynn. You can see some of the preliminary results here.

Written by David Rokeby · Categorized: Blog, Highlights

Reader Interactions

Comments

  1. Harry Hart says

    December 26, 2023 at 12:12 pm

    Is this technology available for use or under license perhaps? Very interested in learning more.

    Reply

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

copyright - BMO Lab for Creative Research in the Arts, Performance, Emerging Technologies and AI