Logo
    Search

    #20 - Making Music and Art Through Machine Learning - Doug Eck of Magenta

    en-usJuly 21, 2017

    Podcast Summary

    • Exploring new possibilities in machine learning with MagentaMagenta, a Google project, helps artists push boundaries in machine learning by providing innovative tools and models. Embrace the 'sound of failure' and unique challenges for new mediums' signature.

      Magenta, a Google project led by research scientist Doug Eck, aims to create open source tools and models that help artists and creatives explore new possibilities in machine learning. Eck emphasizes the importance of embracing the "sound of failure" and pushing the boundaries of new mediums, as seen in the evolution of art and technology throughout history. Magenta's current projects include NSYTH, which generates new sounds by manipulating a latent space, and ongoing work on improving music sequence generation models. The ultimate goal is to provide artists with innovative ways to create, explore, and experiment with machine learning. The quote "Whatever you now find weird, ugly, uncomfortable, and nasty about a new medium will surely become its signature" highlights the importance of embracing the unique challenges and limitations of new mediums, rather than trying to avoid them.

    • Creating expressive music with AI: Challenges and solutionsMagenta team focuses on creating high-quality music with AI, faces challenges in evaluating model quality, explores user feedback via web app, and aims to improve user-friendliness and integration with music software.

      The team behind Magenta is dedicated to creating high-quality, expressive music generated by AI, focusing on aspects like expressive timing, dynamics, and polyphony. However, they face the challenge of evaluating the quality of their models and determining what constitutes "good" music. The team is exploring the idea of creating a web app for users to interact with and provide feedback on the generated music, which could help improve the models. They also acknowledge the need to make their tools more user-friendly and integrate with existing music production software. Despite the challenges, the team is committed to making significant contributions to machine learning by learning from human feedback and improving their models accordingly. They are currently working on solving obvious problems, such as real-time IO and integration with software like Ableton, and aim to provide musicians with useful and fluid tools for generating and playing with sound.

    • AI and Creativity: Intersection of Art and Music with New DiscoveriesAI is revolutionizing creative fields like music and art with tools like Sketch RNN and Google's NSynth, enabling new discoveries, unusual classes, and experimentation.

      The intersection of artificial intelligence and creative fields like music and art is leading to intriguing discoveries and innovations. For instance, Sketch RNN, a recurrent neural network trained on sketches, has enabled artists to generate new drawings, explore unusual classes, and even use the raw data for experimentation. Although the quick draw data has limitations due to the short timeframe for creation, it has still sparked interest among artists. On the music side, Google's NSynth project has generated new and unique sounds by sampling the space between different instruments. These sounds, which capture harmonic properties and essence of traditional instruments, have fascinated musicians due to their novelty and musicality. Even when the model fails to perfectly reproduce sounds, the resulting confusion is still driven by musical elements. The collaboration between AI and creative professionals has led to unexpected discoveries and a renewed appreciation for the unique properties of these technologies. While there are limitations to what can be achieved, the potential for innovation and artistic expression is vast.

    • Exploring AI-generated music with RNN modelsMusicians found interacting with simple RNN models for music generation more enjoyable and effective through call and response approach, bringing their musical talent to the table.

      Even primitive AI models can inspire creativity and engagement when used by musically skilled individuals. The discussion revolved around the use of simple RNN models for music generation, such as the AI duet web application. Instead of expecting the model to carry long arcs of melody, musicians discovered that interacting with the model by following its lead and improvising was more enjoyable and effective. This call and response approach challenged musicians to understand the model's behavior and adapt to it, bringing their musical talent to the table. As the models improve with advanced techniques like generative adversarial feedback, they are expected to become more collaborative tools for generating new musical ideas. Despite the current limitations, the exploration of AI-generated music has been a fascinating and fun experience for both the creators and the users.

    • The persistence and dedication of pioneers led to breakthroughs in LSTM researchLSTM, a recurrent neural network, saw significant advancements due to the persistence of researchers like Alex Graves, faster machines, and larger memory, enabling it to absorb and process larger datasets, and remains a cornerstone for recurrent models in time series analysis and forms the basis for translation models.

      The Long Short-Term Memory (LSTM) model, which is a type of recurrent neural network, has seen significant advancements since its inception in the early 2000s. Alex Graves, one of the pioneers who worked on LSTM during its infancy, persisted in his research despite initial challenges and eventually led to breakthroughs in speech and language applications. The model's effectiveness was largely due to the availability of faster machines and larger memory, enabling it to absorb and process larger datasets. Although LSTM has evolved and other models have emerged, it remains a cornerstone for recurrent models in time series analysis and forms the basis for translation models. The success of LSTM can be attributed to the persistence and dedication of a few pioneers who pushed the boundaries of what was possible with the technology of the time. Despite its limitations in handling long time scale hierarchical patterning, researchers continue to explore its potential in areas such as text generation and music creation. The history of LSTM serves as a reminder of the importance of perseverance in research and the role that technology advancements play in unlocking the full potential of innovative models.

    • Exploring AI's role in creating art and literatureAI models can generate patterns and structures, but not yet fully replicate human creations. They learn to identify important elements and generate caricatures, becoming part of the creative toolkit.

      We're witnessing the emergence of AI models that can generate patterns and structures, much like musical chords or the principal axes of variance in art, but they're not yet capable of fully replicating the depth and complexity of human creations. These models learn to identify the most important elements and generate caricatures rather than perfect samples. As Magenta progresses, AI and machine learning are expected to become part of the creative toolkit, allowing us to generate plots, jokes, and art that might be difficult for humans to create but still resonate with us. However, we should remember that the future of AI is uncertain, and our role is to build intelligent tools while acknowledging the limitations of our understanding. Some might find this answer boring, but the potential for AI in art and literature is vast and exciting. For instance, generative models could help us create intricate plots or generate jokes with surprising turns that fit in many different ways, adding a new dimension to our appreciation of these forms of art. Ultimately, the future of AI in art is a fascinating and open-ended question, and we should approach it with a sense of curiosity and humility, recognizing that we're just beginning to scratch the surface of what's possible.

    • AI in Creative Field: Tool or Replacement?AI should be seen as a tool for enhancing human creativity rather than a replacement, as the true value lies in the human interaction and experimentation with the technology.

      The development of AI in the creative field, such as music or art, raises important questions about the role of human creativity and the value of the creative process itself. While some may fear that AI could replace human artists, the consensus seems to be that it should be seen as a tool rather than a replacement. The true value lies not in the finished product generated by AI alone, but in the human interaction and experimentation with the technology. This was highlighted in the discussion about the use of drum machines, where the creativity comes from the user's ability to manipulate and work with the machine, not just pushing a button for a predetermined output. Additionally, the audience for such AI-generated art may come from the next generation of individuals who have grown up with technology and are open to new forms of expression. The cathartic feeling of creating music or art is an essential aspect of the process, and AI tools should be designed to facilitate and enhance this experience rather than replacing it.

    • Exploring AI for Art and Music Creation with MagentaMagenta, an open-source AI project, lacks creative freedom and interaction. The community desires a more open and hackable codebase, and is interested in reinforcement learning for more flexible and interesting results. Magenta is exploring the use of reinforcement learning as a critic for better AI-generated art and music.

      Magenta, an open-source machine learning project by Google, aims to enable creators to use AI for generating art and music. However, the current state of Magenta lacks the level of creative freedom and interaction that artists and coders crave. The speaker expressed a desire for a more open and hackable codebase, allowing for more creative exploration and innovation. The codebase's current state, while well-written and well-tested, is perceived as too brittle, making it difficult to both creatively and unintentionally break it. The creative coding community is interested in various directions, such as preserving digital art and exploring reinforcement learning. Reinforcement learning is particularly intriguing because it allows models to learn by receiving rewards for following certain rules or heuristics, rather than building the rules into the model itself. This approach, while slower to train, offers more flexibility and can lead to better and more interesting results. The speaker highlighted the importance of having a critic, like in Generative Adversarial Networks (GANs), which push the model out of its safe zone and help it generate better output. Magenta is exploring the use of reinforcement learning as a critic to provide rewards for following certain rules or heuristics, which could lead to more creative and effective AI-generated art and music.

    • Fine-tuning machine learning models for creative outputsMachine learning models can be adjusted with rewards or constraints to generate unique, creative outputs while staying grounded in real data.

      Machine learning models can be fine-tuned to generate creative outputs based on specific rewards or constraints, without being limited by traditional rules or guidelines. This process involves feeding the model with a sample from its output, evaluating it based on the desired reward, and adjusting the model to generate better results. For instance, a generative model for drawing could be trained to avoid straight lines by incorporating a reinforcement learning model that dislikes straight lines. Similarly, a music generator could be influenced to produce "shimmery" music by adding a reward function for that characteristic. The model remains grounded in the realness that comes from being trained on data, but the addition of reward functions allows for more flexibility and creativity. However, there are concerns about the potential impact of this technology on art and music, particularly with regards to the creation of the "perfect pop song." While some may appreciate the rawness and variety of non-predictable art, others will undoubtedly enjoy the polished and easily accessible outputs generated by machine learning. As technology continues to advance, it may become easier to produce high-quality creative outputs, but people may also seek out new challenges and complexities to explore. Overall, the use of machine learning and AI in creative fields offers exciting possibilities for innovation and new forms of expression.

    • Exploring complex rhythms with AI in music compositionAI technology can provide a solid rhythmic foundation for music, enabling artists to focus on expressive timing and musical texture.

      The use of advanced AI technology in music composition can provide a solid rhythmic foundation, allowing artists to explore more complex rhythmical elements in their work. However, the quest for creating the perfect pop song using AI is uncertain, and the future direction lies in developing models capable of handling longer, more intricate structures to enable composers to offload certain decisions and focus on expressive timing and musical texture. The speaker, himself, has been using music as a form of relaxation and experimenting with creative coding, but finds it challenging to fully commit to a project due to the demands of his work on the Magenta project. The potential impact of these advancements on the music industry is vast, offering new possibilities for both creators and audiences alike.

    • Magenta: Making Machine Learning Accessible for Creatives and MusiciansGoogle's Magenta project aims to create an API for real-time conversation between AI models and musicians using MIDI files, focusing on user-friendliness and expressiveness

      Magenta, an open-source project by Google, aims to make machine learning accessible for creatives and musicians. The team is working on creating an API that can facilitate real-time conversation between AI models and musicians using MIDI files. They're focusing on making the process more user-friendly and expressive, which is a challenging aspect of APIs. The core API for moving music around in real time and having a meaningful conversation between AI and musicians is already in place. However, there's more work to be done to get it right. If you're interested in creative coding or learning more about Magenta, the team encourages you to visit their website at magenta.tensorflow.org. They welcome feedback and have an active community where discussions take place. The project is ongoing, with research and community-building being the main priorities. To get started with Magenta, you can check out their website and explore the open issues and code. Remember, it might take some effort to make it work, but the team is there to support you. Stay tuned for updates and join the community to be a part of this exciting project.

    Recent Episodes from Y Combinator

    Consumer is back, What’s getting funded now, Immaculate vibes | Lightcone Podcast

    Consumer is back, What’s getting funded now, Immaculate vibes | Lightcone Podcast

    What's happening in startups right now and how can you get ahead of the curve? In this episode of the Lightcone podcast, we dive deep into the major trends we're seeing from the most recent batch of YC using data we've never shared publicly before. This is a glimpse into what might be the most exciting moment to be a startup founder ever. It's time to build. YC is accepting late applications for the Summer 24 batch: ycombinator.com/apply

    When Should You Trust Your Gut? | Dalton & Michael Podcast

    When Should You Trust Your Gut? | Dalton & Michael Podcast

    When you’re making important decisions as a founder — like what to build or how it should work — should you spend lots of time gathering input from others or just trust your gut? In this episode of Dalton & Michael, we talk more about this and how to know when you should spend time validating and when to just commit. Apply to Y Combinator: https://yc.link/DandM-apply Work at a Startup: https://yc.link/DandM-jobs

    Inside The Hard Tech Startups Turning Sci-Fi Into Reality | Lightcone Podcast

    Inside The Hard Tech Startups Turning Sci-Fi Into Reality | Lightcone Podcast

    YC has become a surprising force in the hard tech world, funding startups building physical products from satellites to rockets to electric planes. In this episode of Lightcone, we go behind the scenes to explore how YC advises founders on their ambitious startups. We also take a look at a number of YC's hard tech companies and how they got started with little time or money.

    Building AI Models Faster And Cheaper Than You Think | Lightcone Podcast

    Building AI Models Faster And Cheaper Than You Think | Lightcone Podcast

    If you read articles about companies like OpenAI and Anthropic training foundation models, it would be natural to assume that if you don’t have a billion dollars or the resources of a large company, you can’t train your own foundational models. But the opposite is true. In this episode of the Lightcone Podcast, we discuss the strategies to build a foundational model from scratch in less than 3 months with examples of YC companies doing just that. We also get an exclusive look at Open AI's Sora!

    Building Confidence In Yourself and Your Ideas | Dalton & Michael Podcast

    Building Confidence In Yourself and Your Ideas | Dalton & Michael Podcast

    One trait that many great founders share is conviction. In this episode of Dalton & Michael, we’ll talk about finding confidence in what you're building, the dangers of inaccurate assumptions, and a question founders need to ask themselves before they start trying to sell to anyone else. Apply to Y Combinator: https://yc.link/DandM-apply Work at a Startup: https://yc.link/DandM-jobs

    Stop Innovating (On The Wrong Things) | Dalton & Michael Podcast

    Stop Innovating (On The Wrong Things) | Dalton & Michael Podcast

    Startups need to innovate to succeed. But not all innovation is made equal and reinventing some common best practices could actually hinder your company. In this episode, Dalton Caldwell and Michael Seibel discuss the common innovation pitfalls founders should avoid so they can better focus on their product and their customers. Apply to Y Combinator: https://yc.link/DandM-apply Work at a Startup: https://yc.link/DandM-jobs

    Should Your Startup Bootstrap or Raise Venture Capital?

    Should Your Startup Bootstrap or Raise Venture Capital?

    Within the world of startups, you'll find lots of discourse online about the experiences of founders bootstrapping their startups versus the founders who have raised venture capital to fund their companies. Is one better than the other? Truth is, it may not be so black and white. Dalton Caldwell and Michael Seibel discuss the virtues and struggles of both paths. Apply to Y Combinator: https://yc.link/DandM-apply Work at a Startup: https://yc.link/DandM-jobs

    Related Episodes

    AI ate my IP!

    AI ate my IP!
    In this episode we speak to two AI visionaries to get the lowdown on AI, IP and creativity. Are the new tools friend or foe to our most human capabilities – the capability to create? And what does generative AI mean for the laws around intellectual property? Our guests: - Arun Sundararajan, NYU professor of Entrepreneurship and Technology & author of "The Sharing Economy: The End of Employment and the Rise of Crowd-Based Capitalism"​ - Ric Scheuss, Managing Director at TRO

    What Makes Amper’s AI Music Tools Different with CEO Drew Silverstein

    What Makes Amper’s AI Music Tools Different with CEO Drew Silverstein

    Amper’s AI-powered music composition tool, Score, helps content creators make music tailored to their projects, whether they’re seasoned musicians or novices. Amper Music CEO Drew Silverstein places Amper and AI music in context of a long history of algorithmic music, and an even longer history of increasing accessibility to the means of music creation. How will AI music composition technology transform the jobs—and careers—of musicmakers? What makes Amper’s sample library different from all the rest? How does Amper address the thorny legal issues around rights and splits when humans and AI collaborate? And if Score sounds like a powerful tool, wait until you hear about what Amper’s API can do! 

    Note that this episode was recorded in early March, before many music industry events (including Music Tectonics Meetups) were moved online, postponed, or cancelled.

    The Music Tectonics podcast goes beneath the surface of the music industry to explore how technology is changing the way business gets done. Visit MusicTectonics.com to learn more, and find us on Twitter, Facebook, and Instagram. Let us know what you think!

    Don’t Call It A Comeback! (Mobiuscast 4)

    Don’t Call It A Comeback! (Mobiuscast 4)
    Mobiuscast is an audio podcast where the collaborators of The Mobius Project, Dan & Jeremiah, discuss a variety of creativity-related topics.

    In this episode, Dan and Jeremiah reunite in podcast form after more than a decade to catch up on their creative journeys and dig into their current sci-fi narrative fiction story in development.

    Human Meme Special Edition: Google Gemini Music

    Human Meme Special Edition: Google Gemini Music
    Today, we are celebrating the release of Google's new AI Bot: Gemini. Our test of Gemini today was musical. We asked Gemini to create the lyrics for two songs for us this morning. What you will hear is based on a single prompt. No revision. No human interference. This is AI, at it's best, at work. The first song is a little Sugar Pop ditty about this Human Meme podcast. The second song, is a Swampy Blues tune about the same topic -- this Human Meme podcast! The lyric results were so stunning -- and leaps and tops above what Google Bard could ever create -- that we decided to offer this off-schedule edition of the Human Meme podcast for you right now!