{"id":13833,"date":"2023-08-02T14:24:50","date_gmt":"2023-08-02T14:24:50","guid":{"rendered":"http:\/\/scannn.com\/introducing-audiocraft-a-generative-ai-tool-for-audio-and-music\/"},"modified":"2023-08-02T14:24:50","modified_gmt":"2023-08-02T14:24:50","slug":"introducing-audiocraft-a-generative-ai-tool-for-audio-and-music","status":"publish","type":"post","link":"https:\/\/scannn.com\/lv\/introducing-audiocraft-a-generative-ai-tool-for-audio-and-music\/","title":{"rendered":"Introducing AudioCraft: A Generative AI Tool For Audio and Music"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p><span style=\"font-weight: 400;\">Imagine a professional musician being able to explore new compositions without having to play a single note on an instrument. Or a small business owner adding a soundtrack to their latest video ad on Instagram with ease. That\u2019s the promise of AudioCraft \u2014\u00a0our latest AI tool that generates high-quality, realistic audio and music from text.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">AudioCraft consists of three models: <\/span><a href=\"https:\/\/huggingface.co\/spaces\/facebook\/MusicGen\"><span style=\"font-weight: 400;\">MusicGen<\/span><\/a><span style=\"font-weight: 400;\">, <\/span><a href=\"https:\/\/felixkreuk.github.io\/audiogen\/\"><span style=\"font-weight: 400;\">AudioGen<\/span><\/a><span style=\"font-weight: 400;\"> and <\/span><a href=\"https:\/\/ai.meta.com\/blog\/ai-powered-audio-compression-technique\/\"><span style=\"font-weight: 400;\">EnCodec<\/span><\/a><span style=\"font-weight: 400;\">. MusicGen, which was trained with Meta-owned and specifically licensed music, generates music from text prompts, while AudioGen, which was trained on public sound effects, generates audio from text prompts. Today, we\u2019re excited to release an improved version of our EnCodec decoder, which allows higher quality music generation with fewer artifacts. 
We\u2019re also releasing our pre-trained AudioGen models, which let you generate environmental sounds and sound effects like a dog barking, cars honking, or footsteps on a wooden floor. And lastly, we\u2019re sharing all of the AudioCraft model weights and code.\u00a0<\/span><\/p>\n<p><a href=\"https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836\"><img fetchpriority=\"high\" decoding=\"async\" class=\"alignnone wp-image-39059 size-full\" src=\"https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836\" alt=\"Flow chart demonstrating how MusicGen and AudioGen work\" width=\"960\" height=\"836\" srcset=\"https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=2881 2881w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=300 300w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=768 768w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=1024 1024w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=1536 1536w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=2048 2048w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=1241 1241w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=689 689w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/08\/01_MG_AG.jpg?resize=960%2C836?w=1920 1920w\" sizes=\"(max-width: 960px) 100vw, 960px\" data-recalc-dims=\"1\"\/><\/a><\/p>\n<p><span style=\"font-weight: 400;\">We\u2019re open-sourcing these models, giving researchers and practitioners access so they can train their own models with their own datasets for the first time, and help advance the field of AI-generated audio and music.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">While we\u2019ve seen a lot of excitement around 
generative AI for images, video, and text, audio has seemed to lag a bit behind. There\u2019s some work out there, but it\u2019s highly complicated and not very open, so people aren\u2019t able to readily play with it. Generating high-fidelity audio of any kind requires modeling complex signals and patterns at varying scales. Music is arguably the most challenging type of audio to generate as it\u2019s composed of local and long-range patterns, from a suite of notes to a global musical structure with multiple instruments.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">The AudioCraft family of models is capable of producing high-quality audio with long-term consistency, and is easy to use. With AudioCraft, we simplify the overall design of generative models for audio compared to prior work in the field \u2014 giving people the full recipe to play with the existing models that Meta has been developing over the past several years while also empowering them to push the limits and develop their own models.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">AudioCraft works for music, sound, compression, and generation \u2014 all in the same place. Because it\u2019s easy to build on and reuse, people who want to build better sound generators, compression algorithms, or music generators can do it all in the same code base and build on top of what others have done.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Having a solid open source foundation will foster innovation and complement the way we produce and listen to audio and music in the future. With even more controls, we think MusicGen can turn into a new type of instrument \u2014 just like synthesizers when they first appeared.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">We see the AudioCraft family of models as tools that give musicians and sound designers inspiration and help them quickly brainstorm and iterate on their compositions in new ways. 
We can\u2019t wait to see what people create with AudioCraft.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Learn more about <a href=\"https:\/\/ai.meta.com\/blog\/audiocraft-musicgen-audiogen-encodec-generative-ai-audio\/\">AudioCraft on our AI blog<\/a>.\u00a0<\/span><\/p>\n<\/p><\/div>\n<p><a href=\"https:\/\/about.fb.com\/news\/2023\/08\/audiocraft-generative-ai-for-music-and-audio\/\">Source link<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Imagine a professional musician being able to explore new compositions without having to play a single note on an instrument. Or a small business owner adding a soundtrack to their latest video ad on Instagram with ease. That\u2019s the promise of AudioCraft \u2014\u00a0our latest AI tool that generates high-quality, realistic audio and music from text. [&hellip;]<\/p>\n","protected":false},"author":16,"featured_media":13834,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[123],"tags":[],"class_list":["post-13833","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-facebook"],"_links":{"self":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts\/13833","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/comments?post=13833"}],"version-history":[{"count":0,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts\/13833\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-jso
n\/wp\/v2\/media\/13834"}],"wp:attachment":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/media?parent=13833"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/categories?post=13833"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/tags?post=13833"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}