{"id":13559,"date":"2023-05-22T16:28:43","date_gmt":"2023-05-22T16:28:43","guid":{"rendered":"http:\/\/scannn.com\/preserving-the-worlds-language-diversity-through-ai\/"},"modified":"2023-05-22T16:28:43","modified_gmt":"2023-05-22T16:28:43","slug":"preserving-the-worlds-language-diversity-through-ai","status":"publish","type":"post","link":"https:\/\/scannn.com\/lv\/preserving-the-worlds-language-diversity-through-ai\/","title":{"rendered":"Preserving the World's Language Diversity Through AI"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<h2><span style=\"font-weight: 400\">Supporting Thousands of Languages<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Many of the world\u2019s languages are in danger of disappearing, and the limitations of current speech recognition and generation technology will only accelerate this trend. We want to make it easier for people to access information and use devices in their preferred language, and today we\u2019re announcing a series of artificial intelligence (AI) models that could help them do just that.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Massively Multilingual Speech (MMS) models expand text-to-speech and speech-to-text technology from around 100 languages to more than 1,100 \u2014 more than 10 times as many as before \u2014 and can also identify more than 4,000 spoken languages, 40 times more than before.<\/span><\/p>\n<p><a href=\"https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540\"><img fetchpriority=\"high\" decoding=\"async\" class=\"aligncenter size-full wp-image-38497\" src=\"https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540\" alt=\"\" width=\"960\" height=\"540\" srcset=\"https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=3840 3840w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=300 300w, 
https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=768 768w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=1024 1024w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=1536 1536w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=2048 2048w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=1920 1920w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=800 800w, https:\/\/about.fb.com\/wp-content\/uploads\/2023\/05\/01_Map.png?resize=960%2C540?w=2880 2880w\" sizes=\"(max-width: 960px) 100vw, 960px\" data-recalc-dims=\"1\"\/><\/a><\/p>\n<p><span style=\"font-weight: 400\">There are also many use cases for speech technology \u2014 from virtual and augmented reality technology to messaging services \u2014 that can be used in a person\u2019s preferred language and can understand everyone\u2019s voice.<\/span><\/p>\n<p><span style=\"font-weight: 400\">We\u2019re open-sourcing our models and code so that others in the research community can build on our work, help preserve the world\u2019s languages, and bring the world closer together.<\/span><\/p>\n<h2><span style=\"font-weight: 400\">Our Approach<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Collecting audio data for thousands of languages was our first challenge because the largest existing speech datasets cover 100 languages at most. To overcome this, we turned to religious texts, such as the Bible, that have been translated into many different languages and whose translations have been widely studied for text-based language translation research.<\/span><\/p>\n<p><span style=\"font-weight: 400\">These translations have publicly available audio recordings of people reading these texts in different languages. 
As part of the MMS project, we created a dataset of readings of the New Testament in more than 1,100 languages, which provided on average 32 hours of data per language.<\/span><\/p>\n<p><span style=\"font-weight: 400\">By considering unlabeled recordings of various other Christian religious readings, we increased the number of languages available to more than 4,000. While this data is from a specific domain and is often read by male speakers, our analysis shows that our models perform equally well for male and female voices. And while the content of the audio recordings is religious, our analysis shows that this doesn\u2019t bias the model to produce more religious language.<\/span><\/p>\n<h2><span style=\"font-weight: 400\">Going Forward<\/span><\/h2>\n<p><span style=\"font-weight: 400\">In the future, we want to increase MMS\u2019s coverage to support even more languages, and also tackle the challenge of handling dialects, which is often difficult for existing speech technology.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400\">Learn more about <a href=\"https:\/\/ai.facebook.com\/blog\/multilingual-speech-recognition-model\/\">MMS<\/a>.<\/span><\/p>\n<\/p><\/div>\n<p><script async defer crossorigin=\"anonymous\" src=\"https:\/\/connect.facebook.net\/en_US\/sdk.js#xfbml=1&#038;version=v5.0\"><\/script><br \/>\n<br \/><br \/>\n<br \/><a href=\"https:\/\/about.fb.com\/news\/2023\/05\/ai-massively-multilingual-speech-technology\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Supporting Thousands of Languages Many of the world\u2019s languages are in danger of disappearing, and the limitations of current speech recognition and generation technology will only accelerate this trend. 
We want to make it easier for people to access information and use devices in their preferred language, and today we\u2019re announcing a series of artificial [&hellip;]<\/p>\n","protected":false},"author":16,"featured_media":13560,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[123],"tags":[],"class_list":["post-13559","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-facebook"],"_links":{"self":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts\/13559","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/comments?post=13559"}],"version-history":[{"count":0,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts\/13559\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/media\/13560"}],"wp:attachment":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/media?parent=13559"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/categories?post=13559"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/tags?post=13559"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}