{"id":13549,"date":"2023-05-18T17:25:16","date_gmt":"2023-05-18T17:25:16","guid":{"rendered":"http:\/\/scannn.com\/reimagining-our-infrastructure-for-the-ai-age\/"},"modified":"2023-05-18T17:25:16","modified_gmt":"2023-05-18T17:25:16","slug":"reimagining-our-infrastructure-for-the-ai-age","status":"publish","type":"post","link":"https:\/\/scannn.com\/lv\/reimagining-our-infrastructure-for-the-ai-age\/","title":{"rendered":"Reimagining Our Infrastructure for the AI Age"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p><span style=\"font-weight: 400\">Our artificial intelligence (AI) compute needs will grow dramatically over the next decade as we break new ground in AI research, ship more cutting-edge AI applications and experiences across our family of apps, and build our long-term vision of the metaverse.<\/span><\/p>\n<p><span style=\"font-weight: 400\">We are executing on an ambitious plan to build the next generation of Meta\u2019s AI infrastructure and today, we\u2019re sharing some details on our progress.<\/span><\/p>\n<p><span style=\"font-weight: 400\">This includes our first custom silicon chip for running AI models, a new AI-optimized data center design and the second phase of our 16,000 GPU supercomputer for AI research. These efforts \u2014 and additional projects still underway \u2014 will enable us to develop larger, more sophisticated AI models and then deploy them efficiently at scale. 
AI is already at the core of our products, enabling <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/facebook-feed-improvements-ai-show-more-less\/\"><span style=\"font-weight: 400\">better personalization<\/span><\/a><span style=\"font-weight: 400\">, <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/responsible-ai-progress-meta-2022\/\"><span style=\"font-weight: 400\">safer and fairer products<\/span><\/a><span style=\"font-weight: 400\">, and <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/facebook-feed-improvements-ai-show-more-less\/\"><span style=\"font-weight: 400\">richer experiences<\/span><\/a><span style=\"font-weight: 400\"> while also helping businesses reach the audiences they care about most.<\/span><\/p>\n<p><span style=\"font-weight: 400\">We\u2019re even reimagining how we code by deploying CodeCompose, a generative AI-based coding assistant we developed to make our developers more productive throughout the software development lifecycle.<\/span><\/p>\n<p><span style=\"font-weight: 400\">By rethinking how we innovate across our infrastructure, we\u2019re creating a scalable foundation to power emerging opportunities in areas like <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/generative-ai-text-to-video\/\"><span style=\"font-weight: 400\">generative AI<\/span><\/a><span style=\"font-weight: 400\"> and the metaverse.<\/span><\/p>\n<h2><span style=\"font-weight: 400\">AI at the Heart of Our Infrastructure<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Since breaking ground on our first data center back in 2010, we\u2019ve built a global infrastructure that currently serves as the engine for the more than three billion people who use our family of apps every day. 
AI has been an important part of these systems for many years, from our <\/span><a href=\"https:\/\/engineering.fb.com\/2022\/10\/18\/open-source\/ocp-summit-2022-grand-teton\/\"><span style=\"font-weight: 400\">Big Sur<\/span><\/a><span style=\"font-weight: 400\"> hardware in 2015 to the development of <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/pytorch-builds-the-future-of-ai-and-machine-learning-at-facebook\/\"><span style=\"font-weight: 400\">PyTorch<\/span><\/a><span style=\"font-weight: 400\"> and our <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/ai-rsc\/\"><span style=\"font-weight: 400\">supercomputer for AI research<\/span><\/a><span style=\"font-weight: 400\">.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Now, we\u2019re advancing our infrastructure in exciting new ways:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400\"><b>MTIA (Meta Training and Inference Accelerator)<\/b><span style=\"font-weight: 400\">: This is our in-house, custom accelerator chip family targeting inference workloads. <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/meta-training-inference-accelerator-AI-MTIA\"><span style=\"font-weight: 400\">MTIA<\/span><\/a><span style=\"font-weight: 400\"> provides greater compute power and efficiency than CPUs, and it is customized for our internal workloads. By deploying both MTIA chips and GPUs, we\u2019ll deliver better performance, decreased latency, and greater efficiency for each workload.<\/span><\/li>\n<li style=\"font-weight: 400\"><b>Next-Gen Data Center<\/b><span style=\"font-weight: 400\">: Our next-generation data center design will support our current products while enabling future generations of AI hardware for both training and inference. This new data center will be an AI-optimized design, supporting liquid-cooled AI hardware and a high-performance<\/span> <span style=\"font-weight: 400\">AI network connecting thousands of AI chips together for data center-scale AI training clusters. 
It will also be faster and more cost-effective to build, and it will complement other new hardware such as our first in-house-developed ASIC solution, <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/meta-scalable-video-processor-MSVP\"><span style=\"font-weight: 400\">MSVP,<\/span><\/a><span style=\"font-weight: 400\"> which is designed to power the constantly growing video workloads at Meta.<\/span><\/li>\n<li style=\"font-weight: 400\"><b>Research SuperCluster (RSC) AI Supercomputer:<\/b><span style=\"font-weight: 400\"> Meta\u2019s <\/span><a href=\"https:\/\/ai.facebook.com\/blog\/supercomputer-meta-research-supercluster-2023\"><span style=\"font-weight: 400\">RSC<\/span><\/a><span style=\"font-weight: 400\">, which we believe is one of the fastest AI supercomputers in the world, was built to train the next generation of large AI models to power new augmented reality tools, content understanding systems, real-time translation technology and more. It features 16,000 GPUs, all accessible across the 3-level Clos network fabric that provides full bandwidth to each of the 2,000 training systems.<\/span><\/li>\n<\/ul>\n<h2><span style=\"font-weight: 400\">The Benefits of an End-to-End Integrated Stack<\/span><\/h2>\n<p><span style=\"font-weight: 400\">Custom-designing much of our infrastructure enables us to optimize an end-to-end experience from the physical layer to the virtual layer to the software layer to the actual user experience.<\/span><\/p>\n<p><span style=\"font-weight: 400\">We design, build and operate everything \u2014 from the data centers to the server hardware to the mechanical systems that keep everything running. Because we control the stack from top to bottom, we\u2019re able to customize it for our specific needs. For example, we can easily collocate GPUs, CPUs, network and storage if it will better support our workloads. 
If that means we need different power or cooling solutions as a result, we can rethink those designs as part of one cohesive system.<\/span><\/p>\n<p><span style=\"font-weight: 400\">This will be increasingly important in the years ahead. Over the next decade, we\u2019ll see increased specialization and customization in chip design, purpose-built and workload-specific AI infrastructure, new systems and tooling for deployment at scale, and improved efficiency in product and design support. All of this will deliver increasingly sophisticated models built on the latest research \u2014 and products that give people around the world access to this emerging technology.<\/span><\/p>\n<p><span style=\"font-weight: 400\">We\u2019re always focused on delivering long-term value and impact to guide our infrastructure vision. We believe our track record of building world-class infrastructure positions us to continue leading in AI over the next decade and beyond.<\/span><\/p>\n<p><span style=\"font-weight: 400\">Learn more about <\/span><a href=\"https:\/\/atscaleconference.com\/events\/meta-ai-infra-scale\/\"><span style=\"font-weight: 400\">our AI investments<\/span><\/a><span style=\"font-weight: 400\">.<\/span><\/p>\n<\/p><\/div>\n<p><script async defer crossorigin=\"anonymous\" src=\"https:\/\/connect.facebook.net\/en_US\/sdk.js#xfbml=1&#038;version=v5.0\"><\/script><br \/>\n<br \/><br \/>\n<br \/><a href=\"https:\/\/about.fb.com\/news\/2023\/05\/metas-infrastructure-for-ai\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Our artificial intelligence (AI) compute needs will grow dramatically over the next decade as we break new ground in AI research, ship more cutting-edge AI applications and experiences across our family of apps, and build our long-term vision of the metaverse. 
We are executing on an ambitious plan to build the next generation of Meta\u2019s [&hellip;]<\/p>\n","protected":false},"author":16,"featured_media":13550,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[123],"tags":[],"class_list":["post-13549","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-facebook"],"_links":{"self":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts\/13549","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/users\/16"}],"replies":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/comments?post=13549"}],"version-history":[{"count":0,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts\/13549\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/media\/13550"}],"wp:attachment":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/media?parent=13549"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/categories?post=13549"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/tags?post=13549"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}