{"id":20213,"date":"2025-04-09T13:14:06","date_gmt":"2025-04-09T13:14:06","guid":{"rendered":"https:\/\/scannn.com\/the-first-google-tpu-for-the-age-of-inference\/"},"modified":"2025-04-09T13:14:06","modified_gmt":"2025-04-09T13:14:06","slug":"the-first-google-tpu-for-the-age-of-inference","status":"publish","type":"post","link":"https:\/\/scannn.com\/lv\/the-first-google-tpu-for-the-age-of-inference\/","title":{"rendered":"The first Google TPU for the age of inference"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p data-block-key=\"jyf9e\">Today at Google Cloud Next 25, we\u2019re introducing Ironwood, our seventh-generation Tensor Processing Unit (TPU) \u2014 our most performant and scalable custom AI accelerator to date, and the first designed specifically for inference. For more than a decade, TPUs have powered Google\u2019s most demanding AI training and serving workloads, and have enabled our Cloud customers to do the same. Ironwood is our most powerful, capable and energy efficient TPU yet. And it&#8217;s purpose-built to power thinking, inferential AI models at scale.<\/p>\n<p data-block-key=\"9rrl9\">Ironwood represents a significant shift in the development of AI and the infrastructure that powers its progress. It\u2019s a move from <i>responsive<\/i> AI models that provide real-time information for people to interpret, to models that provide the <i>proactive<\/i> generation of insights and interpretation. This is what we call the \u201cage of inference\u201d where AI agents will proactively retrieve and generate data to collaboratively deliver insights and answers, not just data.<\/p>\n<p data-block-key=\"7kvmh\">Ironwood is built to support this next phase of generative AI and its tremendous computational and communication requirements. It scales up to 9,216 liquid cooled chips linked with breakthrough Inter-Chip Interconnect (ICI) networking spanning nearly 10 MW. It is one of several new components of Google Cloud AI Hypercomputer architecture, which optimizes hardware and software together for the most demanding AI workloads. With Ironwood, developers can also leverage Google\u2019s own Pathways software stack to reliably and easily harness the combined computing power of tens of thousands of Ironwood TPUs.<\/p>\n<p data-block-key=\"at188\">Here\u2019s a closer look at how these innovations work together to take on the most demanding training and serving workloads with unparalleled performance, cost and power efficiency.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/blog.google\/products\/google-cloud\/ironwood-tpu-age-of-inference\/\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Today at Google Cloud Next 25, we\u2019re introducing Ironwood, our seventh-generation Tensor Processing Unit (TPU) \u2014 our most performant and scalable custom AI accelerator to date, and the first designed specifically for inference. For more than a decade, TPUs have powered Google\u2019s most demanding AI training and serving workloads, and have enabled our Cloud customers [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":20214,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[100],"tags":[],"class_list":["post-20213","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-google"],"_links":{"self":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts\/20213","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/comments?post=20213"}],"version-history":[{"count":0,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/posts\/20213\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/media\/20214"}],"wp:attachment":[{"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/media?parent=20213"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/categories?post=20213"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/scannn.com\/lv\/wp-json\/wp\/v2\/tags?post=20213"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}