<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Silicon Co-Design: Digital AI SW Architecture]]></title><description><![CDATA[This section contains more digital and AI Software related topics]]></description><link>https://www.siliconcodesign.com/s/digital-ai-arch</link><image><url>https://substackcdn.com/image/fetch/$s_!WIKb!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a70d8c6-6b56-41f4-b513-b689b17a4d19_503x503.png</url><title>Silicon Co-Design: Digital AI SW Architecture</title><link>https://www.siliconcodesign.com/s/digital-ai-arch</link></image><generator>Substack</generator><lastBuildDate>Thu, 25 Jun 2026 00:46:26 GMT</lastBuildDate><atom:link href="https://www.siliconcodesign.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Chad Wallace]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[chadw@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[chadw@substack.com]]></itunes:email><itunes:name><![CDATA[Chad]]></itunes:name></itunes:owner><itunes:author><![CDATA[Chad]]></itunes:author><googleplay:owner><![CDATA[chadw@substack.com]]></googleplay:owner><googleplay:email><![CDATA[chadw@substack.com]]></googleplay:email><googleplay:author><![CDATA[Chad]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[From Atari to ChatGPT: The Technical and Corporate Forces Shaping Frontier AI ]]></title><description><![CDATA[How the intersection of scientific ambition, commercial urgency, and safety research birthed the modern scaling era.]]></description><link>https://www.siliconcodesign.com/p/a-brief-history-of-modern-ai-deepmind</link><guid isPermaLink="false">https://www.siliconcodesign.com/p/a-brief-history-of-modern-ai-deepmind</guid><dc:creator><![CDATA[Chad]]></dc:creator><pubDate>Wed, 29 Apr 2026 17:54:36 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/fcc0b821-27c2-4f6d-b4e3-b1750426a5f6_1201x673.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iEJ4!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iEJ4!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png 424w, https://substackcdn.com/image/fetch/$s_!iEJ4!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png 848w, https://substackcdn.com/image/fetch/$s_!iEJ4!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png 1272w, https://substackcdn.com/image/fetch/$s_!iEJ4!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iEJ4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png" width="407" height="601.370477568741" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1021,&quot;width&quot;:691,&quot;resizeWidth&quot;:407,&quot;bytes&quot;:1396370,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iEJ4!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png 424w, https://substackcdn.com/image/fetch/$s_!iEJ4!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png 848w, https://substackcdn.com/image/fetch/$s_!iEJ4!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png 1272w, https://substackcdn.com/image/fetch/$s_!iEJ4!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe5187456-2ee8-4525-9b42-3a609a0a72a1_691x1021.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Figure 1. </strong>&#8220;The Infinity Machine&#8221; by Sebastian Mallaby</figcaption></figure></div><p>Demis Hassabis is perhaps one of the most important influences in Modern AI that isn&#8217;t a household name.</p><p>Modern AI is not just a story of better models. It was shaped by the interaction between <strong>scientific ambition</strong>, <strong>commercial urgency</strong>, <strong>safety concerns</strong>, and the <strong>compute power</strong> available to model it. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.siliconcodesign.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Inside the Silicon Machine is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Every AI enthusiast should read <strong>&#8220;The Infinity Machine&#8221;</strong> to better understand the <strong>development of modern AI</strong>. It tells the story of <strong>Demis Hassabis</strong>, CEO of Google DeepMind, from his origins, to his founding of DeepMind and its acquisition by Google. </p><p>I&#8217;ll highlight a brief history of AI through the <strong>formative years</strong> of Demis Hassabis, and cover the <strong>key technical developments</strong> and <strong>organizational challenges</strong> along the path to modern AI, particularly around <strong>AI safety</strong>. </p><p>I&#8217;ll also dive into the key papers addressed in the book that influence modern AI today such as <strong>deep reinforcement learning,</strong> <strong>transformer architecture</strong>, and <strong>chain-of-thought prompting</strong> that approximate <strong>human intuition.</strong></p><h2>Demis Hassabis</h2><p>From his early days Demis Hassabis is a modern example of a modern day Polymath, with broad-ranging interests from sci-fi, philosophy, computer science, and neuroscience. </p><p>His entire career has been dedicated to developing <strong>artificial general intelligence (AGI)</strong>, a hypothetical type of AI that <strong>matches or exceeds human cognitive abilities </strong>across a wide range of tasks. His dedication to this mission has him referred to as an <strong>&#8220;authentic entrepreneur&#8221;</strong> by VCs because he is genuinely dedicated to a cause, as opposed to entrepreneurs fishing for ideas to strike it rich. </p><p>We&#8217;ll see through his formative years in video game development about how he distinguishes between early machine vs human intelligence:</p><ul><li><p><strong>Early machine intelligence</strong> uses deductive, first order reasoning to reason through problems </p></li><li><p><strong>Human intelligence </strong>generally uses induction or high level pattern recognition to identify shapes, sounds, and reason through sentences</p></li></ul><h4>Formative Years</h4><p>Demis had a fairly modest upbringing in North London. When he was young he was highly influenced by classic Sci-Fi: &#8220;Enders Game&#8221;, &#8220;Foundation&#8221;, &#8220;Godel, Escher, Bach&#8221; and Ian Banks &#8220;Culture&#8221; series were some of his early influences. He was a chess genius and video game enthusiast in his early days, frequently ruminating on what algorithms are needed to program a machine to play chess.</p><p>He won admission at University of Cambridge to study CS, but he was too young at the age of 16. During this time gap, he worked on video game development at  Bullfrog with Peter Molyneux to develop video games, including <em>Theme Park</em>.</p><p>After he graduated, he founded Elixir Studios in 1998, a London-based video games developer, and work on developing video games that focused on complex decisions on the user&#8217;s part, including <em>Republic: The Revolution</em> and <em>Evil Genius</em>. </p><p>Elixir Studios ultimately failed in April 2005, at which point he decided to pursue his PhD in neuroscience at University College London (UCL) from 2005 to 2009. His studies focused on the field of imagination, memory, and amnesia.</p><p>After he obtained his PhD, Demis Hassabis founded <strong>DeepMind Technologies </strong>in London in 2010, a ML AI startup with Shane Legg and Mustafa Suleyman. The goal is to <strong>combine insights from neuroscience with ML and computing hardware to create new algorithms to advance toward AGI</strong>. His company was successful at training a <strong>Deep Q-Network</strong> to play Atari games in 2013. [1]</p><p>In 2014, <strong>Google purchased DeepMind for $400 million</strong> where it remained a separate, siloed entity from Google. Since being purchased, it racked up a list of accomplishments from 2014 to 2020:</p><ul><li><p>It created a clinical alert system, <strong>Streams</strong>, for acute kidney injury in the UK. While well intended, it was controversial because of concerns over patient data governance.</p></li><li><p>It created <strong>AlphaGo</strong>, a program that played Go. It defeated world champion Lee Sedol in 2016 and several other high level players with moves that humans had very little intuition for.</p></li><li><p>It created <strong>AlphaFold</strong> and <strong>AlphaFold 2</strong> to take on the CASP protein-structure prediction competitions. AlphaFold 2 achieved a breakthrough at CASP14 in 2020. In 2021, DeepMind and EMBL-EBI launched the AlphaFold Protein Structure Database, and by 2022 it included predictions for more than <strong>200 million protein structures</strong>. Hassabis later won a Nobel Prize in Chemistry for his contribution, along with John Jumper and David Baker.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fCVV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fCVV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png 424w, https://substackcdn.com/image/fetch/$s_!fCVV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png 848w, https://substackcdn.com/image/fetch/$s_!fCVV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png 1272w, https://substackcdn.com/image/fetch/$s_!fCVV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fCVV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png" width="530" height="291.9157720891825" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:667,&quot;width&quot;:1211,&quot;resizeWidth&quot;:530,&quot;bytes&quot;:1513119,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!fCVV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png 424w, https://substackcdn.com/image/fetch/$s_!fCVV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png 848w, https://substackcdn.com/image/fetch/$s_!fCVV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png 1272w, https://substackcdn.com/image/fetch/$s_!fCVV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06afa233-a5f7-4520-b51c-91e4ed351094_1211x667.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 2. Nobel Prize winners for AlphaFold2. Source: https://www.science.org/content/article/protein-designer-and-structure-solvers-win-chemistry-nobel</figcaption></figure></div><h2>AI Safety: Commercial time-to-market vs Research Discovery </h2><p>While DeepMind racked up an impressive list of accomplishments in its early years, it entered a ferocious competition to develop LLMs against OpenAI, led by Sam Altman.</p><p>Much of the story that follows revolves around two concepts:</p><ul><li><p>AI safety</p></li><li><p>Research vs Commercial Environments</p></li></ul><p>I&#8217;ll cover the groundwork for each concept, then dive into the high level narrative of the competition between OpenAI and DeepMind.</p><h4>AI Safety </h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MTmi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MTmi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png 424w, https://substackcdn.com/image/fetch/$s_!MTmi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png 848w, https://substackcdn.com/image/fetch/$s_!MTmi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png 1272w, https://substackcdn.com/image/fetch/$s_!MTmi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MTmi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png" width="248" height="383" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:383,&quot;width&quot;:248,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:175993,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!MTmi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png 424w, https://substackcdn.com/image/fetch/$s_!MTmi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png 848w, https://substackcdn.com/image/fetch/$s_!MTmi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png 1272w, https://substackcdn.com/image/fetch/$s_!MTmi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4040ff8-45f0-4427-80c6-b03757b0c850_248x383.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 3. Cover of &#8220;I, Robot&#8221; by Isaac Asimov</figcaption></figure></div><p>Long before <strong>AI safety </strong>became a technical field, Sci-Fi gave the public a meaningful way to think about machine intelligence and control. &#8220;I, Robot&#8221; by Isaac Asimov in 1950, proposed the <strong>&#8220;the Three Laws of Robotics&#8221;</strong> that dictate that &#8220;robots must never harm humans, must obey orders unless they conflict with the first law, and must protect their own existence&#8221;. Additionally, classic Sci-Fi highlights the <strong>potential dangers of misuse of AI such as </strong>Terminator, Blade Runner, and Cyberpunk genres.</p><p>These examples influenced the development of AI where humans ensure that AI is safe and aligned with human <strong>intentions</strong>. Humans need to ensure that AI doesn&#8217;t deceive users by providing harmful / misleading information.</p><p>One way to test for AI safety is through <strong>adversarial testing</strong>, or <strong>&#8220;red teaming&#8221;</strong>, where <strong>human testers act as adversaries</strong> to intentionally break, deceive, or trigger harmful outputs from AI systems. Red teams simulate real-world attacks to identify safety, security, and ethical flaws, such as prompt injections, bias, or data leakage, before malicious actors exploit them. </p><p>As AI models <strong>scaled in complexity</strong>, they became <strong>harder to interpret</strong>. Inputs could be fed in during training and outputs could be evaluated during inference, but the<strong> internal representations became increasingly difficult to explain</strong>. AI can produce human-aligned outputs 99.9% of the time, but identifying the 0.1% of cases where it acts against human intentions can be difficult.</p><p>Red teaming alone does not guarantee total safety, but is one tool in a broad range of strategies to ensure that AI is aligned with human intentions. </p><h4>Research vs Commercial incentives </h4><p>In large tech corporations such as Google, there are two main categories of groups that contribute to business revenues:</p><ul><li><p><strong>Revenue-generating core business</strong> - Engineers who create products to <strong>take to market</strong> or <strong>maintain/update existing infrastructure</strong>. Here, work is considered to be <strong>low risk</strong> and <strong>execution on set priorities </strong>is important.</p></li><li><p><strong>Research and development - </strong>Researchers who work on<strong> high-risk</strong>, <strong>long-term,</strong> and <strong>ambitious</strong> ventures. In R&amp;D environments,<strong> ideas are incubated </strong>to either be published or folded into products to take to market.</p></li></ul><p>Organizations consist of groups that range from <strong>commercial readiness</strong> vs <strong>research focus</strong> that help sustain the revenue of the core business as well as bet on &#8220;moonshot&#8221; ideas that can generate revenue in the future. </p><p>In 2015, Google restructured under the umbrella holding company &#8220;Alphabet&#8221;, separating its core businesses from longer-term moonshot bets such as autonomous driving, fiber, and life sciences. This move positioned Google to overcome the <strong>&#8220;innovators dilemma&#8221;</strong> by incubating capabilities to enhance its core business or act as additional revenue generating streams in the future should outside disrupters arrive and threaten it. </p><p>DeepMind, HQed in London, was acquired by Google because its capabilities were potentially useful to the core business. However, it was <strong>heavily siloed </strong>from the main HQ in California at first. This siloing gave the researchers to environment to produce many of the key innovations but caused it to hesitate entering the market when commercial pressure arose. </p><p>OpenAI, on the other hand, is HQed in San Francisco, California, with a highly influential leader, Sam Altman, and the entire network and talent pool of the Valley nearby.</p><h4>OpenAI - Fast to Market, with Safety Governance Under Pressure</h4><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nVrk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nVrk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png 424w, https://substackcdn.com/image/fetch/$s_!nVrk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png 848w, https://substackcdn.com/image/fetch/$s_!nVrk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png 1272w, https://substackcdn.com/image/fetch/$s_!nVrk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nVrk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png" width="408" height="160" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:160,&quot;width&quot;:408,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:20643,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nVrk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png 424w, https://substackcdn.com/image/fetch/$s_!nVrk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png 848w, https://substackcdn.com/image/fetch/$s_!nVrk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png 1272w, https://substackcdn.com/image/fetch/$s_!nVrk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04152b1-b3c4-4aac-bcc3-486b31c0fcf5_408x160.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>OpenAI was founded in 2015 by Sam Altman, Elon Musk, Greg Brockman, and others as a non-profit with the primary goal of ensuring that artificial general intelligence (AGI) benefits all of humanity.</p><p>In 2020, OpenAI published <strong>&#8220;Language Models are Few-Shot Learners,&#8221;</strong> introducing <strong>GPT-3</strong>, a 175-billion-parameter autoregressive language model, building off of the <strong>&#8220;Attention is all you Need&#8221;</strong> paper published in 2017. This was one of the first truly major, <strong>large-scale Large Language Models (LLMs)</strong>.</p><p>However, as models were becoming more complex, <strong>AI safety</strong> became a concern. Former OpenAI employees including Dario and Daniela Amodei defected and founded <strong>Anthropic </strong>in 2021 to prioritize <strong>AI safety, alignment, and research.</strong></p><p>To improve AI safety, OpenAI released a paper in 2022, <strong>&#8220;Training language models to follow instructions with human feedback&#8221;</strong> which used <strong>Reinforcement Learning with Human Feedback</strong> (RLHF). It demonstrated that humans are able to &#8220;teach&#8221; AI outputs that aligns with humans intention so that the AI can be trained more effectively. This paper demonstrated improvements in how humans can &#8220;guide&#8221; AI to produce output that aligns with its intentions.</p><p>On <strong>November 30, 2022, </strong>ChatGPT was officially released by OpenAI. It was intended to be introduced as a &#8220;research preview&#8221; and fast tracked to not let Anthropic get ahead. It <strong>quickly went viral</strong>, reaching one million users within a few days of its launch, the fastest growing consumer application ever at the time (Threads would later surpass this).</p><p>About a year later, the board was concerned about Sam Altman&#8217;s handling of AI safety and other allegations. The board abruptly fired Altman on November 17, 2023, citing concerns about his candor. But the move triggered a rapid employee and investor backlash: nearly all OpenAI employees threatened to leave unless he was reinstated, and Altman returned as CEO on November 22, 2023 with a restructured board.</p><h4>Google DeepMind - Prioritizing AI safety, but Caught in the Innovators Dilemma</h4><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EHRh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EHRh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png 424w, https://substackcdn.com/image/fetch/$s_!EHRh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png 848w, https://substackcdn.com/image/fetch/$s_!EHRh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png 1272w, https://substackcdn.com/image/fetch/$s_!EHRh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EHRh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png" width="518" height="158" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:158,&quot;width&quot;:518,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:34715,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EHRh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png 424w, https://substackcdn.com/image/fetch/$s_!EHRh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png 848w, https://substackcdn.com/image/fetch/$s_!EHRh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png 1272w, https://substackcdn.com/image/fetch/$s_!EHRh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3ffa9c-59c2-4d08-82c7-c91bb7a678b9_518x158.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Google saw commercial potential in AI when it first acquired DeepMind because it <strong>recognized the strategic importance of AI</strong> in its core business. </p><p>At first, DeepMind&#8217;s culture leaned toward <strong>research depth</strong>, <strong>scientific credibility</strong>, and <strong>safety caution</strong>. It saw too much existential risk to let commercial priorities drive AI&#8217;s deployment, which made it slower to convert frontier models into consumer products. Demis wanted a path to spinout to a semi-independent Alphabet company, but ultimately didn&#8217;t due to the legal effort involved. </p><p>When OpenAI released GPT-3 in 2020, DeepMind did their own work on LLMs and published a few key papers, including Flamingo, Gato, and Chinchilla in 2022. In Sept 2022, it worked on Sparrow, DeepMind&#8217;s version of ChatGPT, which used RLHF. However, it was ultimately <strong>not released</strong> to the general public.</p><p>When OpenAI released ChatGPT, Google was briefly trapped by the <strong>innovators dilemma</strong> where it would put its brand at risk if it released an AI model too early and it hallucinated. During this time, a lot of DeepMind researchers were being poached by OpenAI who were frustrated by lack of progress by Google releasing a product. </p><p>On April 20, 2023, <strong>Google Brain </strong>(aimed at applied machine learning for Google products like TensorFlow) and <strong>DeepMind</strong> merged to form a single unit, <strong>Google DeepMind</strong>, to become one comprehensive unit.</p><p>Google DeepMind ultimately launched <strong>Gemini </strong>in Dec 2023 and upgraded versions that followed. While it is widely used nowadays and offers many benefits over ChatGPT (such as much larger context window for large documents), it ultimately suffered from the first mover advantage OpenAI had.</p><h2>Key Technical Developments in modern AI</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YtXF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YtXF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png 424w, https://substackcdn.com/image/fetch/$s_!YtXF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png 848w, https://substackcdn.com/image/fetch/$s_!YtXF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png 1272w, https://substackcdn.com/image/fetch/$s_!YtXF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YtXF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png" width="526" height="473.2554945054945" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1310,&quot;width&quot;:1456,&quot;resizeWidth&quot;:526,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YtXF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png 424w, https://substackcdn.com/image/fetch/$s_!YtXF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png 848w, https://substackcdn.com/image/fetch/$s_!YtXF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png 1272w, https://substackcdn.com/image/fetch/$s_!YtXF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2dd4607-596c-4762-b9cc-cdd1050a20de_2985x2685.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 4. Timeline of Evolution of LLMs. Source: https://toloka.ai/blog/history-of-llms/</figcaption></figure></div><p>With that institutional backdrop in place, the technical story becomes easier to understand. </p><p>Next I&#8217;ll dive deeper into the key technical developments along the way to modern AI <strong>referenced in the book</strong> that approximate <strong>human intuitions</strong>. This is by no means a complete list since this book does not cover the more recent &#8220;reasoning era&#8221; Agentic AI.</p><h4>Deep Reinforcement Learning (2013-2015) </h4><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zm3b!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zm3b!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png 424w, https://substackcdn.com/image/fetch/$s_!zm3b!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png 848w, https://substackcdn.com/image/fetch/$s_!zm3b!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png 1272w, https://substackcdn.com/image/fetch/$s_!zm3b!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zm3b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png" width="420" height="171.11111111111111" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:540,&quot;resizeWidth&quot;:420,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!zm3b!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png 424w, https://substackcdn.com/image/fetch/$s_!zm3b!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png 848w, https://substackcdn.com/image/fetch/$s_!zm3b!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png 1272w, https://substackcdn.com/image/fetch/$s_!zm3b!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8b72101e-ccd4-4709-a494-0323c5db70cc_540x220.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Figure 5. Overview of Reinforcement learning. Source: https://medium.com/@vishnuvijayanpv/deep-reinforcement-learning-artificial-intelligence-machine-learning-and-deep-learning-e52cb5974420</figcaption></figure></div><p>Reinforcement learning is a fairly intuitive, but math-heavy, ML method where an <strong>agent interacts with the world</strong> and <strong>periodically receives rewards / reinforcements </strong>that reflect how well its doing. Reinforcement learning is especially popular in environments where there are <strong>small number of training examples</strong> compared to the state space the agent exists in (like chess) that supervised learning would struggle in. </p><p>Each agent exists in an environment that resembles a <strong>Markov decision process</strong>, such as a maze, with other states that the agent can transition to. The agent has a policy that determines whether it will <strong>explore</strong> the environment or <strong>exploit</strong> an action with known outcomes. Each agent goes through several trials of decisions, determines which actions likely led to the desired outcomes, and updates its policies with what it learned.</p><p>One popular method of reinforcement learning is <strong>Q-learning </strong>(or Quality) where the agent estimates the max expected future reward for each action in each state. While Q learning is fairly primitive, it forms the basis for more sophisticated algorithms.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bX8X!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bX8X!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png 424w, https://substackcdn.com/image/fetch/$s_!bX8X!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png 848w, https://substackcdn.com/image/fetch/$s_!bX8X!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png 1272w, https://substackcdn.com/image/fetch/$s_!bX8X!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bX8X!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png" width="396" height="180.46285714285713" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:319,&quot;width&quot;:700,&quot;resizeWidth&quot;:396,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!bX8X!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png 424w, https://substackcdn.com/image/fetch/$s_!bX8X!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png 848w, https://substackcdn.com/image/fetch/$s_!bX8X!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png 1272w, https://substackcdn.com/image/fetch/$s_!bX8X!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce36e8d5-bfc0-4c4b-83eb-35561f1a2caa_700x319.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Figure 6. Deep Reinforcement Learning. Source: https://medium.com/@vishnuvijayanpv/deep-reinforcement-learning-artificial-intelligence-machine-learning-and-deep-learning-e52cb5974420</figcaption></figure></div><p>In a 2013 paper, <strong>&#8220;Playing Atari with Deep Reinforcement Learning&#8221;</strong>, Hassabis attempted to combine deep and reinforcement learning to create a single NN agent that is able to <strong>learn to play as many of the games as possible without knowing the rules in advance</strong> [1]. However, combining deep learning and reinforcement learning presented some challenges:</p><ul><li><p><strong>Deep learning</strong> applications require <strong>large amounts of hand-labelled training data </strong>(at the time). It also assumes samples to be independent.</p></li><li><p><strong>RL algorithms,</strong> on the other hand, learn from a scalar reward signal that is frequently<strong> sparse, noisy and delayed</strong>. It also frequently encounters <strong>highly correlated states.</strong></p></li></ul><p>DeepMind used <strong>Deep Q-Networks (DQN) </strong>to play Atari games. They use a method of <strong>&#8220;experience replay&#8221;</strong> where the agent&#8217;s experiences are stored at each time step, then are <strong>sampled uniformly</strong> when performing updates. Randomizing samples helps break up correlations and allows the DNN to learn more effectively.</p><p>They demonstrate this approach on seven Atari Games (Beam Rider, Breakout, Enduro, Pong, Q*bert, Seaquest, and Space Invaders) and showed that it <strong>generalizes</strong> to a variety of games without specific information about each game. </p><h4>Transformer (2017)</h4><p>The Transformer architecture was introduced in the June 2017 seminal paper "<strong>Attention Is All You Need</strong>" by researchers from Google Brain and Google Research. <strong>It is widely considered one of the most important, consequential, and influential AI papers of the 21st century</strong> because it influences the architecture of Large Language Models we use today.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BbYY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BbYY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png 424w, https://substackcdn.com/image/fetch/$s_!BbYY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png 848w, https://substackcdn.com/image/fetch/$s_!BbYY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png 1272w, https://substackcdn.com/image/fetch/$s_!BbYY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BbYY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png" width="877" height="179" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f96583df-4af2-4737-9491-206f44e979cb_877x179.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:179,&quot;width&quot;:877,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:13142,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BbYY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png 424w, https://substackcdn.com/image/fetch/$s_!BbYY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png 848w, https://substackcdn.com/image/fetch/$s_!BbYY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png 1272w, https://substackcdn.com/image/fetch/$s_!BbYY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff96583df-4af2-4737-9491-206f44e979cb_877x179.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Figure 7. A Sequence to Sequence Model with Long Short term Memory [2]</figcaption></figure></div><p>Prior to the transformer model, the standard approach for language training were <strong>Recurrent Neural Networks (RNNs)</strong> and <strong>sequence-to-sequence</strong> models with <strong>Long Short-Term Memory (LSTM</strong>). A RNN/LSTM processes words sequentially. This is useful in simple, condensed sentences structures such as subject-verb-object pairs. However, it suffers from modern sentences especially when <strong>meaning between words (both local and global context) are far away from each other in time. </strong></p><p>The proposed &#8220;<strong>Attention&#8221;</strong> mechanism allows <strong>each word context into what other related words are doing</strong>. To get an idea for how the Attention mechanism works, lets consider real sentences. </p><p>Real sentences can have <strong>multiple parts of speech</strong>, with nouns (things), adjectives (modifiers to nouns), verbs (actions) and other parts of speech. Nouns can take on multiple meanings depending on the &#8220;context&#8221; is it in. Take the word &#8220;apple&#8221; in the following sentences:</p><ul><li><p>Apple will release an iPhone next year.</p></li><li><p>I ate an apple and it was delicious.</p></li></ul><p>In each sentence, its clear to us what &#8220;Apple&#8221; refers to: a company in the first sentence because of the world &#8220;IPhone&#8221; that provided context, and a fruit in the second because of the word, &#8220;delicious&#8221;. We humans used <strong>context clues </strong>to determine what &#8220;apple&#8221; means based on supporting words around it.</p><p>This is roughly what &#8220;<strong>Attention&#8221;</strong> enables mathematically in the Transformer architecture.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b6Oy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b6Oy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png 424w, https://substackcdn.com/image/fetch/$s_!b6Oy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png 848w, https://substackcdn.com/image/fetch/$s_!b6Oy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png 1272w, https://substackcdn.com/image/fetch/$s_!b6Oy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b6Oy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png" width="287" height="399.0211800302572" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:919,&quot;width&quot;:661,&quot;resizeWidth&quot;:287,&quot;bytes&quot;:115042,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!b6Oy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png 424w, https://substackcdn.com/image/fetch/$s_!b6Oy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png 848w, https://substackcdn.com/image/fetch/$s_!b6Oy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png 1272w, https://substackcdn.com/image/fetch/$s_!b6Oy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F99aee463-316f-43c6-a5bf-97ced3b42069_661x919.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 8. The Transformer Architecture [3]</figcaption></figure></div><p>In the original Transformer, words are processed in parallel through attention layers rather than one step at a time as in RNNs. These words are broken down into several units called <strong>&#8220;tokens&#8221;.</strong></p><p>Then, each token is converted to a high dimensional vector called an <strong>embedding</strong> that <strong>captures the meaning of each token</strong>. Think of this vector like the entries of a <strong>thesaurus</strong> where each tokens&#8217; values corresponds to how strong it correlates to each entry in the thesaurus. A word like apple can map well on the the position that represents companies (such as Tesla, Exxon) and another position that specific fruits (oranges, pears). These embedding values are found during <strong>training</strong> on billions of words since each embedding has multiple &#8220;dimensions&#8221; it sits on.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vovv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vovv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png 424w, https://substackcdn.com/image/fetch/$s_!vovv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png 848w, https://substackcdn.com/image/fetch/$s_!vovv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png 1272w, https://substackcdn.com/image/fetch/$s_!vovv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vovv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png" width="568" height="325.6178194607268" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f9b4505a-b006-415e-9641-fc3e19067d53_853x489.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:489,&quot;width&quot;:853,&quot;resizeWidth&quot;:568,&quot;bytes&quot;:235034,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!vovv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png 424w, https://substackcdn.com/image/fetch/$s_!vovv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png 848w, https://substackcdn.com/image/fetch/$s_!vovv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png 1272w, https://substackcdn.com/image/fetch/$s_!vovv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9b4505a-b006-415e-9641-fc3e19067d53_853x489.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 9. Attention Mechanism showing relationship between words stores as &#8220;weights&#8221; in a matrix of all possible combinations [3]</figcaption></figure></div><p>Then, through matrix operations, <strong>every word in a sentence calculates how much &#8220;attention&#8221; it should pay to every </strong><em><strong>other</strong></em><strong> word in that same sentence.</strong> This allows it to figure out relationship between words based on <strong>context clues.</strong></p><p>To accomplish this, each embedding vector is decomposed into three vectors : <strong>Query (Q)</strong>, <strong>Key (K)</strong>, and <strong>Value (V)</strong>.  The query vector for each word is then multiplied by the key vectors for all the other words to come up with an <strong>&#8220;Attention&#8221; matrix</strong> that represents how related words are to each other. This attention value is given by the following formula:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6O53!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6O53!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png 424w, https://substackcdn.com/image/fetch/$s_!6O53!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png 848w, https://substackcdn.com/image/fetch/$s_!6O53!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png 1272w, https://substackcdn.com/image/fetch/$s_!6O53!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6O53!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png" width="443" height="88.25725338491296" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:103,&quot;width&quot;:517,&quot;resizeWidth&quot;:443,&quot;bytes&quot;:19624,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6O53!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png 424w, https://substackcdn.com/image/fetch/$s_!6O53!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png 848w, https://substackcdn.com/image/fetch/$s_!6O53!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png 1272w, https://substackcdn.com/image/fetch/$s_!6O53!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbb25c47e-fd19-4585-9b1d-b73c5acc189e_517x103.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Figure 10. Attention Formula [3]</figcaption></figure></div><p>To predict the next word in the sequence, it uses a decoder to look at words in context to be able to generate the sequence of words that map well to the original meaning of the sentence.</p><p>AlphaFold 2 also showed how attention-like architectures could move beyond language: it used attention over sequence and pair representations to <strong>reason about relationships between amino acids</strong>, including residues far apart in the 1D sequence but close in 3D structure.</p><p>The transformer is significant because it is <strong>scalable along memory and compute:</strong></p><ul><li><p>The<strong> context window</strong> represents the &#8220;short term memory&#8221; as the maximum number of tokens (words) the model can &#8220;see&#8221; in one single look. </p></li><li><p>The main attention calculation is a <strong>matrix multiplication</strong>, so modern GPUs can <strong>parallelize these matrix multiplications</strong> to higher embedding dimensions.</p></li></ul><h4>Sentiment Neuron (2017)</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RItM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RItM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png 424w, https://substackcdn.com/image/fetch/$s_!RItM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png 848w, https://substackcdn.com/image/fetch/$s_!RItM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png 1272w, https://substackcdn.com/image/fetch/$s_!RItM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RItM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png" width="326" height="461.98342541436466" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1026,&quot;width&quot;:724,&quot;resizeWidth&quot;:326,&quot;bytes&quot;:573607,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RItM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png 424w, https://substackcdn.com/image/fetch/$s_!RItM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png 848w, https://substackcdn.com/image/fetch/$s_!RItM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png 1272w, https://substackcdn.com/image/fetch/$s_!RItM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe611dd22-62e5-4e17-a855-cde2bdd8be6e_724x1026.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 11. Value of single Sentiment cell (positive, negative, neutral) while parsing IMBd review [4]</figcaption></figure></div><p>The paper <strong>&#8220;Learning to Generate Reviews and Discovering Sentiment&#8221;</strong> was released by OpenAI on April 5, 2017. This paper was consequential because it became an early and memorable example of a neural network learning an <strong>internal representation</strong> that researchers did not explicitly program.</p><p>Originally, the researchers at OpenAI trained a specific type of RNN called an <strong>mLSTM</strong> (Multiplicative LSTM) on 82 million Amazon reviews with unsupervised language modeling. They found that <strong>one single neuron</strong> had essentially "volunteered" to track sentiment of whether parts of the review were positive, neutral, or negative.</p><p>This paper is considered <strong>highly influential </strong>to AI safety because it gave insight to researchers that certain neurons can be &#8220;inspected&#8221; to find big picture behavior and debug potentially dangerous behavior.</p><h4>Few-shot and in-context learning in GPT-3 (2020)</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Mj4w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Mj4w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png 424w, https://substackcdn.com/image/fetch/$s_!Mj4w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png 848w, https://substackcdn.com/image/fetch/$s_!Mj4w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png 1272w, https://substackcdn.com/image/fetch/$s_!Mj4w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Mj4w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png" width="1456" height="666" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:666,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:109512,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Mj4w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png 424w, https://substackcdn.com/image/fetch/$s_!Mj4w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png 848w, https://substackcdn.com/image/fetch/$s_!Mj4w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png 1272w, https://substackcdn.com/image/fetch/$s_!Mj4w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff72c6e8d-6bcc-4be8-bd74-8ecbd847201a_1488x681.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 12. Meta Learning on a broadly trained LLM with context provided during inference to tailor the LLM to the specific task [5]</figcaption></figure></div><p>Since the transformer paper was published, the standard approach for LLMs was to pretrain the models on a ton of data and fine tune the model for each specific task, such as reading comprehension and answering questions. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_Ksx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_Ksx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png 424w, https://substackcdn.com/image/fetch/$s_!_Ksx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png 848w, https://substackcdn.com/image/fetch/$s_!_Ksx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png 1272w, https://substackcdn.com/image/fetch/$s_!_Ksx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_Ksx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png" width="409" height="353.0891530460624" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/58ea62f9-a084-44da-8958-5be095023698_673x581.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:581,&quot;width&quot;:673,&quot;resizeWidth&quot;:409,&quot;bytes&quot;:110466,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!_Ksx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png 424w, https://substackcdn.com/image/fetch/$s_!_Ksx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png 848w, https://substackcdn.com/image/fetch/$s_!_Ksx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png 1272w, https://substackcdn.com/image/fetch/$s_!_Ksx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58ea62f9-a084-44da-8958-5be095023698_673x581.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 13. Examples of Prompts for in-context learning in zero-shot, one-shot, and few-shot learning [5]</figcaption></figure></div><p>In Apr 2020, when OpenAI released GPT-3, it published a paper, &#8220;Language Models are Few-Shot Learners&#8221;. This paper presents <strong>&#8220;Few-shot learning&#8221; </strong>where a model learns to perform new tasks or recognize patterns simply <strong>by providing examples in the prompt</strong>, a capability known as <strong>in-context learning</strong>. It demonstrated that a generalized LLM model could be <strong>taught to solve new tasks using only a very small amount of training data. </strong>Other settings such as zero-shot and one-shot were shown with some accuracy loss compared to few-shot at the time. </p><p>Together, the Transformer architecture and GPT-3 helped make <strong>scaling</strong> a central paradigm in AI: as model size, data, and compute increased, capabilities improved in increasingly predictable ways.</p><h4>Reinforcement Learning with Human Feedback (2022)</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xE7t!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xE7t!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png 424w, https://substackcdn.com/image/fetch/$s_!xE7t!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png 848w, https://substackcdn.com/image/fetch/$s_!xE7t!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png 1272w, https://substackcdn.com/image/fetch/$s_!xE7t!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xE7t!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png" width="523" height="310.768115942029" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:492,&quot;width&quot;:828,&quot;resizeWidth&quot;:523,&quot;bytes&quot;:140909,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xE7t!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png 424w, https://substackcdn.com/image/fetch/$s_!xE7t!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png 848w, https://substackcdn.com/image/fetch/$s_!xE7t!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png 1272w, https://substackcdn.com/image/fetch/$s_!xE7t!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F25f6ec9c-b360-4ff2-98e9-faa53fbc4043_828x492.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 14. Methodology for Reinforcement Learning with Human Feedback [6]</figcaption></figure></div><p>When LLMs began scaling in complexity, they sometimes generated outputs that were not <strong>aligned</strong> with their users, such as toxic or biased statements. The paper <strong>&#8220;Training language models to follow instructions with human feedback&#8221;</strong> used human-guided Reinforcement learning to &#8220;teach&#8221; the model what outputs are acceptable to humans. </p><p>OpenAI hired 40 contractors and trained a model, <strong>InstructGPT</strong>, with human feedback. They evaluated its responses vs GPT-3 and found that <strong>labelers significantly prefer InstructGPT</strong> outputs over outputs from GPT-3. </p><p>This procedure showed <strong>improvements in truthfulness and reducing toxicity</strong>. However, RLHF is hard to scale because human-provided feedback on AI is harder as AI generated more sophisticated outputs.</p><h4>Chain-of-thought Prompting (2022)</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DVyE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DVyE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png 424w, https://substackcdn.com/image/fetch/$s_!DVyE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png 848w, https://substackcdn.com/image/fetch/$s_!DVyE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png 1272w, https://substackcdn.com/image/fetch/$s_!DVyE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DVyE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png" width="621" height="305.37931034482756" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:656,&quot;width&quot;:1334,&quot;resizeWidth&quot;:621,&quot;bytes&quot;:187965,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DVyE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png 424w, https://substackcdn.com/image/fetch/$s_!DVyE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png 848w, https://substackcdn.com/image/fetch/$s_!DVyE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png 1272w, https://substackcdn.com/image/fetch/$s_!DVyE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a64d73c-babe-46bb-a361-b7edf452fdff_1334x656.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 15. Illustration of Chain-of-Thought Prompting [7]</figcaption></figure></div><p>The concept of Chain-of-thought prompting was introduced in the 2022 paper, <strong>&#8220;Chain-of-Thought Prompting Elicits Reasoning in Large Language Models&#8221;</strong>. CoT was a major demonstration that providing <strong>intermediate reasoning examples</strong> to pretrained LLMs can <strong>improve performance on arithmetic, commonsense, and symbolic reasoning tasks</strong> in sufficiently large models.</p><p>CoT also made model outputs <strong>more interpretable</strong> at the surface level, because humans could inspect the generated intermediate steps, though those steps should not be treated as a guaranteed faithful transcript of the model&#8217;s internal computation.</p><p>This approach is similar to<strong> how humans learn by analogy/example.</strong> It is typical for humans to decompose the problem into intermediate steps and solve each before giving the final answer. Most of us were taught to <strong>&#8220;show your work&#8221; </strong>when solving complex problems and were rewarded partial credit even though our answer might be wrong, since the line of reasoning can be valid up to a certain point. </p><h4>Anthropic - Interpretability (2021-)</h4><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Egvl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Egvl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png 424w, https://substackcdn.com/image/fetch/$s_!Egvl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png 848w, https://substackcdn.com/image/fetch/$s_!Egvl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png 1272w, https://substackcdn.com/image/fetch/$s_!Egvl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Egvl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png" width="423" height="98" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:98,&quot;width&quot;:423,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:10781,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/195320512?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Egvl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png 424w, https://substackcdn.com/image/fetch/$s_!Egvl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png 848w, https://substackcdn.com/image/fetch/$s_!Egvl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png 1272w, https://substackcdn.com/image/fetch/$s_!Egvl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe04be582-c0b3-4a38-a9a8-7cec31102e14_423x98.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Anthropic was founded in <strong>2021</strong> by former OpenAI executives Dario and Daniela Amodei and other colleagues. It was established as a public benefit corporation (PBC) dedicated to <strong>AI safety, focusing on creating reliable, interpretable, and safe AI systems</strong> (like Claude) following disagreements over the rapid commercialization and safety approach at OpenAI.</p><p>Anthropic&#8217;s safety research pushed hard on <strong>mechanistic interpretability:</strong> trying to identify features, circuits, and internal representations that help explain why a model produced a given output. </p><p>This matters because <strong>interpretability is one possible path toward moving AI safety </strong>beyond trial-and-error behavior testing. Red teaming can reveal when a model fails from the outside; interpretability tries to understand why the model behaves that way from the inside.</p><h2>Conclusion</h2><p>Modern AI did not emerge from a single breakthrough, company, or personality. It came from the <strong>convergence of several technical ideas:</strong> <strong>deep reinforcement learning, transformers, scaling laws, RLHF, chain-of-thought prompting, and interpretability </strong>&#8212; with a set of organizational tensions that shaped how quickly those ideas moved from <strong>research labs</strong> into the <strong>hands of the public.</strong></p><p>Demis Hassabis and Sam Altman represent two different archetypes in this story. </p><ul><li><p>Hassabis reflects the <strong>research-driven</strong>,<strong> long-horizon</strong> pursuit of intelligence as a scientific problem. </p></li><li><p>Altman reflects the <strong>commercial, political, and go-to-market instincts</strong> required to turn frontier research into a mass-market product. </p></li></ul><p>Both approaches have strengths and failure modes, and that tension is the real story of modern AI. </p><p>The next era of AI will likely be shaped not only by better models, but by how well institutions can balance those forces while building systems that are powerful, useful, and aligned with human intent.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.siliconcodesign.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Inside the Silicon Machine is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>If you want to continue expanding your knowledge on the HW side of AI, check out my other post:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;ac625011-90ff-4725-a190-69907c0d8eea&quot;,&quot;caption&quot;:&quot;I believe its important for people working around AI - ML engineers, semiconductor engineers, and end-users - to have a holistic understanding of AI, including the software interface and hardware underneath that optimizes calculations.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;md&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;An Overview of AI Accelerators: MAC Operations, DRAM/SRAM, Performance Metrics, and Architecture&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:127334549,&quot;name&quot;:&quot;Chad Wallace&quot;,&quot;bio&quot;:&quot;Deeply-researched Mental Models on Mixed Signal architectures for AI Datacenters - High Speed Optical/Wireline, Power Electronics, and adjacent domains &quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50804a2f-043d-47c7-915f-b30144640b1f_697x697.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2026-01-27T16:01:50.107Z&quot;,&quot;cover_image&quot;:&quot;https://substackcdn.com/image/fetch/$s_!ZCEN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://chadw.substack.com/p/ai-accelerators-a-primer&quot;,&quot;section_name&quot;:null,&quot;video_upload_id&quot;:null,&quot;id&quot;:185475328,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:8,&quot;comment_count&quot;:0,&quot;publication_id&quot;:7378748,&quot;publication_name&quot;:&quot;Inside the Silicon Machine&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!cXVP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2817a3aa-afa0-4d3a-b965-bace92d3d63e_902x902.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p></p><div class="captioned-button-wrap" data-attrs="{&quot;url&quot;:&quot;https://www.siliconcodesign.com/p/a-brief-history-of-modern-ai-deepmind?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="CaptionedButtonToDOM"><div class="preamble"><p class="cta-caption">Thanks for reading Inside the Silicon Machine! This post is public so feel free to share it.</p></div><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.siliconcodesign.com/p/a-brief-history-of-modern-ai-deepmind?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.siliconcodesign.com/p/a-brief-history-of-modern-ai-deepmind?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p></div><h2>References</h2><p>[1] Mnih, V, et al. &#8220;Playing Atari with Deep Reinforcement Learning&#8221; https://arxiv.org/pdf/1312.5602</p><p>[2] Sutskever, I, Vinyals, O, Le, Q. &#8220;Sequence to Sequence Learning with Neural Networks&#8221; https://arxiv.org/pdf/1409.3215</p><p>[3] Vaswani, A, et al. &#8220;Attention Is All You Need&#8221; https://arxiv.org/pdf/1706.03762</p><p>[4] Radford, A, Jozefowicz, R, Sutskever, I. &#8220;Learning to Generate Reviews and Discovering Sentiment&#8221; https://arxiv.org/pdf/1704.01444</p><p>[5] Brown, T, et al. &#8220;Language Models are Few-Shot Learners&#8221; https://arxiv.org/pdf/2005.14165</p><p>[6] Ouyang, L, et al. &#8220;Training language models to follow instructions with human feedback&#8221; https://arxiv.org/pdf/2203.02155</p><p>[7] Wei, J, et al. &#8220;Chain-of-Thought Prompting Elicits Reasoning in Large Language Models&#8221; https://arxiv.org/pdf/2201.11903</p><p></p><p></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.siliconcodesign.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Inside the Silicon Machine is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[An Overview of AI Accelerators: MAC Operations, DRAM/SRAM, Performance Metrics, and Architecture  ]]></title><description><![CDATA[Learning from a classic AI accelerator - Eyeriss]]></description><link>https://www.siliconcodesign.com/p/ai-accelerators-a-primer</link><guid isPermaLink="false">https://www.siliconcodesign.com/p/ai-accelerators-a-primer</guid><dc:creator><![CDATA[Chad]]></dc:creator><pubDate>Tue, 27 Jan 2026 16:01:50 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ZCEN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I believe its important for people working around AI - ML engineers, semiconductor engineers, and end-users - to have a <strong>holistic</strong> understanding of AI, including the software interface and hardware underneath that optimizes calculations. </p><p>This primer focuses on <strong>AI accelerators</strong>: hardware designed to run deep learning efficiently. We&#8217;ll build intuition from the ground up (MACs &#8594; DNN/CNN structure &#8594; memory), explain the <strong>Von Neumann bottleneck</strong>, introduce the key performance metrics, and then ground everything in a classic example: <strong>Eyeriss</strong>.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.siliconcodesign.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Inside the Silicon Machine! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>This post synthesizes ideas primarily from &#8220;<em>Efficient Processing of Deep Neural Networks&#8221;</em> and &#8220;<em>Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks</em>.&#8221; Figures are adapted for educational commentary with attribution in captions; please see the original papers for full detail.</p><h4>The compute: The Multiply- accumulate</h4><p>The Multiply - accumulation operation is the heart of most deep learning applications. </p><p>In simplest form, </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!U3ob!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!U3ob!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png 424w, https://substackcdn.com/image/fetch/$s_!U3ob!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png 848w, https://substackcdn.com/image/fetch/$s_!U3ob!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png 1272w, https://substackcdn.com/image/fetch/$s_!U3ob!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!U3ob!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png" width="237" height="91" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:91,&quot;width&quot;:237,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3277,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!U3ob!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png 424w, https://substackcdn.com/image/fetch/$s_!U3ob!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png 848w, https://substackcdn.com/image/fetch/$s_!U3ob!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png 1272w, https://substackcdn.com/image/fetch/$s_!U3ob!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9fc1d7f2-07bb-4f31-b8f8-e45d5db0496b_237x91.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>then applies a nonlinearity (often ReLU). At the output, sigmoid (binary) or softmax (multi-class) can convert logits into probabilities.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZCEN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZCEN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png 424w, https://substackcdn.com/image/fetch/$s_!ZCEN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png 848w, https://substackcdn.com/image/fetch/$s_!ZCEN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png 1272w, https://substackcdn.com/image/fetch/$s_!ZCEN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZCEN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png" width="568" height="335" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:335,&quot;width&quot;:568,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:22794,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZCEN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png 424w, https://substackcdn.com/image/fetch/$s_!ZCEN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png 848w, https://substackcdn.com/image/fetch/$s_!ZCEN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png 1272w, https://substackcdn.com/image/fetch/$s_!ZCEN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e4d9dff-3c5d-43cb-8488-71862bfffdf5_568x335.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 1. A &#8220;neuron&#8221; consisting of a MAC operation and a ReLU Activation function</figcaption></figure></div><p>There are few <strong>intuitions</strong> I use the thinking about Neural networks:</p><ul><li><p>MAC operations are analogous to <strong>biological neurons</strong>: many inputs influence an output, and the weights determine what patterns the unit responds to. (It&#8217;s an analogy&#8212;not a literal model of the brain&#8212;but it&#8217;s a useful intuition.)</p></li><li><p>A neuron (dot product + nonlinearity) behaves like a <strong>pattern detector </strong>where the output is high in response to a input pattern that correlates with the combination of weights.</p></li></ul><p>This multiple accumulate operation can be found across many range of deep learning applications.</p><h4>Feedforward Neural Networks</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!uZVl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!uZVl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png 424w, https://substackcdn.com/image/fetch/$s_!uZVl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png 848w, https://substackcdn.com/image/fetch/$s_!uZVl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png 1272w, https://substackcdn.com/image/fetch/$s_!uZVl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!uZVl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png" width="564" height="546" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:546,&quot;width&quot;:564,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:53147,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!uZVl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png 424w, https://substackcdn.com/image/fetch/$s_!uZVl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png 848w, https://substackcdn.com/image/fetch/$s_!uZVl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png 1272w, https://substackcdn.com/image/fetch/$s_!uZVl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ab67a3a-4269-4f09-b75f-3ab0bb5db0a8_564x546.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Figure 2. Basic structure of a deep neural network. Each &#8220;web&#8221; of connections is represented as a matrix of weights that performs the MAC operation with the inputs. The &#8220;magnitude&#8221; of each weight is shown as boldness that is tuned during training</figcaption></figure></div><p>In feedforward neural networks, many of these neurons are assembled in parallel in a &#8220;pipelined&#8221; fashion. The inputs all connect to each &#8220;neuron&#8221;, each with its own unique set of weights to detect specific <strong>patterns</strong>, and these neurons then pass that information to later layers or the output. Additional layers can be added to be able to fine tune the pattern detection. </p><p>Adding &#8220;hidden&#8221; layers between the input and output allows the network to build up from <strong>simple features</strong> (edges/textures) to <strong>more complex features</strong> (higher-level shapes and objects).</p><p>There are two major operations performed on neural networks:</p><ul><li><p><strong>Training </strong>= learning the weights by passing images through, computing the error, and tuning the weights through &#8220;backpropagation&#8221;</p></li><li><p><strong>Inference</strong> = running a trained model to produce outputs</p></li></ul><p>Inference is commonly run on edge devices locally; training is usually done on large GPU/accelerator systems. Both training and inference can be run on GPUs or specialized accelerators optimized for these workloads.</p><h4>Convolutional Neural networks</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XZTv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XZTv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png 424w, https://substackcdn.com/image/fetch/$s_!XZTv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png 848w, https://substackcdn.com/image/fetch/$s_!XZTv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png 1272w, https://substackcdn.com/image/fetch/$s_!XZTv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XZTv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png" width="994" height="561" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:561,&quot;width&quot;:994,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:292776,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!XZTv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png 424w, https://substackcdn.com/image/fetch/$s_!XZTv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png 848w, https://substackcdn.com/image/fetch/$s_!XZTv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png 1272w, https://substackcdn.com/image/fetch/$s_!XZTv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F11230c43-7733-4b75-8bd2-45087a760fe9_994x561.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Figure 3. </strong>A Convolution Neural network for Image recognition, with &#8220;filters&#8221; as edge detectors. Images are represented as &#8220;tensors&#8221; where pixels have RGB values. The Filter is stepped across the image with a specific &#8220;stride&#8221; to generate an output fmap consisting of features that strongly correlate to the filter. These fmaps are then normalized, pooled, and processed in deeper layers to build more complex shapes.</figcaption></figure></div><p>Convolutional neural networks (CNNs) are widely used for <strong>image recognition and classification</strong>. Instead of connecting every pixel to every neuron (which would explode parameter count and compute), CNNs use <strong>small filters</strong> that slide across the input in <strong>strides</strong>. </p><p>Each layer of the Convolutional Neural Networks operates in the following way:</p><ul><li><p>Small filters represented as matrices can be customized to detect specific features, but the simplest case are <strong>&#8220;edge detectors&#8221;</strong> that detects vertical lines, horizontal lines, and diagonal dimensions.</p></li><li><p>These filters are &#8220;convolved&#8221; with each each <strong>input feature map (ifmaps)</strong>, which is image pixel that is most commonly represented by a <strong>tensor</strong> of RGB values. </p></li><li><p>The<strong> output feature map (ofmaps)</strong> is calculated by sliding the filter across the input and accumulating <strong>partial sums</strong> (<strong>psums</strong>) along the way. </p></li><li><p>This feature map is condensed through <strong>normalization</strong> and <strong>&#8220;pooling&#8221;</strong> operations. </p></li></ul><p>Each <strong>output feature map</strong> corresponds to a filter where the particular pattern is present (such as horizontal/vertical/diagonal line segments). As layers get deeper, higher order constructs are formed from simpler ones and <strong>spatial resolution shrinks</strong>. When dimensions are small enough, the CNN is connected to a <strong>fully connected layer</strong> that generates an output.</p><p>There are computational observations in both DNNs and CNNs as it relates to hardware:</p><ul><li><p>Filter weights can be <strong>reused</strong> in computation without needing to retrieve it from memory every time.</p></li><li><p>The tensor shapes and data dependencies are <strong>known ahead of time</strong>, so you can pre-plan tiling and data movement to minimize expensive memory traffic.</p></li></ul><p>These two points will become important later when we discuss <em>Eyeriss</em>. </p><h4>The Von Neumann bottleneck</h4><p><em>You have the best compute in the world, but it doesn&#8217;t matter if memory can&#8217;t keep up. </em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rhkm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rhkm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png 424w, https://substackcdn.com/image/fetch/$s_!rhkm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png 848w, https://substackcdn.com/image/fetch/$s_!rhkm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png 1272w, https://substackcdn.com/image/fetch/$s_!rhkm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rhkm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png" width="815" height="367" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/efdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:367,&quot;width&quot;:815,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:20884,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rhkm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png 424w, https://substackcdn.com/image/fetch/$s_!rhkm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png 848w, https://substackcdn.com/image/fetch/$s_!rhkm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png 1272w, https://substackcdn.com/image/fetch/$s_!rhkm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fefdb868e-ab58-444b-80ce-8492ee4f6b07_815x367.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Figure 4.</strong> <em>A Von Neumann architecture with Data between Compute and Memory.</em></figcaption></figure></div><p>Most computing nowadays uses a <em>Von Neumann architecture</em>, where the compute and memory are split into two entities. Data stored in memory is retrieved and stored locally into CPU memory to perform operations on. When the result is computed, it is written back to the memory. </p><p>This architecture has worked as compute and memory are scaled. However, there are challenges being faced today:</p><ul><li><p>Compute throughput has scaled faster than memory bandwidth/latency improvements </p></li><li><p>There is a limit transfer rate and a high energy cost per memory retrieval</p></li></ul><p>This results in a <strong>bottleneck</strong>, where compute is limited by bandwidth + high energy per byte moved between memory and compute.</p><p>The question is, <em>why don&#8217;t you put compute and memory on the same chip</em>? You could, but its often not cost effective. Fab process for compute and memory optimize different things:</p><ul><li><p>Logic (CPU/GPUs) are optimized for <strong>speed of transistor switching</strong> and <strong>ease of routing interconnects</strong> through stacking metal layers</p></li><li><p>Memory requires <strong>high density storage</strong> and <strong>capacity</strong></p></li></ul><p>One method classical computer architecture employs to mitigate this issue is <strong>arranging memory in a hierarchy through caches</strong>. Here, data is retrieved from the main memory only as &#8220;needed&#8221;. This exploits a phenomenon called &#8220;<strong>spatial locality&#8221;</strong> where you are likely to retrieve data close to the one you retrieved. As a result, you can write a whole data &#8220;chunk&#8221; of memory to a cache to increases the chances that you &#8220;hit&#8221; data in the cache when going to retrieve it.</p><p>AI Accelerators exploit both <strong>spatial locality</strong> and <strong>data reuse</strong>. In order to take advantage of both, we need to understand two mature memory technologies used across the hierarchy: <strong>DRAM</strong>, and <strong>SRAM</strong>.</p><h4>The Memory: DRAM and SRAM</h4><p>In a na&#239;ve implementation, each MAC operation requires three memory reads and one memory write:</p><ul><li><p><strong>Read:</strong> weight</p></li><li><p><strong>Read: </strong>activation</p></li><li><p><strong>Read:</strong> accumulated partial sum</p></li><li><p><strong>Write: </strong>updated partial sum</p></li></ul><p>To store data into memory, there are two mature memory technologies available:</p><ol><li><p><strong>DRAM - Dynamic Random access memory</strong></p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!c_-h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!c_-h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png 424w, https://substackcdn.com/image/fetch/$s_!c_-h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png 848w, https://substackcdn.com/image/fetch/$s_!c_-h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png 1272w, https://substackcdn.com/image/fetch/$s_!c_-h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!c_-h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png" width="262" height="240" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:240,&quot;width&quot;:262,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3830,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!c_-h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png 424w, https://substackcdn.com/image/fetch/$s_!c_-h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png 848w, https://substackcdn.com/image/fetch/$s_!c_-h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png 1272w, https://substackcdn.com/image/fetch/$s_!c_-h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d6159c-2ad9-4243-8558-6872afed4e97_262x240.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Figure 5. A DRAM cell with transistor and capacitor</figcaption></figure></div><p>DRAM consists of capacitors that store charge and are read / written to word lines through a transistor. These memory cells  are are typically arrayed in a 2D grid of cells with vertical row lines and horizontal bit lines. Memory cells are places at the intersection of word lines and bit lines where each ones is accessed by a combination of specific word line and bit line. </p><p></p><p><strong>Pros: </strong>DRAM is <strong>cheap</strong> and <strong>dense</strong>.</p><p></p><p><strong>Cons: </strong>DRAM is <strong>volatile</strong> and requires periodic <strong>refresh</strong> because capacitor charge leaks. It also requires constant electrical power to hold data.</p><p></p></li><li><p><strong>SRAM - Static Random access memory</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6FTP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6FTP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png 424w, https://substackcdn.com/image/fetch/$s_!6FTP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png 848w, https://substackcdn.com/image/fetch/$s_!6FTP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png 1272w, https://substackcdn.com/image/fetch/$s_!6FTP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6FTP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png" width="459" height="308.56424581005587" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:361,&quot;width&quot;:537,&quot;resizeWidth&quot;:459,&quot;bytes&quot;:10221,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6FTP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png 424w, https://substackcdn.com/image/fetch/$s_!6FTP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png 848w, https://substackcdn.com/image/fetch/$s_!6FTP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png 1272w, https://substackcdn.com/image/fetch/$s_!6FTP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62b85498-cd3b-48ab-ab7c-c221b3228f0b_537x361.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Figure 6</strong>. A 6 transistor SRAM cell with two inverters connected in a positive feedback configuration</figcaption></figure></div><p>SRAM consists of cross coupled inverters in a latch (or positive feedback)  configuration. The most common configuration is a 6 transistor  cross coupled inverters to hold data. To read and store data, bitlines are charged and the wordline (WL) is activated depending on the values being read/written from/to the cell. The bitline needs to be <strong>strong (i.e. generate a lot of current)</strong> during write operations to break the feedback loop of the latches in order to flip the state of the memory cell. As with DRAM, these cells are arranged in a grid with these cells at the intersections. </p><p></p><p><strong>Pros</strong> SRAM is <strong>faster</strong> and <strong>lower energy per access</strong> than DRAM. SRAM can also be easier to integrate on compute processes. </p><p></p><p><strong>Cons:</strong> SRAM is <strong>much less dense</strong> and <strong>more expensive</strong> per bit.</p><p></p></li></ol><h4>Cost of memory: energy and bandwidth</h4><p>There are two issues with DRAM:</p><ul><li><p><strong>The energy cost of DRAM is 1-2 orders of magnitude more expensive to access than on chip memory [1].</strong> There are several factors that contribute to this, one of the largest being constant refresh cycles needed when non-ideal capacitors discharge through leakage paths.</p></li><li><p><strong>The data bandwidth of DRAM is finite</strong>. In modern computer designs, DRAM is separate from the compute chips (CPU, GPU) and connected via busses with finite latency and throughput capabilities. In addition, DRAM itself has high latency due to steps of row activation, command delays, and precharging.</p></li></ul><p>Due to the high energy cost, <strong>it is not desirable to read / write from DRAM continuously.</strong> Its better to read/write a &#8220;chunk&#8221; of information at a time and store the data locally on the chip to be accessed at memory technologies with lower energy cost.</p><p>There are a few trends that have been going on improve performance without significant changes to the Von-Neumann paradigm:</p><ul><li><p><strong>High bandwidth memory (HBM)</strong> - stacked DRAM integrated closely with the compute package to increase bandwidth.</p></li><li><p><strong>Optimizing memory usage on chip into a memory &#8220;hierarchy&#8221;.</strong> When data is know in advance, it is possible to structure the data in a way that optimizes the memory cost and schedule data as such. </p></li><li><p><strong> &#8220;Compute in/near memory&#8221;</strong> which uses more efficient memory technologies designed to handle this type of computation, or architectures design to do the computation by storing weights in conductance of resistors. There are many intriguing niche approaches here [2] but are beyond the scope of this post. </p></li></ul><h4>Performance metrics for DNN Models</h4><p>&#8220;Performance&#8221; in a deep learning system consists of how performance is defined on both the model side and the Hardware side. Both are important and closely intertwined.</p><p>Metrics for DNN Models include:</p><ul><li><p><strong>Accuracy and robustness - </strong>How accurate is the model on common &#8220;benchmark&#8221; datasets</p></li><li><p><strong>Network architecture - </strong>The number of layers, filters, and sizes</p></li><li><p><strong>Number of weights - </strong>Affects storage requirements</p></li><li><p><strong>Number of MAC operations </strong></p></li></ul><p>Often times, performance is compared in terms of normalized energy per MAC to give an apples to apples comparison of performance across different models.</p><h4>Performance metrics for AI Accelerators</h4><p>Hardware optimized for AI computation has its own set of performance metrics that that are optimized depending on the application:</p><ul><li><p><strong>Energy and Power</strong> is important for most applications, including cloud data centers with stringent power ceilings, as well as portable devices with limited battery</p></li><li><p><strong>Low latency </strong>is important in real time interactive applications</p></li><li><p><strong>High throughput</strong> is important for big data analytics is important when action needs to be taken based on the data</p></li><li><p><strong>Hardware cost</strong> is important for most applications, when tightly integrating an AI chip on a consumer device to run inference, as well as integrating into a blade on a rack in a data center</p></li></ul><h4>Bridging Compute and Memory: Eyeriss [3]</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6pXe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6pXe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png 424w, https://substackcdn.com/image/fetch/$s_!6pXe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png 848w, https://substackcdn.com/image/fetch/$s_!6pXe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png 1272w, https://substackcdn.com/image/fetch/$s_!6pXe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6pXe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png" width="564" height="297" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:297,&quot;width&quot;:564,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:11131,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6pXe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png 424w, https://substackcdn.com/image/fetch/$s_!6pXe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png 848w, https://substackcdn.com/image/fetch/$s_!6pXe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png 1272w, https://substackcdn.com/image/fetch/$s_!6pXe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6dc70051-89ce-4e52-8abf-ec428f26091c_564x297.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Figure 7.</strong> <em>Eyeriss System Architecture.</em> <strong>Adapted from</strong> Y.-H. Chen, T. Krishna, J. S. Emer, and V. Sze, &#8220;Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks,&#8221; <em>IEEE Journal of Solid-State Circuits</em>, early access, doi: <strong>10.1109/JSSC.2016.2616357</strong>, 2016, <strong>Fig. 2</strong>; Figure is simplified for clarity.</figcaption></figure></div><p>Eyeriss is a classic AI accelerator that optimizes for the <strong>deterministic</strong> nature of AI calculations. If you know the size and shapes of the filters, ifmaps, etc. you can optimize your flow to a given hardware set based on the size of the array. By doing so, <strong>you minimize the energy cost of moving data around.</strong></p><p>Eyeriss has a few key building blocks:</p><ul><li><p><strong>Processing Engine (PE)</strong></p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qJt_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qJt_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png 424w, https://substackcdn.com/image/fetch/$s_!qJt_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png 848w, https://substackcdn.com/image/fetch/$s_!qJt_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png 1272w, https://substackcdn.com/image/fetch/$s_!qJt_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qJt_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png" width="504" height="228.11009174311926" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:444,&quot;width&quot;:981,&quot;resizeWidth&quot;:504,&quot;bytes&quot;:28199,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!qJt_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png 424w, https://substackcdn.com/image/fetch/$s_!qJt_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png 848w, https://substackcdn.com/image/fetch/$s_!qJt_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png 1272w, https://substackcdn.com/image/fetch/$s_!qJt_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8ca08c3-5f3d-4128-b0d8-a52b682652d2_981x444.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><strong>Figure 8.</strong> Inside a <em>Eyeriss Processing Unit.</em> <strong>Adapted from</strong> Y.-H. Chen, T. Krishna, J. S. Emer, and V. Sze, &#8220;Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks,&#8221; <em>IEEE Journal of Solid-State Circuits</em>, early access, doi: <strong>10.1109/JSSC.2016.2616357</strong>, 2016, <strong>Fig. 12</strong>; Figure is simplified and colored added for clarity.</figcaption></figure></div><p></p><p>Each processing engine consists of a 16b two stage pipelined multiplier and adder, along with scratch pad memory to temporarily store the weights. </p><p></p></li><li><p><strong>Data parallelism through Network on chip (NoC)</strong></p><ul><li><p>There are 168 processing elements arranged in a Spatial architecture of <strong>12x14</strong>. </p></li><li><p>Each PE can communicate with neighboring PEs or the GLB through an NoC</p></li><li><p>The global input network handles the process of distributing the computations to each of the PEs though busses</p></li><li><p>Processing in each PE can be run independently from each other with data movement coordinated by the NoC</p></li></ul><p></p></li><li><p><strong>Four-level Memory Hierarchy</strong> </p><p>The memory hierarchy is arranged into four-level hierarchy in order of decreasing energy:</p><ul><li><p>Off Chip DRAM</p></li><li><p>108kB global buffer</p></li><li><p>NoC reuse paths (neighbor PE communication)</p></li><li><p>Scratch pads local to PE (520 bytes each)</p></li></ul></li></ul><p>The goal of this chip is simple:<strong> </strong></p><blockquote><p>To minimize high cost DRAM and GLB access by reusing data from low cost spads and inter-PE communication as much as possible. </p></blockquote><p>Eyeriss accomplishes this through a methodology called<strong> &#8220;Row stationary&#8221; </strong>that divides the matrix multiplication in a series of <strong>1D convolution primitives</strong> that form <strong>2D convolutional sets. </strong>:</p><ul><li><p><strong>1-D Convolution primitive in a PE</strong></p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GZh8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GZh8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png 424w, https://substackcdn.com/image/fetch/$s_!GZh8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png 848w, https://substackcdn.com/image/fetch/$s_!GZh8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png 1272w, https://substackcdn.com/image/fetch/$s_!GZh8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GZh8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png" width="552" height="170" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:170,&quot;width&quot;:552,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:6543,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GZh8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png 424w, https://substackcdn.com/image/fetch/$s_!GZh8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png 848w, https://substackcdn.com/image/fetch/$s_!GZh8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png 1272w, https://substackcdn.com/image/fetch/$s_!GZh8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7180ea31-74cb-421a-af04-d97d9d695a39_552x170.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><strong>Figure 9.</strong> <em>A 1D Convolution Primitive.</em> <strong>Adapted from</strong> Y.-H. Chen, T. Krishna, J. S. Emer, and V. Sze, &#8220;Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks,&#8221; <em>IEEE Journal of Solid-State Circuits</em>, early access, doi: <strong>10.1109/JSSC.2016.2616357</strong>, 2016, <strong>Fig. 3</strong>. Colors added for clarity</figcaption></figure></div><ul><li><p>A 1D convolution primitive can be best thought of as a &#8220;sliding window&#8221; approach because it &#8216;fixes&#8221; the filter weights in memory while the ifmaps are &#8220;slid&#8221; across it. </p></li><li><p>This allows weights/activations are reused many times before going back to DRAM and/or the GLB.</p></li><li><p>Accumulate psums from different primitives to generate the ofmap values</p></li></ul></li></ul><ul><li><p><strong>2D Convolution PE Set</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7aXS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7aXS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png 424w, https://substackcdn.com/image/fetch/$s_!7aXS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png 848w, https://substackcdn.com/image/fetch/$s_!7aXS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png 1272w, https://substackcdn.com/image/fetch/$s_!7aXS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7aXS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png" width="381" height="389" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:389,&quot;width&quot;:381,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:17071,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://chadw.substack.com/i/185475328?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7aXS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png 424w, https://substackcdn.com/image/fetch/$s_!7aXS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png 848w, https://substackcdn.com/image/fetch/$s_!7aXS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png 1272w, https://substackcdn.com/image/fetch/$s_!7aXS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e73574e-a3db-4a5c-be85-1681b81099e6_381x389.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><strong>Figure 10.</strong> <em>A 2D Convolution Primitive with Constant value contours shown along the arrows.</em> <strong>Adapted from</strong> Y.-H. Chen, T. Krishna, J. S. Emer, and V. Sze, &#8220;Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks,&#8221; <em>IEEE Journal of Solid-State Circuits</em>, early access, doi: <strong>10.1109/JSSC.2016.2616357</strong>, 2016, <strong>Fig. 4</strong>; Colors added for clarity</figcaption></figure></div><p></p><ul><li><p>2D convolution sets are formed from multiple 1D convolution primitives</p></li><li><p>In a 2D convolution set, the weight are written to spads in rows and the weights are &#8220;slid across&#8221; diagonally. It works in the following order:</p><ul><li><p><strong>Reuse</strong> each filter row horizontally </p></li><li><p><strong>Reuse</strong> each of ifmap diagonally </p></li><li><p><strong>Accumulate</strong> rows of psums vertically / features</p></li></ul></li><li><p>The challenge is to &#8220;map&#8221; PE sets onto the PE array <strong>cleanly</strong> due to different sizes of layer shapes and PE array dimensions. Any computation primitive that exceed the size of the PE array itself or width / height can incur a large energy cost and needs further optimization from RS dataflow. </p></li></ul></li></ul><p><strong>Other optimizations.</strong> These optimizations are also used and influential.</p><ul><li><p><strong>Run length compression</strong> - compress runs of 0&#8217;s</p></li><li><p>When distributing data from the GLB, only activate busses that correspond to the tag to save energy</p></li></ul><p><strong>A large fraction of the chip area goes to the memory, including the global buffer and on-chip storage close to the MACs.</strong></p><h4>Conclusion</h4><p>In this primer I built up intuition of how AI accelerators optimize energy of AI computations:</p><ul><li><p>MACs are the core primitive in most DNN workloads</p></li><li><p>DNN layers are repeated dot products with heavy weight/activation reuse potential</p></li><li><p>Performance is often limited by memory movement (energy + bandwidth), not arithmetic</p></li><li><p>Accelerators win by choosing dataflows + memory hierarchies that maximize reuse and minimize DRAM traffic</p></li></ul><p>There is a lot of research done to optimizing AI accelerators including related compute near memory architecture that leverage traditional compute architectures as much as possible, as well as intriguing compute in memory approaches. </p><p>If you want to learn more about the technical developments of AI on the SW modelling side, check out my other post, based heavily on the book &#8220;Infinity Machine&#8221;:</p><div class="digest-post-embed" data-attrs="{&quot;nodeId&quot;:&quot;9a35f59f-f872-4015-9840-7db3157ed365&quot;,&quot;caption&quot;:&quot;Demis Hassabis is perhaps one of the most important influences in Modern AI that isn&#8217;t a household name.&quot;,&quot;cta&quot;:null,&quot;showBylines&quot;:true,&quot;showDescription&quot;:true,&quot;showImage&quot;:true,&quot;size&quot;:&quot;lg&quot;,&quot;isEditorNode&quot;:true,&quot;title&quot;:&quot;From Atari to ChatGPT: The Technical and Corporate Forces Shaping Frontier AI&quot;,&quot;publishedBylines&quot;:[{&quot;id&quot;:127334549,&quot;name&quot;:&quot;Chad Wallace&quot;,&quot;bio&quot;:&quot;Deeply-researched Mental Models on Mixed Signal architectures for AI Datacenters - High Speed Optical/Wireline, Power Electronics, and adjacent domains &quot;,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/50804a2f-043d-47c7-915f-b30144640b1f_697x697.png&quot;,&quot;is_guest&quot;:false,&quot;bestseller_tier&quot;:null}],&quot;post_date&quot;:&quot;2026-04-29T17:54:36.593Z&quot;,&quot;cover_image&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fcc0b821-27c2-4f6d-b4e3-b1750426a5f6_1201x673.png&quot;,&quot;cover_image_alt&quot;:null,&quot;canonical_url&quot;:&quot;https://chadw.substack.com/p/a-brief-history-of-modern-ai-deepmind&quot;,&quot;section_name&quot;:&quot;Business Library&quot;,&quot;video_upload_id&quot;:null,&quot;id&quot;:195320512,&quot;type&quot;:&quot;newsletter&quot;,&quot;reaction_count&quot;:2,&quot;comment_count&quot;:0,&quot;publication_id&quot;:7378748,&quot;publication_name&quot;:&quot;Inside the Silicon Machine&quot;,&quot;publication_logo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!cXVP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2817a3aa-afa0-4d3a-b965-bace92d3d63e_902x902.png&quot;,&quot;belowTheFold&quot;:true,&quot;youtube_url&quot;:null,&quot;show_links&quot;:null,&quot;feed_url&quot;:null}"></div><p></p><h4>References</h4><p>[1] V. Sze, Y.-H. Chen, T.-J. Yang, and J. Emer, &#8220;Efficient Processing of Deep Neural Networks: A Tutorial and Survey,&#8221; arXiv:1703.09039v2, 2017.</p><p>[2] A. A. Khan, J. P. C. de Lima, H. Farzaneh, and J. Castrill&#243;n, &#8220;The Landscape of Compute-near-memory and Compute-in-memory: A Research and Commercial Overview,&#8221; <em>arXiv preprint</em> arXiv:2401.14428, 2024.</p><p>[3] Y.-H. Chen, T. Krishna, J. S. Emer, and V. Sze, &#8220;Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks,&#8221; <em>IEEE Journal of Solid-State Circuits</em>, vol. 52, no. 1, pp. 127&#8211;138, Jan. 2017, doi: 10.1109/JSSC.2016.2616357.</p><p></p><p></p><p></p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.siliconcodesign.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Inside the Silicon Machine! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>