<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Nate’s Substack]]></title><description><![CDATA[Daily newsletters on AI strategy, news, and implementation for practitioners and leaders who are past the hype and ready to build.]]></description><link>https://natesnewsletter.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!s4a7!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb3b96b13-6f01-4e56-b410-18e03e7bc8af_500x500.png</url><title>Nate’s Substack</title><link>https://natesnewsletter.substack.com</link></image><generator>Substack</generator><lastBuildDate>Tue, 26 May 2026 14:05:35 GMT</lastBuildDate><atom:link href="https://natesnewsletter.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Nate]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[natesnewsletter@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[natesnewsletter@substack.com]]></itunes:email><itunes:name><![CDATA[Nate]]></itunes:name></itunes:owner><itunes:author><![CDATA[Nate]]></itunes:author><googleplay:owner><![CDATA[natesnewsletter@substack.com]]></googleplay:owner><googleplay:email><![CDATA[natesnewsletter@substack.com]]></googleplay:email><googleplay:author><![CDATA[Nate]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[One person's best AI session vanishes the second they close the tab. Grab the 3 prompts that make it your team's.]]></title><description><![CDATA[Watch now | The AI work your company cannot see is the AI work your company cannot learn from.]]></description><link>https://natesnewsletter.substack.com/p/public-ai-work-team-learning</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/public-ai-work-team-learning</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Tue, 26 May 2026 13:03:17 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/199112667/ff85c937197c6d67a59e6b6693756c46.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>5,938 Shopify employees worked alongside the same AI agent in a single month, all of it in public. When those numbers surfaced this spring, everyone fixated on the scale. The scale is not the interesting part.</p><p>Almost no other company works this way, and most of them have the opposite problem. Their people are using AI constantly, asking ChatGPT to rewrite emails, using Claude to reason through customer issues, running coding agents against repos, quietly building small workflows that save hours. And almost none of it is visible to anyone else.</p><p>A good prompt disappears into one person&#8217;s chat history. A clever correction stays inside one employee&#8217;s browser tab. The workflow your best operator nailed last month gets rebuilt from scratch by a new hire who never knew it existed. A senior person figures out how to load context, challenge the model, and review what comes back, and the junior across the team never gets to watch how any of that judgment works.</p><p>The result: individuals get smarter, the company does not.</p><p>That is the missing layer in most AI adoption plans. Companies are buying tools, writing acceptable-use policies, measuring logins, running the occasional training session. None of it touches the actual problem, which is that your best AI work is invisible to everyone who could learn from it. So every quarter, the same lessons get rediscovered from nothing. You are paying tuition on the same lesson over and over, across hundreds of people, and the bill never stops arriving.</p><p>Shopify&#8217;s answer is not surveillance. It is not scraping everyone&#8217;s chat history and calling it knowledge management. It is something smaller and far more useful: a deliberate way to make non-sensitive AI work visible enough that the people around it can actually learn. The agent that produced those numbers runs only in public. That single design choice is the whole game, and any team can copy it.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>What Shopify actually built.</strong> Not the AI mandate everyone argued about, but the design choice underneath River, the agent that runs in public and turns one engineer&#8217;s judgment into the whole team&#8217;s.</p></li><li><p><strong>Why a prompt library will not save you.</strong> The most valuable part of AI work is the part a prompt library cannot hold, and what you have to make visible instead.</p></li><li><p><strong>Where to draw the line.</strong> A workflow-by-workflow boundary for regulated and sensitive work, so you capture the learning without exposing anything that should stay private.</p></li><li><p><strong>The room, the rules, and the one constraint that makes it work.</strong> A setup any team can run in ninety minutes, the metrics that actually signal learning, and the single binding rule that bends a team toward sharing instead of hoarding.</p></li><li><p><strong>How to bring this into your org.</strong> A three-part prompt kit: one that turns a messy AI session into a post your team can learn from, one that draws the line between what&#8217;s safe to share and what stays private, and one that helps your senior people model real work in public without it feeling staged.</p></li></ul><p>Private AI work helps the person at the keyboard. Public AI work helps the company learn. That difference is small today and decisive in a year.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/public-ai-work-team-learning">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[AI made your app teams 10x faster. Nobody gave your platform team 10x the headcount.]]></title><description><![CDATA[I sat down with Emma, who leads data infrastructure engineering at OpenAI, to find out what her team is actually building to stay ahead of the agents.]]></description><link>https://natesnewsletter.substack.com/p/ai-agents-platform-team-bottleneck</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/ai-agents-platform-team-bottleneck</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Mon, 25 May 2026 14:02:04 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/199078289/68bc96a2c3b28292375a068ff10f9be9.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Nobody meant to break anything. A user asked for something routine, an agent found a path through the system, and a Kafka cluster went down. That&#8217;s the part that doesn&#8217;t show up in productivity dashboards.</p><p>In the same conversation, I heard the opposite version of the same story. A user launched a training-data export job and went to sleep. The job hit a blocker. Instead of waiting for a platform engineer to notice a support ping, the agent inspected several internal systems and found a small bug several layers down. By morning, the job was done.</p><p>Those two examples belong together. They show both sides of the frontier. Agents are now useful enough to participate in real operational work. They&#8217;re also capable enough to create real operational risk.</p><p>That&#8217;s the next AI bottleneck. Not whether people use agents, or whether agents can code, or whether the next model wins a benchmark. We&#8217;re already past those questions. The real question is what happens when the work starts moving faster than the controls around it.</p><p>This came into focus in a recent conversation with Emma, who leads data infrastructure engineering at OpenAI. Her team sits near the bottom of a very large stack: the analytics, streaming, and data infrastructure that almost every other team eventually depends on.</p><p>That vantage point matters. If you&#8217;re on an application team, you mostly see whether agents help you build faster. If you&#8217;re on a platform team, you see what happens when everyone above you starts building faster.</p><p>Agents make work happen faster. They don&#8217;t make it safer at the same rate.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>Agents crossed into operations.</strong> They&#8217;re not writing code for humans to paste anymore. They run release loops and build features end-to-end while a human watches.</p></li><li><p><strong>The work lands somewhere.</strong> When app teams accelerate, platform teams inherit the operational burden nobody budgeted for.</p></li><li><p><strong>Platform agents play by different rules.</strong> Same model, very different blast radius, and the tooling most companies have doesn&#8217;t know the difference.</p></li><li><p><strong>The practical control layer.</strong> Four things that let you absorb agent-created work without becoming the permanent bottleneck everyone routes through.</p></li><li><p><strong>The eval discipline most companies skip.</strong> A cheap way to know when an agent is ready for more autonomy and when to pull it back.</p></li><li><p><strong>Two prompts that build the documents.</strong> A private eval suite and a tiered action-class policy, both built from your own systems, not a template.</p></li></ul><p>Let me show you where this is headed, and what to do about it.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/ai-agents-platform-team-bottleneck">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Executive Briefing: Your AI vendor contract isn't built for a capacity crunch. 3 prompts to fix it before your budget meeting]]></title><description><![CDATA[Watch now | Microsoft plans to spend roughly $190 billion this year and still expects to run short on capacity.]]></description><link>https://natesnewsletter.substack.com/p/ai-big-tech-industrial-business</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/ai-big-tech-industrial-business</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Sun, 24 May 2026 15:01:01 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/198904623/5950571cc9cab46da002a29a37723b6e.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Microsoft plans to spend roughly $190 billion this year and still expects to run short on capacity. And Microsoft is not alone: across the four biggest hyperscalers, combined 2026 capital spending is on track to approach $700 billion, nearly double what they spent in 2025.</p><p>The fiscal third-quarter 2026 numbers fill in the picture: Microsoft reported $31.9 billion in capital expenditures for the quarter and guided to more than $40 billion in the next. About two-thirds of that quarterly spend went to short-lived assets, primarily GPUs and CPUs. Even after all of it, the company expects to stay capacity constrained at least through the year.</p><p>No classic software company looks like that. A classic software company writes code once and sells it many times, and its biggest worry is talent or distribution, not whether it can physically build enough capacity to meet demand. The cloud was supposed to make infrastructure someone else&#8217;s problem.</p><p>AI breaks that abstraction.</p><p>The most important thing to understand about the current AI buildout is that tokens are not magic. They are manufactured. Every answer from a model is the output of a physical production system that consumes chips, high-bandwidth memory, advanced packaging, substrates, optics, power, cooling, land, data-center construction, networking, and operations talent. When it works, a user sees a paragraph, a line of code, a summarized contract, or an agent completing a task. Behind the screen, a factory is turning electricity and silicon into intelligence.</p><p>This is why AI is turning big tech into an industrial business.</p><p>Calling AI industrial is not new. Mary Meeker <a href="https://natesnewsletter.substack.com/p/i-summarized-mary-meekers-incredible">built a 340-slide deck around it</a>. Jensen Huang says a version of it on every NVIDIA earnings call. What is new is what Microsoft&#8217;s number does to the contract sitting on your desk. Six months ago, an AI vendor agreement was structured like a software agreement. Now that the hyperscalers are spending at this scale and still rationing capacity, your AI vendor agreement is a supply contract in everything but name. It has allocation. It needs capacity terms. It needs a fallback. None of that was a line item a year ago.</p><p>If that sounds like a problem for the CFO and no one else, consider that my own token spend ran close to 500 million tokens last week. Multiply that across a team and the capacity question stops being abstract.</p><p>The visible product is still software. ChatGPT, Copilot, Gemini, Claude, Meta AI, and Bedrock all look like applications or APIs, but the constraint underneath them is physical. The strategic question is no longer only, &#8220;Who has the best model?&#8221; It is, &#8220;Who can operate the factory that produces intelligence at scale?&#8221;</p><p><strong>This briefing covers:</strong></p><ul><li><p><strong>The bill of materials behind every token.</strong> Chips, memory, packaging, power, cooling, construction, and how to tell which one stops your vendor cold before it stops you.</p></li><li><p><strong>Why your vendor contract is now a supply contract.</strong> What to actually ask for when the cloud you buy from is competing with you for the same chips: allocation, fallback, reserved capacity, written down.</p></li><li><p><strong>Your CFO is about to inherit a capital cycle.</strong> Why utilization is suddenly the metric that matters, and why a 40% throughput gain beats a new data center.</p></li><li><p><strong>Seats are the wrong unit, and almost every plan still uses them.</strong> How to forecast in tokens instead, and why an agent that runs for hours belongs in a different budget line than a chatbot that answers questions.</p></li><li><p><strong>The one question that decides whether AI helps your margins or wrecks them.</strong> It is not whether tokens are expensive. It is something more specific, and most companies have never asked it.</p></li><li><p><strong>Three instruments to run before the budget meeting.</strong> A vendor audit that maps the supply chain under your contract and hands you the language to demand, a forecasting model that sizes demand in tokens instead of seats, and a routing diagnostic that finds the workflows burning premium inference on cheap work.</p></li></ul><p>Microsoft has already put $190 billion behind this view of the world. Most companies have not put it into their AI plans at all. What follows is what changes once you do.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Executive Circle members enjoy all these Sunday briefings! Curious? You can easily <a href="https://support.substack.com/hc/en-us/articles/360044105731-How-do-I-change-my-subscription-plan-on-Substack">change your plan here</a></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/ai-big-tech-industrial-business">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Build the room before you write the memo. Grab the 4-prompt project room kit: source inventory, duplicate log, missing-context list, grounded draft.]]></title><description><![CDATA[Watch now | The first useful agent workflow is not generation. It is getting the work surface into shape.]]></description><link>https://natesnewsletter.substack.com/p/ai-organize-files-before-writing</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/ai-organize-files-before-writing</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Fri, 22 May 2026 13:02:29 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/198744305/3b9e8a890059412b8e42c6ce7d5d797b.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>When AI produces a mediocre draft from a messy folder, the prompt is almost never the problem. The room is.</p><p>The model has been handed a strategy doc, two slightly different versions of the operating plan, a transcript with two meetings in it, and a deck that no longer matches reality. It is asked to write a memo. To do that, it has to do two jobs at once: figure out what the project actually is, then produce the artifact. The first job is the hard one. The second job is the one that shows up in the draft.</p><p>A sharper prompt won&#8217;t fix this. You need to prepare the room first.</p><p>I recently worked on a project where the real work did not live in one place. A strategy doc, meeting transcripts, a budget spreadsheet, trip-planning notes, org-design drafts, old PDFs, follow-up emails, half-finished notes. Some clearly current. Others superseded. A few useful only because they showed how the thinking had changed.</p><p>The useful first prompt was much more boring than &#8220;write the plan.&#8221; It was something like: help me build the room. Find the relevant materials. Preserve the originals. Make an inventory. I needed to know which files were authoritative, which were duplicates, which were old, which were missing. I asked it to summarize each source before synthesizing across them, and explicitly told it not to write the final deliverable yet.</p><p>Only after that did the writing prompt become simple. Use the current operating plan for the numbers, the transcript for decision context, the older PDF only as background, and flag unsupported claims rather than smoothing them over. The room made those distinctions visible before the writing started.</p><p>This kind of workflow was not really available a year ago. Agents could draft, summarize, and answer questions, but they were uneven at walking a folder tree, opening files in sequence, comparing dates across documents, and inspecting metadata without losing the thread. In the last few months that has changed. The current generation of agents is good at the small, boring, file-level operations the work actually requires. Which means the bottleneck has moved. It is no longer &#8220;can the model produce the artifact.&#8221; It is &#8220;is the source set in shape for the model to do anything useful with it.&#8221;</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>What is an AI project room.</strong> What a bounded workspace looks like for one serious job, and which tools to use for which source types.</p></li><li><p><strong>Why AI fails with messy source files.</strong> Why serious work fails when you skip the preparation step and jump straight to generation.</p></li><li><p><strong>How to build an AI source inventory.</strong> How to build the artifact that makes everything downstream inspectable.</p></li><li><p><strong>Summaries, duplicates, and missing context.</strong> The three preparation layers that prevent bad synthesis before it starts.</p></li><li><p><strong>The writing prompt, once the room exists.</strong> What changes when you draft from a clean work surface instead of a raw file dump.</p></li><li><p><strong>Grab the four prompts.</strong> A room-builder for file-system tools, an inventory-and-audit for uploaded docs, a grounded-draft prompt that cites every claim back to a source, and a refresh prompt for when new files arrive.</p></li></ul><p>Let&#8217;s build the room.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/ai-organize-files-before-writing">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[68% of AI power users do one thing differently — and it is not a prompt trick]]></title><description><![CDATA[Watch now | Working with AI agents makes you a better communicator]]></description><link>https://natesnewsletter.substack.com/p/ai-agents-better-communicator</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/ai-agents-better-communicator</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Thu, 21 May 2026 13:03:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!5Kso!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbff16dcf-3ac3-4d48-ad85-1d974ac49a53_1254x1254.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Working with AI agents makes you a better communicator</p><p>Most people are still prompting like it&#8217;s November, and the model on the other end stopped being the same machine months ago.</p><p>Opus 4.7 and GPT-5.5 landed in the last two months. The agents built on top of them now run for hours on a single request &#8212; they edit files, search archives, write code and test it, and come back with something close to a finished artifact. That capability jump is the largest single move I have watched in two years of doing this work. Our prompting has not moved with it.</p><p>When I look over people&#8217;s shoulders, I see the same kinds of requests I saw last September going into a system that is a hundred times more capable. The output comes back polished and mostly useless, and the person at the keyboard concludes the model is mediocre. That is almost never what is actually happening. The assignment is mediocre, and the model is just being honest about it.</p><p>The shift is from prompting to briefing. Last year, the best advice was to talk to AI like a careful junior employee who needed every step spelled out, every format specified, every structural decision made for them. That advice was right for the models we had, and those models needed it. The models we have now do not. A senior partner needs the goal, the context, the constraints, and the quality bar, and then they need room to push back. An agent running on 4.7 or 5.5 works the same way, and most of the leverage in the new models lives in that gap between what you tell a junior and what you tell a senior.</p><p>A prompt is something you type into a box. A useful question is the same thing done seriously &#8212; work made legible enough that another intelligence can act on it.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>The senior-partner shift.</strong> What actually changes about your request when you stop prompting and start briefing.</p></li><li><p><strong>The generic output trap.</strong> Why polished, useless AI work is almost always a mirror of the assignment, not the model.</p></li><li><p><strong>The six-field brief.</strong> The template that turns a vague ask into something an agent can run with for hours without drifting.</p></li><li><p><strong>The same brief, for humans.</strong> Why this makes you a clearer manager, a sharper colleague, and easier to work with than most of your team.</p></li><li><p><strong>Three prompts that earn their keep.</strong> The brief builder for when you do not know how to start, the thin-ask detector for the request you suspect is too vague to send, and the finish line for work that keeps almost being done.</p></li></ul><p>The brief and the prompts are below. So is the part most people miss: get good at this with the agent and you get better with the humans too.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/ai-agents-better-communicator">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Seven questions decide whether your AI agent ships. Most teams can answer two.]]></title><description><![CDATA[Watch now | The model is one piece of the agent economy. The control layer is the other, and most proposals on a CIO's desk have no answer for it.]]></description><link>https://natesnewsletter.substack.com/p/agent-infrastructure-control-layer</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/agent-infrastructure-control-layer</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Wed, 20 May 2026 13:01:54 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!W9oW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8db0e815-0c67-48b0-abee-0f9811dffa6c_1254x1254.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A new class of company has taken up position around the agent economy, and they are the ones who decide whether your agent gets to ship. They do not build models. They are not on most teams&#8217; AI stack roadmap. But every serious production agent has to pass through them, and most agent proposals on a CIO&#8217;s desk right now have no answer for what those companies are about to ask.</p><p>The model is one piece of the agent economy. The control layer is the other.</p><p>The control layer is the set of infrastructure decisions that determine whether a model&#8217;s output is allowed to act in the world. Where does the agent live. What state does it remember. Who is it acting for. When does it need approval. What can it spend. Who can stop it. None of those questions get answered by a model. They get answered by the companies sitting between the model and your production system, and the last six weeks have made it obvious who they are. Cloudflare ran Agents Week. Stripe expanded its Agentic Commerce Suite. Okta launched Okta for AI Agents and expanded it again this month. Auth0 has been publishing AI Agents docs. Datadog has been turning LLM observability into something that looks a lot more like an agent control plane than a logging product.</p><p>I&#8217;ve covered the protocol stack already &#8212; <a href="https://natesnewsletter.substack.com/p/agent-protocol-stack-mcp-a2a">the six protocols that emerged and the three that decide which agents survive</a>. This piece is about the operators sitting above the protocols. The companies turning agentic behavior into controlled, permissioned, auditable infrastructure. The companies your security review is about to discover the hard way.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>The seven-row control map</strong> I would put in front of any agent proposal before it touches a production system, and the one question per row that flushes out the gap.</p></li><li><p><strong>Why Cloudflare, Auth0, Snowflake, Stripe, and Datadog</strong> are becoming the operating system of the agent economy, and what each one is actually trying to own.</p></li><li><p><strong>The kill switch most teams do not actually have</strong>, even though they think they do &#8212; and the five layers it has to be implemented at to be real.</p></li><li><p><strong>A real data leader&#8217;s failure mode</strong>, live this quarter: her agents are routing around the human permission structure, and she now owns three questions she did not yet know how to answer.</p></li><li><p><strong>The three prompts paid subscribers get this week.</strong> A control-map fill-in for the agent your team is most likely to ship this quarter, a pressure-test for the next vendor pitch on your desk, and a five-layer kill-switch audit for the agent that actually scared you.</p></li></ul><p>The boring layer is where the power is moving. Let me walk you through the map.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/agent-infrastructure-control-layer">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Six agent protocols just launched. Three of them decide which products survive. Here is how to tell which three.]]></title><description><![CDATA[Six agent protocols launched in a year. Three form the core stack: MCP for tools, A2A for delegation, AG-UI for human control. Map your product to the right layers.]]></description><link>https://natesnewsletter.substack.com/p/agent-protocol-stack-mcp-a2a</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/agent-protocol-stack-mcp-a2a</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Tue, 19 May 2026 13:03:05 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!1tAn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc6357221-36f1-461d-8119-f40dd2caf69d_1254x1254.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Google I/O opens today, and most of the coverage will be about the demos. The more interesting story is the protocol race happening underneath them. Six new agent protocols have launched in the past year, and if you are building or buying agents right now, the standards layer reads like a scrum.</p><p>The instinct is to ignore the acronyms and wait for the big platforms to absorb everything behind product surfaces. The platforms will absorb a lot of it. But the protocols are not all trying to solve the same problem, and the substrate an agent is built on shapes the customer experience more than the model choice does. Read as a pile of competing standards, the picture stays blurry. Read as layers, three of them snap into focus as the foundation most builders will actually depend on.</p><p>Three protocols are forming the core agent stack: MCP for the tools and data an agent can reach, A2A for the other agents it can delegate to, and AG-UI for the controls a human needs to stay in the loop while long-running work is happening. Those three answer the only three questions every real agent system hits within its first week of existence. What can the agent use. Who else can the agent work with. How does the human stay in control. A2UI, AP2, and x402 matter, but they sit in layers where product requirements, trust boundaries, payment rails, and platform incentives are still being negotiated. Treating those three as equal bets with the first three is the most common mistake I see in agent strategy decks right now.</p><p>Builders who try to bet on all the layers at once end up paralyzed. Builders who ignore the layer map ship agents that fail at the boundaries that actually matter, like security on tool access, approval on long-running work, and supervision when the agent crosses company lines. Buyers who cannot read the layer map cannot evaluate what they are actually purchasing when a vendor says the word &#8220;agent.&#8221; The map is the closest thing the industry has to a shared vocabulary, and the next twelve months of agent product strategy will run through it.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>The protocol map.</strong> A single table that places all six protocols in the layer they actually occupy, so the acronyms stop blurring together.</p></li><li><p><strong>The core stack: MCP, A2A, AG-UI.</strong> Why these three are converging into the universal foundation, what each one is really for, and where each one fails if treated as a feature toggle instead of a boundary.</p></li><li><p><strong>Why payments are not one protocol.</strong> AP2 and x402 solve different problems, the payment space carries hidden assumptions about geography and authorization, and &#8220;agent can pay&#8221; is not a button &#8212; it is an audit problem.</p></li><li><p><strong>Map, draft, audit, brief.</strong> Map any real workflow to the layers that matter, draft the Agent Card boundary another team will integrate against, audit a workflow for the human controls it&#8217;s missing, and produce the strategy brief for the leader making the platform bet &#8212; four prompts, one per job, with renewal prep as the worked example threaded through.</p></li></ul><p>The acronym pile will keep growing. The layer map will not.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/agent-protocol-stack-mcp-a2a">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[What ChatGPT sees when it looks at your company + 3 diagnostics]]></title><description><![CDATA[Watch now | If I were walking into a marketing role in 2026, I would not start by asking which AI tools the team uses.]]></description><link>https://natesnewsletter.substack.com/p/marketing-humans-and-agents-2026</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/marketing-humans-and-agents-2026</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Mon, 18 May 2026 13:02:18 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/198159118/150602b5224a225410ea1e0281a00740.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>If I were walking into a marketing role in 2026, I would not start by asking which AI tools the team uses.</p><p>The content workflow is not where I would start. Neither is the lifecycle automation stack, and I would not waste an early conversation asking whether the team is on ChatGPT, Claude, Jasper, Canva, Midjourney, Clay, HubSpot Breeze, Salesforce Agentforce, or some agentic Frankenstein the growth team duct-taped together over a long weekend. Those questions answer themselves once you know what the team is responsible for.</p><p>I would start with a more basic question, the kind that determines whether everything downstream is sane:</p><p><em>Does this company understand that marketing now has two audiences?</em></p><p>Most leadership teams think they already know the answer. Almost none of them have internalized what it means. Marketing now serves two audiences at once. It serves the humans it has always served, and the agents that increasingly read, summarize, compare, and recommend on behalf of those humans before any human decision gets made. Companies that miss the split lose on both sides.</p><p>The losses are already showing up in the data. In a March 2026 survey of 1,076 B2B software buyers, sixty-nine percent reported choosing a different vendor than they originally planned because of guidance from an AI chatbot, and roughly a third bought from a vendor they had never heard of before the assistant surfaced it. That is not the work of better marketing. It is the work of a comparison system that read the public surfaces, weighed them against each other, and produced a shortlist &#8212; one that often did not include the buyer&#8217;s original preferred vendor.</p><p>I&#8217;m starting a series on what I would do walking into a job today across the functions AI is reshaping. Marketing first, because I spent a long stretch of my career in and around it. Most of the AI-marketing conversation right now is about tools and adoption curves. This piece is about what the job has to become.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>The two audiences, and why most companies serve only one.</strong> What agents need is not a better tagline. It&#8217;s legibility, and legibility is a different job than persuasion.</p></li><li><p><strong>The &#8220;make more stuff&#8221; trap.</strong> Why an AI strategy built around content velocity puts your career on the most commoditized layer of the role.</p></li><li><p><strong>The truth layer.</strong> Why marketing has to become the steward of the company&#8217;s claims, proof, and product reality, not the decorator of decisions made elsewhere.</p></li><li><p><strong>Why AI-washing is a trust-debt loan.</strong> What happens when companies (and candidates) stretch their AI story past what they can defend, and why the agents catch it first.</p></li><li><p><strong>What to look for in a marketing role in 2026.</strong> The questions to ask before you take the job, and the surfaces marketing has to be allowed to touch.</p></li><li><p><strong>The AI-washing audit.</strong> A skill pack for Claude Desktop, Claude Code, ChatGPT, and Codex that builds your company&#8217;s truth layer from the ground up &#8212; diagnosis, claims-and-evidence map, AI-washing risk register, and a prioritized fix list.</p></li></ul><p>Let me show you what the split looks like, why most companies are sitting on the wrong side of it, and what the marketers who win the next decade will be doing differently.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/marketing-humans-and-agents-2026">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Executive Briefing: Stop asking if AI can do this. Start asking what shape the work is.]]></title><description><![CDATA[Watch now | Every serious AI conversation eventually turns into the same practical question.]]></description><link>https://natesnewsletter.substack.com/p/build-buy-hire-wait-ai-matrix</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/build-buy-hire-wait-ai-matrix</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Sun, 17 May 2026 15:02:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!xQuX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5ba3d118-fa6a-4f02-800c-7e535d181ee8_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Every serious AI conversation eventually turns into the same practical question. Should we hire someone for this? Should we automate it? Should we buy a tool? Should we build the workflow ourselves? Or should we wait because the models are changing so fast that anything we build today may be obsolete in six months?</p><p>Most teams treat this as an AI question. That ends up costing time, money, and the work itself. It is a work-shape question. The right answer depends less on how impressive the model demo looked and more on the structure of the work in front of you: how often it repeats, how costly a mistake is, how much judgment it needs, and whether near-term model improvement is about to collapse what you are about to build.</p><p>Shopify gave the market one version of this question when Tobi L&#252;tke told teams they had to show why they could not get something done with AI before asking for more headcount. It stopped hiring from being the default answer to every capacity problem. But &#8220;can AI do this?&#8221; is the wrong question to stop at.</p><p>Executives are running a capital allocation problem, not a technology question. How you allocate capital has always defined what a firm can accomplish, and the upside variance on AI investment right now is wider than most leaders have ever priced for. Pick the wrong motion for a workflow and the cost is not only the wasted spend. It is also the upside you never captured because the capital landed in the wrong place.</p><p>The wrong answer is expensive in both directions. If you hire against work that AI can already handle, you build a cost structure around disappearing scarcity. If you automate work that depends on trust and judgment, you break the business process at the point where the human mattered most. If you buy a generic tool for company-specific work, you spend months fighting the product. If you custom-build something the market has already solved, you burn scarce builders on infrastructure that should have been a line item. If you wait on a workflow that is already stable and costly, you let delay masquerade as prudence.</p><p>Gartner has put a number on it: more than 40% of agentic AI projects are forecast to be canceled by the end of 2027 because of cost, unclear business value, or inadequate risk controls. Classify the work before the spending starts.</p><p><strong>This briefing covers:</strong></p><ul><li><p><strong>The decision starts with the work.</strong> A six-dimension scoring framework that routes each workflow to the right investment motion.</p></li><li><p><strong>When to automate, build, buy, hire, or wait.</strong> Real company examples (IBM, Klarna, Stripe) showing how the shape of the work determines the answer.</p></li><li><p><strong>The matrix.</strong> A two-axis visual that maps market maturity against company specificity, with named examples in every cell.</p></li><li><p><strong>The executive job is changing.</strong> Why routing logic is the new leadership skill, and what happens when executives and builders have the conversation together.</p></li><li><p><strong>Four prompts that route AI investment.</strong> A decomposer that turns a function into scoreable workflows, a scorer that writes the budget memo, a pressure test that forces three counter-arguments before capital commits, and a describability gate that holds automation projects until eight fields are filled.</p></li></ul><p>Classify the work first. The investment motion follows.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Executive Circle members enjoy all these Sunday briefings! Curious? You can easily <a href="https://support.substack.com/hc/en-us/articles/360044105731-How-do-I-change-my-subscription-plan-on-Substack">change your plan here</a></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/build-buy-hire-wait-ai-matrix">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Exclusive: a conversation with Tibo from Codex on what your company has to become when the model can actually do the work]]></title><description><![CDATA[Watch now | Between the launch of the new Codex and GPT-5.5 and now, something happened in my own house that has stayed with me more than any benchmark.]]></description><link>https://natesnewsletter.substack.com/p/codex-five-leadership-chairs-tibo-interview</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/codex-five-leadership-chairs-tibo-interview</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Sat, 16 May 2026 14:03:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!6t_T!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b9d57a2-dc48-4532-90e0-385284a0b901_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Between the launch of the new Codex and GPT-5.5 and now, something happened in my own house that has stayed with me more than any benchmark. My wife, who is not an engineer, built and shipped a working full-stack app. She is using GitHub for the first time. That is one anecdote, not a trend, and I am wary of overreading it. But it is the cleanest signal I have for what the April release actually did. The model can now carry the work, and the surface area of who can ship working software has widened far enough that the question of where human judgment lives inside a company stops being a developer question and starts being a leadership one.</p><p>I sat down with Tibo, who leads Codex at OpenAI, to ask what changes for companies now that the model can do the work. I&#8217;ve written about this a few times since GPT-5.5 and Codex dropped in late April: the bottleneck has moved twice. The first move was from &#8220;the model can&#8217;t do the work&#8221; to &#8220;the model can&#8217;t do the work the way our team would do the work&#8221; &#8212; the workflow-packaging problem, which I covered last week. The second move does not land in a workflow file at all. It lands in five different leadership chairs across the company, and each of those chairs has to develop a new instinct that almost nobody is teaching. Our conversation kept circling back to a single organizing point: the model is good now, and the question that matters has shifted to where you put the human judgment around it. What follows is my attempt to write down the takeaways from that conversation and push the framing further than we got to in the room.</p><p>I&#8217;ve been thinking about what happens to companies that skip this layer. Some will over-restrict to the point that the agents are useless and the team works around them. A smaller number will under-restrict and end up with an incident that turns into a board-level event. The companies that do the quiet work of building the five layers will look unremarkable for two quarters and then will be impossible to catch. Watching who joins that last group is going to be one of the more interesting things to track over the next year.</p><p>I&#8217;m going to walk through a practitioner template that&#8217;s already running this way, the five chairs, and the work each one has to do this quarter. Let&#8217;s go.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/codex-five-leadership-chairs-tibo-interview">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The 2 prompts I'd run before any 2026 SaaS renewal (especially if you're deploying agents)]]></title><description><![CDATA[Watch now | The seat is not dead. It is being wrapped in a meter for delegated work.]]></description><link>https://natesnewsletter.substack.com/p/saas-agent-license-renewal</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/saas-agent-license-renewal</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Fri, 15 May 2026 13:02:32 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Wycm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff495bcae-894d-4d1c-a65d-70166b77333e_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>For two decades, SaaS pricing pulled off a beautiful trick: it turned work into seats. Ten reps in the CRM meant ten seats. Fifty reps in the help desk meant fifty seats. The model was legible because the human was the unit of software value. A person logged in, clicked, updated records, sent messages, and the vendor charged for that person.</p><p>That model just cracked.</p><p>Salesforce booked $800 million in agent revenue last quarter, up from $540 million the quarter before. Its CRO Miguel Milano told analysts, &#8220;We have found the formula to monetize AI.&#8221; Microsoft added a separate $15-per-user license for agent governance, sitting alongside the $30 Copilot seat. SAP put hard limits on which agents are even allowed to call its APIs. ServiceNow, Workday, Zendesk, HubSpot, and Atlassian each added their own meter under their own name.</p><p>Your next SaaS renewal will price two things: who logs in, and what work moves through the system. If you walk in counting seats, you&#8217;ll sign for the headcount you already have and pay for the agent work on top. Once usage is embedded in customer workflows, support deflection numbers, and the finance close, the negotiating position flips. The vendor knows the work has moved. You know turning it off would hurt.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>The eight-vendor pattern.</strong> What each one calls its agent meter, and which one your renewal will hit first.</p></li><li><p><strong>The hybrid model most companies will land on.</strong> Why Microsoft&#8217;s three-layer stack is probably the template.</p></li><li><p><strong>Fair license vs. rent-seeking.</strong> Nine traits that separate a defensible meter from a vendor capturing the AI efficiency.</p></li><li><p><strong>The negotiation list.</strong> What to ask at renewal, and the one question most vendors will try to dodge.</p></li><li><p><strong>Two prompts to run before you renew.</strong> A system touch map for builders before procurement reviews it for them, and a vendor-specific question sequence for CFOs walking into renewal.</p></li></ul><p>But first, the seat stays &#8212; even as the meter changes. Then we&#8217;ll look at how each vendor is pricing the work, and what to ask before you sign.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/saas-agent-license-renewal">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Six things have to be true before AI changes a workflow. Most companies have built two.]]></title><description><![CDATA[Watch now | The interesting thing about Anthropic&#8217;s new enterprise AI services company isn&#8217;t the services part.]]></description><link>https://natesnewsletter.substack.com/p/enterprise-ai-deployment-layer</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/enterprise-ai-deployment-layer</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Thu, 14 May 2026 13:02:41 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!nj_1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fc92b99-fbfe-47cd-8bf6-243a8fdd8794_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The interesting thing about Anthropic&#8217;s new enterprise AI services company isn&#8217;t the services part. Enterprise software has always needed implementation help: cloud migrations, ERP projects, Salesforce rollouts, the forward-deployed engineering Palantir made famous. What&#8217;s new is the target. Anthropic is aiming this venture at mid-sized businesses: the segment with enough operating complexity to benefit from frontier AI, but rarely with enough internal engineering to turn it into working systems.</p><p>That target tells you something. The hard part of enterprise AI is no longer buying access to a powerful model. Any company can approve ChatGPT Enterprise, Claude, or Gemini, buy seats, call an API, and produce impressive internal demos. None of that proves the company has actually changed how support tickets move, how invoices close, how compliance reviews happen, or how customers get served. Value shows up when the model has a specific role in a specific workflow, with the right data, permissions, review process, and success metric. That work is what most companies haven&#8217;t built. In the last few months, agents have gotten reliable enough at running entire workflows that the distance between companies that have built it and companies that haven&#8217;t is starting to compound.</p><p>That&#8217;s why Anthropic, OpenAI, Blackstone, Hellman &amp; Friedman, and Goldman Sachs are all making moves right now. The implementation layer has become the strategic layer in enterprise AI. There are trillions of dollars in workflow value waiting on whoever figures this out first, and the companies that already understand this are about to pull further ahead.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>What&#8217;s actually new about the Anthropic deployment company.</strong> Why the mid-market target and PE backing signal a shift from model sales to deployment capacity &#8212; and what private equity sees that the market doesn&#8217;t.</p></li><li><p><strong>What &#8220;implementation architecture&#8221; actually means.</strong> The technical and operational work that separates AI experiments from production workflows.</p></li><li><p><strong>The risk: services that don&#8217;t turn into product.</strong> When field work compounds into reusable assets versus when it stays bespoke.</p></li><li><p><strong>What this means for startups, buyers, and builders.</strong> Where narrow, deep ownership beats generic AI productivity plays &#8212; and the specific move each one should make next.</p></li><li><p><strong>The implementation architecture audit.</strong> A prompt that scores your AI product against the six components, tells you whether you own a workflow or decorate a model, and surfaces the two questions that will end your next enterprise deal.</p></li></ul><p>Let me show you how the pieces connect, and what the next phase of enterprise AI actually requires.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/enterprise-ai-deployment-layer">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Your AI agent is rediscovering 85% of its context every run. Here's the architecture fix (+ Contract Spec, Failure Triage, and Stack ADR)]]></title><description><![CDATA[Watch now | There&#8217;s a debate going on right now about whether vector search is obsolete.]]></description><link>https://natesnewsletter.substack.com/p/rag-agents-knowledge-layer-architecture</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/rag-agents-knowledge-layer-architecture</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Wed, 13 May 2026 13:03:32 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/197424499/196b2ecde280d636da053bc1965b22d3.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>There&#8217;s a debate going on right now about whether vector search is obsolete. That&#8217;s the wrong layer to be arguing about. The agents I&#8217;m watching fail in production aren&#8217;t failing because the retrieval method is wrong &#8212; they&#8217;re failing because the retrieval system can&#8217;t assemble what the agent actually needs before it starts acting.</p><p>A vector database can find text semantically related to a question. Useful, but nowhere near enough. Agents need the current account record, the user&#8217;s permissions, the controlling policy, the right section of a long document, the table behind a metric, the prior decision from a meeting, and the source trail that lets a human reviewer reconstruct why the agent did what it did. When the system doesn&#8217;t prepare that context, the model improvises &#8212; and the cost shows up everywhere except the place you&#8217;re looking. Wrong refunds get issued. Stale policies get cited. Outdated metrics make it into board decks. The agent burns tokens and wall-clock time rebuilding context every run, and when the answer finally lands, it lands confidently &#8212; which is the most expensive way to be wrong. That&#8217;s the new RAG problem &#8212; not a retrieval problem, an assembly problem.</p><p>So the next move isn&#8217;t &#8220;vectors versus something else.&#8221; It&#8217;s that vector search is quietly getting demoted from the whole architecture to one component inside a broader knowledge layer for agents &#8212; a layer that includes retrieval, but also document structure, semantic data models, access control, provenance, memory, and write-back. I want to be careful not to overstate this: vector search isn&#8217;t going anywhere. But the conversation about where the real work happens has moved.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>Why classic RAG worked &#8212; and where it stops working.</strong> How to spot the moment your retrieval architecture became the bottleneck instead of the solution.</p></li><li><p><strong>What Pinecone, PageIndex, SAP, and Dremio are all saying.</strong> Four different companies, one shared shift in what &#8220;retrieval&#8221; actually means for agents.</p></li><li><p><strong>The practical architecture.</strong> Seven questions to test whether your knowledge layer can support a production agent.</p></li><li><p><strong>What could go wrong.</strong> Where this new architecture quietly breaks, and how to tell if you&#8217;re overbuilding.</p></li><li><p><strong>How to put this to work.</strong> A Retrieval Contract Spec, a Failure Triage, and a Stack ADR: paste-ready artifacts for the three states a builder hits when working on retrieval.</p></li></ul><p>Let&#8217;s walk through how this shift is playing out across the stack, and what it means for how you build. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/rag-agents-knowledge-layer-architecture">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Six layers your agent has to handle. Most products have only thought about two. + a responsibility-layer audit.]]></title><description><![CDATA[Watch now | For most of the internet&#8217;s history, a purchase has been a human action that everyone in the chain could see.]]></description><link>https://natesnewsletter.substack.com/p/agentic-commerce-protocol-war</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/agentic-commerce-protocol-war</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Tue, 12 May 2026 13:03:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!HTab!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8be3b75d-6a62-4aa6-9358-df28cc3660af_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>For most of the internet&#8217;s history, a purchase has been a human action that everyone in the chain could see. A person clicked a button on a merchant&#8217;s site. A payment credential moved through a processor. A network or wallet weighed the risk. The merchant took responsibility for the order. The records were imperfect, but everyone agreed on the shape of the evidence: a human was present, a page was shown, a credential was used, a final action was taken.</p><p>Agentic commerce breaks that structure.</p><p>Software is starting to hold wallets, sign authorizations, and pay merchants directly. Some of those agents are spending your money. Some are spending your company&#8217;s. Some are spending your customers&#8217;. When one of them sends money to the wrong place &#8212; and one of them will &#8212; someone is going to be left holding the bag. Protocol camps are fighting over who that someone is.</p><p>The question stops being &#8220;can the customer pay?&#8221; and becomes &#8220;how does everyone know the agent was allowed to do what it just did?&#8221; That question reaches well past checkout. It touches identity, authorization, fraud, payment credentials, settlement, refunds, liability, data rights, and the merchant&#8217;s relationship with the customer. The old purchase bundle comes apart, and the market isn&#8217;t converging on a single replacement. It&#8217;s splitting into protocol camps, each owning a different piece of what used to live behind one click.</p><p>That&#8217;s the split. A fight over where commercial trust lives, not over a single button. And it will shape who keeps the customer, who carries the loss, and who gets to write the rules of online buying for the next decade.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>The first round of the fight is already over.</strong> OpenAI and Stripe shipped Instant Checkout in September. Five months later it was scaled back, while Shopify and Google&#8217;s counter-protocol gained ground.</p></li><li><p><strong>Authorization is not payment.</strong> The evidence layer agentic commerce needs, why it has to outlive the transaction, and the two routes Google and Stripe are taking to build it.</p></li><li><p><strong>The stablecoin case that holds up.</strong> Software paying software is a different business problem than a person buying shoes, and the rails should be different too.</p></li><li><p><strong>Where this lands and who owns which layer.</strong> Why AWS quietly matters most, and what every kind of builder, merchant, and operator should do now.</p></li><li><p><strong>Two prompts to expose your own gaps.</strong> A responsibility-layer audit that forces you to name who owns each piece of an agentic purchase in your product, and an authorization spec that finance and legal will actually accept.</p></li></ul><p>The split started a year ago. Start with Instant Checkout: how OpenAI and Stripe launched it, and why OpenAI walked it back.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/agentic-commerce-protocol-war">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[You gave your AI agent real tools. Here's the 4-part control layer it's missing + the Judge Layer implementation guide]]></title><description><![CDATA[Watch now | The next serious agent failure won&#8217;t look like a jailbreak.]]></description><link>https://natesnewsletter.substack.com/p/agent-judge-layer-production-control</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/agent-judge-layer-production-control</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Mon, 11 May 2026 13:03:34 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/197051038/3cdca10cb90792bf48113c516ba99c92.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>The next serious agent failure won&#8217;t look like a jailbreak. It&#8217;ll look like an email sent because the thread seemed to imply approval, a customer record updated because the old value looked stale, a pull request opened because the tests passed and the change looked done. None of that requires the model to misbehave, which is what makes it hard. The risk starts where the product gets useful: when language turns into action.</p><p>A chat demo lives in suggestion space. The model drafts, summarizes, answers, proposes, and if it&#8217;s wrong, the user rejects it. The cost is local. A production agent lives closer to consequence: it can notify someone, expose private information, change a shared record, trigger a workflow, or spend money. That moves a question to the center of the product demos never had to answer: who decides whether the agent should be allowed to act?</p><p>A better prompt doesn&#8217;t really answer it. Telling the model to &#8220;be careful&#8221; doesn&#8217;t either. Approval modals technically reduce risk but ruin the workflow. Users either click through out of habit or stop using the system. The answer that&#8217;s actually working is architectural: a separate judge wrapped around the actor, deciding whether each proposed action should move forward. If you&#8217;re building agents that act, this is the layer of the product you cannot bolt on later.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>The Lindy example.</strong> How a multi-channel agent product hit the failure mode every production system eventually faces, and the architectural fix that worked.</p></li><li><p><strong>Why prompting and approval modals both fail.</strong> The structural reasons a single prompt can&#8217;t pursue a task and police it at the same time.</p></li><li><p><strong>Orchestration is not judgment.</strong> Why coordinating agents and judging their actions are different problems with different homes in the stack.</p></li><li><p><strong>The builder toolkit.</strong> Action classification, proposals, specialist judges, eval, memory governance, and what to build first.</p></li><li><p><strong>The OpenBrain Judge Extender guide + the prompt kit that builds your first judge.</strong> Five prompts that take you from &#8220;my agent acts&#8221; to a working judge at your highest-risk boundary, plus the full implementation spec for wiring that judge to durable memory, provenance, and structured write-back so it doesn&#8217;t start every session from zero.</p></li></ul><p>Start with the team that hit this wall publicly and figured out what to do about it.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/agent-judge-layer-production-control">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Executive Briefing: Six announcements in 48 hours just changed how enterprise AI gets bought (+ 2 prompts for the new process)]]></title><description><![CDATA[Watch now | Capital just moved. The question is whether the platform you&#8217;re buying can be built on.]]></description><link>https://natesnewsletter.substack.com/p/enterprise-ai-buying-build-room</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/enterprise-ai-buying-build-room</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Sun, 10 May 2026 15:01:17 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/196959127/88126e47ce018b49166ec1686e3f5d00.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>In 48 hours last week, six things happened in enterprise AI.</p><p>Anthropic announced a new enterprise AI services company with Blackstone, Hellman &amp; Friedman, and Goldman Sachs; press reports put the venture at roughly $1.5 billion. Bloomberg reported that OpenAI had raised more than $4 billion for a similar enterprise deployment venture backed by firms including TPG, Brookfield, Advent, and Bain. SAP said it would acquire Dremio and Prior Labs. Pinecone launched Nexus, a &#8220;compilation-stage knowledge engine&#8221; with the claim that 85 percent of agent compute is wasted on rediscovery. ServiceNow shipped Action Fabric at Knowledge 2026, opening its workflow engine to any external agent through MCP, with Anthropic as launch partner.</p><p>These were reported as separate stories. They are the same bet, and the bet is roughly $5.5 billion that capital is moving from buying the model to buying the build. What is being repriced is not intelligence. Intelligence is cheap and getting cheaper. What is being repriced is the surrounding infrastructure that lets an agent reach real data, act through real permissions, run real workflows, and stay inside audit boundaries at a cost the company can plan around. The frontier labs call it forward-deployed engineering. The platform vendors call it governed action. Whatever the label, it is what enterprise AI value depends on, and the people writing the checks have noticed it is more decisive than the model line item ever was.</p><p>If that sounds abstract, the concrete version already happened. In February, an autonomous agent built by CodeWall reached full read-write access on McKinsey&#8217;s internal AI platform, Lilli, in under two hours, through one of 22 unauthenticated API endpoints, on a system used by roughly 70 percent of the firm&#8217;s 43,000 consultants. The exploit was SQL injection, a vulnerability class from 1998. The story everyone told was about security. The story underneath was about procurement: a platform shipped without the technical voices in the room who would have caught what was on the wire. That is the build room. That is what the $5.5 billion bet is trying to fix.</p><p><strong>This briefing covers:</strong></p><ul><li><p><strong>Why most enterprise AI plans are running on the old buying sequence.</strong> Strategy upstream, implementation downstream &#8212; agents reverse that order, and the budget allocation is still pointed at the wrong layer.</p></li><li><p><strong>What the $5.5 billion concession means.</strong> The labs putting capital behind forward-deployed engineering is not a marketing posture. It is the bottleneck they are admitting they cannot solve from the model side.</p></li><li><p><strong>Why context, not tokens, is the line item ruining agent economics.</strong> And why capping usage kills the use case without fixing the cause.</p></li><li><p><strong>The new buying sequence, and where the next quarter&#8217;s capital should flow.</strong> Three changes that do most of the work &#8212; and the test that exposes whether a vendor&#8217;s roadmap can survive the build room.</p></li></ul><p>The next year of enterprise AI will not be won by the most ambitious roadmap. It will be won in the room where the roadmap meets the build.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Executive Circle members enjoy all these Sunday briefings! Curious? You can easily <a href="https://support.substack.com/hc/en-us/articles/360044105731-How-do-I-change-my-subscription-plan-on-Substack">change your plan here</a></p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/enterprise-ai-buying-build-room">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[OpenAI made Codex smart enough that the bottleneck moved. Most people haven't noticed where it went.]]></title><description><![CDATA[Watch now | Codex plugins matter because the bottleneck moved.]]></description><link>https://natesnewsletter.substack.com/p/codex-plugins-bottleneck-moved</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/codex-plugins-bottleneck-moved</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Sat, 09 May 2026 14:02:36 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/196961228/0bc1428499af79ee3a2a08b689906b79.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Open a new Codex thread and you are the operating system.</p><p>You explain the repo. You paste the standard. You point at the docs. You list the tools. You list the failure modes you do not want the agent to repeat. By the time it is ready to act, you have done a real chunk of the work yourself, just to get the work started.</p><p>That part did not get better with GPT-5.5. The model did. Codex is now at 82.7% on Terminal-Bench 2.0, up from 75.1%, and the lift you actually feel is bigger than the number reads, because the model can now stay inside long, multi-tool tasks without losing the thread. It reviews pull requests against your standards, builds screens from Figma comps, runs tests in a browser, pulls context across Slack, Drive, GitHub, and Linear, and drafts release notes from the diff. The model is good now.</p><p>The work around it is not.</p><p>That is the bottleneck. The workflow lives in your head, and you reload it every thread. From here, the work has to meet the model halfway.</p><p>That is what plugins are for. A skill says how the work should be done. A plugin packages that skill with tool access, live integrations, deterministic checks, the team&#8217;s failure modes, and the parts of the standard nobody has written down. Once installed, the agent stops needing you to be the OS.</p><p>The stakes are not subtle. A stronger model with a vague environment does not give you more help. It gives you faster, more confident wrongness. Reviews that miss the team&#8217;s review standard. Release notes that drift into engineering language. Customer summaries that mix admin material into the team-facing recap. Each looks fine alone. Together they make a company that runs faster and means less.</p><p>The career version is sharper. The next competitive skill is not writing the longest prompt. It is knowing which parts of your work should become reusable infrastructure. Two years from now, the people who learned to package will be compounding. Everyone else will be explaining the workflow on Tuesday morning.</p><p>Here&#8217;s what&#8217;s inside:</p><ul><li><p><strong>The bottleneck that GPT-5.5 made visible.</strong> Why a stronger model with a vague environment gives you faster wrongness, not more help.</p></li><li><p><strong>The decision ladder.</strong> When to stay with a prompt, when to build a skill, when to package a plugin, and when not to bother.</p></li><li><p><strong>Which workflows to package first.</strong> Five categories worth the investment, and a test for whether yours qualifies.</p></li><li><p><strong>Grab The Ultimate Codex Plugin Guide + prompts.</strong> The full step-by-step build guide from skill file through plugin manifest and debugging checklist, plus seven prompts that take you from workflow audit to installed, tested plugin.</p></li></ul><p>Let me show you how the bottleneck moved, and what to do about it.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/codex-plugins-bottleneck-moved">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[271 bugs found in Firefox, zero written by a human attacker. What this means for the future of safe code + 2 prompts]]></title><description><![CDATA[Watch now | The best code you'll ever ship might not be written by a human. That's a good thing.]]></description><link>https://natesnewsletter.substack.com/p/ai-code-trust-verification-shift</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/ai-code-trust-verification-shift</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Fri, 08 May 2026 13:03:41 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!EJuq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7b11c0c-6abc-4623-9a7f-da48a5d0717b_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Mozilla&#8217;s Mythos experiment reads, at first, like a cybersecurity story. Anthropic built Mythos for vulnerability research. Mozilla pointed it at Firefox. The previous AI scan, run with a general-purpose model, surfaced 22 security-sensitive bugs. One release cycle later, with the purpose-built model, the number was 271. The Firefox security team &#8212; a group that has spent two decades being skeptical about new tools &#8212; published the result with the kind of careful enthusiasm that means something has actually changed.</p><p>The 271 is striking. The story underneath it is bigger, and it is not really about cybersecurity at all. Software has always rested on a quiet assumption that human-written code is the trust anchor and machines are there to check it. Mythos is the first serious sign that the assumption is about to flip. Serious software in the next era may be generated, attacked, repaired, and verified by machines, while humans hold a different role entirely: defining what the system is allowed to mean.</p><p>If that flip is real, the standard of trust rises faster than most teams are ready for. Hand-written code without adversarial machine review starts to look incomplete. Codebases that were merely messy start to look structurally unsafe, because the tools that could make them safer cannot operate on a system nobody can read. There is a window to get ahead of this &#8212; short, uneven, and closing.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>The inversion of authorship.</strong> Humans wrote, machines checked. Mythos starts to flip that. What it means when generated code becomes the more trusted version, not the less.</p></li><li><p><strong>When trust becomes scarce.</strong> Code is about to get cheap to produce and expensive to trust. The teams that win the next year are the ones that build for that gap on purpose.</p></li><li><p><strong>Comprehensibility as a security property.</strong> Why the next four to five months are a refactor window &#8212; and why teams that wait will discover their codebases are too tangled for the new tools to help.</p></li><li><p><strong>Where the timing lands.</strong> What this means if you are an individual contributor versus a team lead versus a CTO &#8212; and why the budget conversation is already late at the top of the org.</p></li><li><p><strong>Audit your readiness now.</strong> Two prompts that tell you whether your codebase is legible enough for the next generation of adversarial review tools, and whether your evals for AI-generated code are catching the right things.</p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/ai-code-trust-verification-shift">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[OpenClaw, Anthropic, and Gemma 4 just redefined what "agent framework" means. You need to pick a side.]]></title><description><![CDATA[Watch now | Anthropic restricted, OpenAI opened, Google shipped Gemma 4, and OpenClaw stopped being model-dependent. Things are getting spicy!]]></description><link>https://natesnewsletter.substack.com/p/openclaw-agent-runtime-model-swapping</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/openclaw-agent-runtime-model-swapping</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Thu, 07 May 2026 13:02:43 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!bA1X!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9e7e3a64-a148-4136-a24b-ef05614768fa_1024x1024.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In April, OpenClaw stopped being an agent harness and started being a runtime.</p><p>A month ago, the easy descriptions still fit. Chatbot wrapper, Claude launcher, viral terminal toy. By the end of April, none of them held. OpenClaw became an action layer for agents &#8212; a place where tasks, tools, memory, channels, permissions, subagents, and model choices come together into durable workflows. The model is no longer the product. The runtime is.</p><p>That shift matters because the model layer underneath it became more contested at exactly the same time. Anthropic pulled subscription-backed third-party usage back toward its own products. OpenAI moved the other way, opening ChatGPT and Codex subscription usage to OpenClaw users. Google shipped Gemma 4, built explicitly for agentic and on-device work. Local models got good enough for more background tasks. Claude is still valuable, but increasingly as a metered premium component rather than the default flat-rate engine for everything.</p><p>The mistake is to let the runtime fight swallow the product story.</p><p>A month ago, the builder ambition was to make an agent do something useful. Now it is to build a durable workflow once and swap the model underneath it. That is a much bigger idea, and it changes the builder opportunity completely.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>Why April was the threshold month.</strong> TaskFlows, channels, memory, and routing all matured at once, and the combination is what crossed the runtime line.</p></li><li><p><strong>The Anthropic move, in context.</strong> What changed, why it was rational, why the developer community reacted the way it did, and what it tells you about the next twelve months.</p></li><li><p><strong>The seven workflows you can now actually build.</strong> Repo operations, code review memory, incident response, customer feedback, meetings to execution, model-aware routing, and team memory across agents.</p></li><li><p><strong>Why memory has to live outside the model.</strong> If the brain is swappable, memory cannot live inside any one brain &#8212; and that has implications for how you architect anything you build on top of OpenClaw.</p></li><li><p><strong>The Open Brain Agent Memory launch.</strong> A live ClawHub skill, a plugin package, and four recipes &#8212; code review memory, TaskFlow work logs, the OpenClaw Agent Memory recipe, and a Safe Agent Memory contract that draws the line between evidence and instruction.</p></li></ul><p>The labs will keep fighting over the brain. The interesting question is what builders should do while they fight.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/openclaw-agent-runtime-model-swapping">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The next AI platform winner won't have the best model. They'll own something most companies don't even see yet.]]></title><description><![CDATA[Watch now | Why access without meaning is the most expensive mistake in AI right now.]]></description><link>https://natesnewsletter.substack.com/p/ai-work-primitives-access-vs-meaning</link><guid isPermaLink="false">https://natesnewsletter.substack.com/p/ai-work-primitives-access-vs-meaning</guid><dc:creator><![CDATA[Nate]]></dc:creator><pubDate>Wed, 06 May 2026 13:02:43 GMT</pubDate><enclosure url="https://api.substack.com/feed/podcast/196571944/e8cdaecd8da39e821b37277fc77f2727.mp3" length="0" type="audio/mpeg"/><content:encoded><![CDATA[<p>Agents can finally use computers like humans do &#8212; open browsers, click forms, move calendar invites, run commands. The spectacle is real. It is also a distraction.</p><p>Every AI product announcement in the next year will look like progress. Most will be progress on access &#8212; the agent can now reach one more thing. A smaller share will be progress on meaning &#8212; the system actually understands what it is doing. Demos make the two look identical. Six months into a real deployment they feel completely different: access-only products are still demanding constant supervision, and meaning-rich ones are quietly compounding. If you are building, buying, or betting on AI tools right now, the difference between those two categories is the most important call you will make this year.</p><p>Computer use exists because most software was designed for human interpretation &#8212; and a person knows that moving a calendar invite is not just changing a row in a database. It may notify attendees, alter someone&#8217;s preparation, break a commitment, or reschedule a meeting that took three weeks to set. The agent can infer some of that. Inference over a human interface is not the same as software exposing the meaning of the work directly. That distinction is the next platform fight.</p><p>Computer use gives agents reach. Semantic control gives them judgment. The long-term moat is not the ability to click the button. It is ownership of the layer that tells the agent what the button means.</p><p><strong>Here&#8217;s what&#8217;s inside:</strong></p><ul><li><p><strong>Why coding agents arrived first.</strong> The structural reason software development became the wedge &#8212; and what it tells you about which kinds of work agents will conquer next.</p></li><li><p><strong>The Stripe move most people misread.</strong> Why a structured payment token is strategically deeper than any agent that clicks checkout buttons.</p></li><li><p><strong>The trap Perplexity is trying to escape.</strong> Why moving from answering to operating is necessary but not sufficient.</p></li><li><p><strong>The Salesforce vs. SAP wager.</strong> Two opposite bets on agent-readability, and the buyer behavior that decides which one survives.</p></li><li><p><strong>The better product test.</strong> A single question to ask of every AI product announcement that cuts through the demo theater.</p></li><li><p><strong>Three diagnostic prompts.</strong> One to evaluate any AI product announcement against a ten-dimension semantic depth test, one to audit a tool already in your stack and decide whether to extend, wrap, replace, or wait, and one to run a structured post-mortem after an agent&#8217;s action succeeded but the outcome was wrong.</p></li></ul><p>What follows is how the divergence is unfolding &#8212; and what to look for in the products you are evaluating, building, or betting on.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://natesnewsletter.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribers get all posts like these!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>
      <p>
          <a href="https://natesnewsletter.substack.com/p/ai-work-primitives-access-vs-meaning">
              Read more
          </a>
      </p>
   ]]></content:encoded></item></channel></rss>