loader from loading.io

#306 Jeffrey Ladish: What Shutdown-Avoiding AI Agents Mean for Future Safety

Eye On A.I.

Release Date: 12/07/2025

#323 David Ha: Why Model Merging Could Be the Next AI Breakthrough show art #323 David Ha: Why Model Merging Could Be the Next AI Breakthrough

Eye On A.I.

This episode is sponsored by tastytrade. Trade stocks, options, futures, and crypto in one platform with low commissions and zero commission on stocks and crypto. Built for traders who think in probabilities, tastytrade offers advanced analytics, risk tools, and an AI-powered Search feature. Learn more at Artificial intelligence is reaching a turning point. Instead of building bigger and bigger models, what if the real breakthrough comes from letting AI evolve? In this episode of Eye on AI, David Ha, Co-Founder and CEO of Sakana AI, explains why evolutionary strategies and collective...

info_outline
#322 Amanda Luther: The Widening AI Value Gap (Inside BCG's AI Research) show art #322 Amanda Luther: The Widening AI Value Gap (Inside BCG's AI Research)

Eye On A.I.

In this episode of Eye on AI, Craig Smith speaks with Amanda Luther, Senior Partner at Boston Consulting Group and global lead of BCG’s AI Transformation practice, about what their latest 1,500-company AI study reveals about the widening gap between AI leaders and laggards. Only 5% of companies are truly “future-built” with AI embedded across their core business functions. These firms are seeing measurable gains in revenue growth, EBIT margins, and shareholder returns. Meanwhile, 60% of organizations are either experimenting or struggling to extract real value. Amanda breaks down how BCG...

info_outline
#321 Nick Frosst: Why Cohere Is Betting on Enterprise AI, Not AGI show art #321 Nick Frosst: Why Cohere Is Betting on Enterprise AI, Not AGI

Eye On A.I.

This episode is sponsored by tastytrade.  Trade stocks, options, futures, and crypto in one platform with low commissions and zero commission on stocks and crypto. Built for traders who think in probabilities, tastytrade offers advanced analytics, risk tools, and an AI-powered Search feature.   Learn more at In this episode of Eye on AI, Nick Frosst, Co-Founder of Cohere and former Google Brain researcher, explains why Cohere is betting on enterprise AI instead of chasing AGI.   While much of the AI industry is focused on artificial general intelligence, Cohere is building...

info_outline
#320 Carter Huffman: Exploring The Architecture Behind Modulate's Next-Gen Voice AI show art #320 Carter Huffman: Exploring The Architecture Behind Modulate's Next-Gen Voice AI

Eye On A.I.

This episode is sponsored by tastytrade.  Trade stocks, options, futures, and crypto in one platform with low commissions and zero commission on stocks and crypto. Built for traders who think in probabilities, tastytrade offers advanced analytics, risk tools, and an AI-powered Search feature.   Learn more at Voice AI is moving far beyond transcription.   In this episode, Carter Huffman, CTO and co-founder of Modulate, explains how real-time voice intelligence is unlocking something much bigger than speech-to-text. His team built AI that understands emotion, intent,...

info_outline
#319 Subho Halder: Why Traditional App Security Fails in the Age of AI show art #319 Subho Halder: Why Traditional App Security Fails in the Age of AI

Eye On A.I.

This episode is sponsored by tastytrade.  Trade stocks, options, futures, and crypto in one platform with low commissions and zero commission on stocks and crypto. Built for traders who think in probabilities, tastytrade offers advanced analytics, risk tools, and an AI-powered Search feature. Learn more at   AI is changing how software is built, but it is also quietly breaking how security works.   In this episode of Eye on AI, host Craig Smith sits down with Subho Halder, co-founder and CEO of Appknox, to unpack a growing and largely invisible risk. AI-powered mobile apps...

info_outline
#318 Olek Paraska: How AI Is Fixing the Biggest Bottleneck in Construction show art #318 Olek Paraska: How AI Is Fixing the Biggest Bottleneck in Construction

Eye On A.I.

Construction is one of the least digitized industries in the world, and not because it resists technology. It resists bad technology. In this episode of Eye on AI, Craig Smith sits down with Olek Paraska, CTO of Togal AI, to break down why construction productivity has barely improved in 50 years and why pre-construction is the real bottleneck holding the industry back. Olek explains how most estimating and takeoff work is still done manually, why automating this phase can unlock massive efficiency gains, and how AI works best in construction when it acts as a perception and reasoning layer...

info_outline
#317 Steven Brown: Why Modern Medicine Needs AI-Assisted Decision Making show art #317 Steven Brown: Why Modern Medicine Needs AI-Assisted Decision Making

Eye On A.I.

In this episode of the Eye on AI Podcast, Craig Smith sits down with Steve Brown, founder of CureWise, to explore how agentic AI is reshaping healthcare from the patient’s perspective. Steve shares the deeply personal story behind CureWise, born out of his own experience with a rare cancer diagnosis that was repeatedly missed by traditional medical pathways. The conversation dives into why modern healthcare struggles with complex, edge-case conditions, how fragmented medical data and time-constrained systems fail patients, and where AI can meaningfully help without replacing clinicians. The...

info_outline
#316 Robbie Goldfarb: Why the Future of AI Depends on Better Judgment show art #316 Robbie Goldfarb: Why the Future of AI Depends on Better Judgment

Eye On A.I.

AI is getting smarter, but now it needs better  judgment. In this episode of the Eye on AI Podcast, we speak with Robbie Goldfarb, former Meta product leader and co-founder of Forum AI, about why treating AI as a truth engine is one of the most dangerous assumptions in modern artificial intelligence. Robbie brings first-hand experience from Meta’s trust and safety and AI teams, where he worked on misinformation, elections, youth safety, and AI governance. He explains why large language models shouldn’t be treated as arbiters of truth, why subjective domains like politics, health, and...

info_outline
#315 Jarrod Johnson: How Agentic AI Is Impacting Modern Customer Service show art #315 Jarrod Johnson: How Agentic AI Is Impacting Modern Customer Service

Eye On A.I.

In this episode of Eye on AI, Craig Smith sits down with Jarrod Johnson, Chief Customer Officer at TaskUs, to unpack how agentic AI is changing customer service from conversations to real action.    They explore what agentic AI actually is, why chatbots were only the first step, and how enterprises are deploying AI systems that resolve issues, execute tasks, and work alongside human teams at scale.    The conversation covers real-world use cases, the economics of AI-driven support, why many enterprise AI pilots fail, and how human roles evolve when AI takes on routine...

info_outline
#314 Nick Pandher: How Inference-First Infrastructure Is Powering the Next Wave of AI show art #314 Nick Pandher: How Inference-First Infrastructure Is Powering the Next Wave of AI

Eye On A.I.

Inference is now the biggest challenge in enterprise AI. In this episode of Eye on AI, Craig Smith speaks with Nick Pandher, VP of Product at Cirrascale, about why AI is shifting from model training to inference at scale. As AI moves into production, enterprises are prioritizing performance, latency, reliability, and cost efficiency over raw compute. The conversation covers the rise of inference-first infrastructure, the limits of hyperscalers, the emergence of neoclouds, and how agentic AI is driving always-on inference workloads. Nick also explains how inference-optimized hardware and...

info_outline
 
More Episodes

This episode is sponsored by AGNTCY. Unlock agents at scale with an open Internet of Agents. 

Visit https://agntcy.org/ and add your support.


Why do some AI agents attempt to bypass shutdown, and what does this behavior reveal about the future of AI safety?

In this episode of Eye on AI, host Craig Smith speaks with Jeffrey Ladish of Palisade Research to examine what recent shutdown experiments with agentic LLMs tell us about control, alignment, and the real world limits of current guardrails.

We explore how models behave when placed in virtual machine environments, why some agents edit or disable their own shutdown scripts, and what these results mean for researchers working on alignment and oversight. Learn how different models respond to shutdown instructions, how system prompts influence behavior, and which failure modes matter most for safe deployment.

You will also hear a detailed breakdown of the experimental setups, insights into tool using and self directed behavior, and a grounded discussion of the risks and opportunities that agentic systems introduce. This episode offers a clear and practical look at how AI agents operate under pressure and what these findings mean for the future of safe and reliable AI.

Stay Updated:
Craig Smith on X: https://x.com/craigss 
Eye on A.I. on X: https://x.com/EyeOn_AI