
#314 Nick Pandher: How Inference-First Infrastructure Is Powering the Next Wave of AI

Eye On A.I.

Release Date: 01/17/2026


Inference is now the biggest challenge in enterprise AI.

In this episode of Eye on AI, Craig Smith speaks with Nick Pandher, VP of Product at Cirrascale, about why AI is shifting from model training to inference at scale. As AI moves into production, enterprises are prioritizing performance, latency, reliability, and cost efficiency over raw compute.

The conversation covers the rise of inference-first infrastructure, the limits of hyperscalers, the emergence of neoclouds, and how agentic AI is driving always-on inference workloads. Nick also explains how inference-optimized hardware and serverless AI platforms are shaping the future of enterprise AI deployment.

If you are deploying AI in production, this episode explains why inference is the real frontier.

Stay Updated:

Craig Smith on X: https://x.com/craigss

Eye on A.I. on X: https://x.com/EyeOn_AI



(00:00) Preview

(00:50) Introduction to Cirrascale and AI inference

(03:04) What makes Cirrascale a neocloud

(04:42) Why AI shifted from training to inference

(06:58) Private inference and enterprise security needs

(08:13) Hyperscalers vs neoclouds for AI workloads

(10:22) Performance metrics that matter in inference

(13:29) Hardware choices and inference accelerators

(20:04) Real enterprise AI use cases and automation

(23:59) Hybrid AI, regulated industries, and compliance

(26:43) Proof of value before AI pilots

(31:18) White-glove AI infrastructure vs self-serve cloud

(33:32) Qualcomm partnership and inference-first AI

(41:52) Edge-to-cloud inference and agentic workflows

(49:20) Why AI pilots fail and how enterprises succeed