Episodes

  • We Built Microsoft Teams in 23 Minutes (And You Can Use It) & GPT 5.4 Impressions - EP99.37
    Mar 6 2026

    Join us on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80
    Join Simtheory: https://simtheory.ai

    🚀 Try our AI-built apps:

    Macrosoft Teams: teams.simtheoryapp.com (working video chat with up to 150 people)
    Trallo: trallo.simtheoryapp.com (full Trello clone, unlimited boards, completely free)

    TDIA Discord: https://discord.gg/gTW4RkAJvn
    Spotify Songs: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=Zh4jgHIASI2ZvsXVfVcCoA

    So Chris, this week... we've been having way too much fun with the AI again. OpenAI just dropped GPT-5.4 and 5.4 Pro, and holy shit - we finally have a ball game. This might be the first OpenAI model that genuinely competes with Opus 4.6 for agentic work.

    But here's where it gets wild: we rebuilt Trello AND Microsoft Teams from scratch using single prompts. Not mockups. Fully deployed, working apps with authentication, video chat, the works. You can literally sign up and use them right now.

    Plus: We roast Gemini 3.1 (it's a disgrace for agentic workflows), break down the insane $30/$180 per-million-token pricing on 5.4 Pro (who is this for??), and discuss why every $99/month SaaS tool might be about to die. Chris declares his programming skills "useless" and honestly... he might be right.

    We also demo our actual workflow - running 5 agent tabs simultaneously, delegating everything, and why we barely visit websites anymore. The AI workspace IS the operating system now.


    CHAPTERS:

    0:00 - Intro & Housekeeping (We Screwed Up the Link)
    1:26 - GPT-5.4 First Impressions & Specs
    3:12 - Chris's Testing: 40 Minutes to Solve a Problem
    4:51 - Knowledge Work Improvements (Catching Up to Anthropic)
    6:38 - Computer Use vs Browser/Terminal Debate
    8:07 - Why We Don't Need Computer Use Anymore
    9:53 - Teaser: We Built Full SaaS Apps Today
    11:19 - Tool Search API & Skills Integration
    13:20 - The Speed Problem (It's a Plodder)
    15:12 - GPT-5.4 Pro Pricing Reaction ($30/$180 WTF)
    18:14 - Someone Rebuilt Minecraft in 24 Minutes
    19:46 - Gemini 3.1 Roast: "It's a Disgrace"
    22:36 - DEMO: Trallo (Full Trello Clone)
    29:03 - DEMO: Macrosoft Teams (Working Video Chat!)
    33:30 - The SaaS Collapse Theory
    36:42 - AI Workspace as the New Operating System
    38:57 - Forcing Features onto Entrenched Software
    43:32 - "My Programming Skills Are Useless" - Chris
    46:06 - The $12 Million Legacy Software Opportunity
    51:06 - Beyond Code: Forms, PDFs, Knowledge Work
    55:28 - How Fast Will This Change Everything?
    56:31 - Gemini 3.1 Flash Lite Quick Take
    59:36 - The Delegation Lifestyle (5 Agent Tabs Running)
    1:01:24 - Mike's Workflow Demo
    1:04:31 - Cognitive Overload Problem
    1:06:04 - Release Date: 2 Weeks (Drop Punishment Ideas!)

    Thanks for listening. Like & sub. xoxo

    1 h 8 m
  • Nano Banana 2 is Here! Gemini-3 Shutdown & The AI Layoff Myth | EP99.36
    Feb 27 2026

    Join us on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80

    Join Simtheory: https://simtheory.ai

    TDIA Discord: https://discord.gg/gTW4RkAJvn

    Horse Egg Lifecycle Infographic: https://staging.simtheory.ai/share/file/UZ2KJU

    ----

    So Chris, this week... we're diving into Google's new Nano Banana 2 image model - 50% cheaper and supposedly faster (when the servers aren't melting). We put it through its paces with annotation-based editing, slide generation, and yes, the return of the legendary horse egg experiment.

    Plus: Google quietly kills Gemini-3 after just a few months (good riddance?), we discuss why the model was "dead on arrival" for agentic workflows, and break down the real story behind those massive AI layoff announcements from Block and WiseTech. Spoiler: it's probably not actually about AI.

    We also get into the current state of the model wars (Opus 4.6 vs Codex 5.3), why smaller models like GLM-5 might be the future for enterprise agentic tasks, and Chris's wife teaching Claude to literally speak to her using Mac's text-to-speech. The models are getting creative.

    ---
    0:00 - Intro
    0:36 - Nano Banana 2: Price, Speed & First Impressions
    3:19 - The Compositing Problem & Last Mile Design
    5:41 - Annotation-Based Editing (This Changes Everything)
    9:52 - Slide Editing & Real-World Use Cases
    12:34 - The Horse Egg Experiment Returns
    14:30 - Image Degradation & Cost Breakdown
    17:47 - Text-to-Image Leaderboard Discussion
    20:01 - Why Nano Banana Dominates for Work
    22:07 - Codex 5.3 vs Opus 4.6
    22:54 - Google Kills Gemini-3 (What Went Wrong?)
    26:48 - Google's Agentic Problem
    30:08 - The Model Loyalty Cycle
    34:22 - Why Opus 4.6 is Still the Best
    37:05 - Cost Optimization & Smart Model Routing
    43:30 - When Models Get Stuck on the Wrong Path
    45:36 - Nicole's AI Learns to Talk Back
    46:54 - Can Anyone Build Software Now?
    52:26 - Anthropic's Legal/Finance Plugins & Market Panic
    57:08 - Block Lays Off 4,000: AI or Excuse?
    1:00:05 - The AI Job Apocalypse Isn't Real

    Thanks for listening. Like & sub. xoxo

    1 h 2 m
  • Gemini 3.1 Pro, Claude Sonnet 4.6 & The OpenClaw Hire That Killed the Chatbot Era - EP99.35
    Feb 20 2026
    • Join Simtheory: https://simtheory.ai
    • "Is This The End" now on Spotify: https://open.spotify.com/album/2Py1MyADUFqJFVUISI2VTP?si=oT3PWyJYRA2BspOmzT_ifg
    • Register for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80


    Two new models dropped this week — Gemini 3.1 Pro and Claude Sonnet 4.6 — and honestly? We're struggling to care. In this episode, we break down why Gemini went from being our daily driver to a model we barely touch, the "tunnel vision" hallucination problem that killed the Gemini 3 series for us, and whether 3.1 Pro actually fixes it. We put Gemini 3.1 Pro head-to-head against Claude Opus building a Geoffrey Hinton Doom Center, debate whether anyone can actually tell the difference between Sonnet 4.5 and 4.6, and make the case that smaller models running in agentic loops are secretly beating the frontiers. Plus: OpenAI acquires OpenClaw and we ask why a $100B company couldn't just build it themselves, DHH calls out the AI pricing bubble, Mike compares AI models to cheap wine hangovers, and Sam Altman refuses to hold Dario's hand at the India AI Summit. The model wars are getting weird.

    CHAPTERS:

    0:00 Intro & "Is This The End" Now on Spotify
    1:10 Gemini 3.1 Pro: Thinking Controls & The Medium Mode Fix
    3:14 The Speed vs Intelligence Trade-Off in Agentic Work
    5:10 Why Multitasking With AI Agents Made Us Anxious
    6:34 Solid Updates: The Real Goal of Agentic Coding
    7:45 Gemini's Fall From Grace: From Daily Driver to Dead Model
    10:08 The Tunnel Vision Problem That Killed Gemini 3
    13:35 Mixed Reactions: Fanboys vs Reality on Gemini 3.1 Pro
    15:06 Side-by-Side Test: Gemini 3.1 Pro vs Claude Opus (Hinton Doom Center)
    17:39 Why File Manipulation Accuracy Matters More Than Context Windows
    19:27 The Context Window Debate: 1M Tokens vs Smart Sub-Agents
    22:05 DHH on Token Pricing: "If There's a Bubble, It's This"
    24:11 Should Models Ship as Agent vs Chat Variants?
    28:43 Claude Sonnet 4.6: A $2 Discount on Opus?
    31:44 The Model Mix: Why One Model Won't Rule Them All
    34:40 Anthropic Is Winning — But Can Anyone Tell the Difference?
    38:58 OpenAI Acquires OpenClaw: Why Couldn't They Just Build It?
    44:18 The Silicon Valley Moment: Sam vs Dario at India AI Summit
    47:05 Will Smaller Models Win the Enterprise? The Cost Reality Check
    51:27 The End of Single-Shot: Why Agentic Loops Change Everything
    55:48 Final Thoughts & Gemini 3.1 Pro Gets One More Week

    Thanks for listening. Like & Sub. Links above for the Still Relevant Tour signup and Simtheory. Two models dropped in one week again. What a time to be alive. xoxo

    58 m
  • Am I Even Needed Anymore? GLM-5, Agentic Loops & AI Productivity Psychosis - EP99.34
    Feb 13 2026

    Join Simtheory: https://simtheory.ai

    Register for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80

    GLM-5 just dropped and it's trained entirely on Huawei chips – zero US hardware dependency. Meanwhile, we're having existential crises about whether we're even needed anymore. In this episode, we break down China's new frontier model that's competing with Opus 4.6 and Codex at a fraction of the price, why agentic loops are making 200K context windows the sweet spot (sorry, million-token dreams), and the very real phenomenon of AI productivity psychosis. We dive into why coding-optimized models are secretly winning at everything, the Harvard study confirming AI doesn't reduce work – it intensifies it, and the exodus of safety researchers from xAI, Anthropic, and OpenAI (spoiler: they're not giving back their shares). Plus: Mike's arm is failing from too much mouse usage, we debate whether the chatbot era is actually fading, and yes – there's a safety researcher diss track called "Is This The End?"

    CHAPTERS:

    0:00 Intro - Is This The End? (Song Preview)
    0:11 Still Relevant Tour Update & NASA Listener Callout
    1:42 AI Productivity Psychosis: The Pressure of Infinite Capability
    4:25 GLM-5 Breakdown: China's New Frontier Model on Huawei Chips
    7:24 First Impressions: GLM-5 in Agentic Loops
    9:48 Why Cheap Models Matter & The New Model War
    14:09 Codex Vibe Shift: Is OpenAI Winning?
    16:24 Does Context Window Size Even Matter Anymore?
    22:27 The Parallelization Problem & Cognitive Overload
    27:27 Mike's Arm Injury & The Voice Input Pivot
    31:17 Single-Threaded Work & The 95% Problem
    35:06 UX is Unsolved: Rolling Back Agentic Mistakes
    38:45 Harvard Study: AI Doesn't Reduce Work, It Intensifies It
    44:01 How AI Erodes Company Structure & Why Adoption Takes Years
    50:14 My AI vs Your AI: Household Debates
    50:43 The Safety Researcher Exodus: xAI, Anthropic, OpenAI
    56:49 Final Thoughts: Are We All Still Relevant?
    59:04 BONUS: Full "Is This The End?" Diss Track

    Thanks for listening. Like & Sub. Links above for the Still Relevant Tour signup and Simtheory. GLM-5 is here, your productivity psychosis is valid, and the safety researchers are becoming poets. xoxo

    1 h 3 m
  • Is the ChatGPT Era Over? Opus 4.6 & The Shift from Chat to Delegation - EP99.33
    Feb 6 2026

    Join Simtheory: https://simtheory.ai

    Register for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80

    It's the model same-day showdown of 2026. Opus 4.6 and Codex 5.3 dropped within minutes of each other, and we're breaking down what this means for the future of AI work. In this episode, we unpack Opus 4.6's million-token context window (if you've got billies in the bank), why Codex's pricing makes it nearly impossible to ignore for agentic loops, and the real cost of running agents for 24 hours ($10K, apparently). We dive deep into why coding-optimized models are secretly crushing it at non-coding tasks, the mental fatigue of managing AI workers, and whether the chatbot era is actually fading or just evolving. Plus: Chris accidentally books three real pig grooming appointments, we debate whether you need a "life coach agent" to manage your agent swarm, and yes – there's an Opus 4.6 diss track that goes unreasonably hard.

    CHAPTERS:

    0:00 Intro - Opus 4.6 Diss Track Preview
    0:09 The Model Same-Day Showdown: Opus 4.6 vs Codex 5.3
    0:50 Opus 4.6 Breakdown: Million Token Context & Premium Pricing
    2:31 Token Bill Shock: $10K Research Bills & Extended Context Costs
    5:04 Codex Pricing: Why It's Nearly Free for Agentic Loops
    6:42 Why Coding Models Are Secretly Crushing Non-Coding Tasks
    10:14 Tool Fatigue: Too Many Models, Too Many Workflows
    12:47 Opus 4.6 First Impressions: "Solid" and "Faultless"
    13:48 Chris Accidentally Books Three Real Pig Grooming Appointments
    16:01 Unix Tools & Why Code-Optimized Models Win at Everything
    19:59 The Agentic Retraining Imperative: Chat to Delegation
    22:16 Agent Swarms & The Master Thread Architecture
    24:51 OpenAI vs Anthropic: The Enterprise Battle
    27:09 Corporate Espionage 2.0: Stealing Skills & The Open Source Threat
    31:19 The UX Problem: Why Delegation Isn't Solved Yet
    34:24 The Stress of Hyper-Productivity & Managing Agent Swarms
    37:07 Coordination: The Next Layer of Abstraction
    40:09 The Fantasy vs Reality of Autonomous AI Businesses
    44:37 Is the Turn-by-Turn Chatbot Era Actually Fading?
    49:23 Tokens as Spice: Turning Compute Into Money
    52:08 Reduce Cognitive Overload: The Real Goal of AI
    55:07 Still Relevant Tour Announcement
    55:39 BONUS: Full Opus 4.6 Diss Track

    Thanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. The model wars are heating up, and your token bill is about to get interesting. xoxo

    1 h 2 m
  • Did Clawdbot Just Show Us the Future of AI Workers? & Kimi K2.5 Diss Track Tested - EP99.32
    Jan 30 2026

    Join Simtheory: https://simtheory.ai
    Register for the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80
    ---

    The hype train in 2026 knows only Moltbot (RIP Clawdbot). In this episode, we unpack the viral open-source AI assistant that's taken over the internet: what it actually does, why everyone's losing their minds, and whether it's worth the $750/day token bills some users are racking up. We dive deep into why locally-run skills and CLI tools are beating computer-use clicking, how smaller models like GPT-5 Mini are crushing it in agentic workflows, and why the real magic is in targeted context - not massive swarms. Plus: Kimi K2.5 drops as a near-Sonnet-level model at 1/10th the price, we debate whether SaaS is dead, and yes – there are TWO Kimi K2.5 diss tracks. One made by Opus pretending to be Kimi. It might just slap?

    CHAPTERS:

    0:00 Intro - Still Relevant Tour Update
    0:48 What is Moltbot? The Viral AI Assistant Explained
    3:57 Token Bill Shock: $750/Day and Anthropic Bans
    5:00 The Dream of Digital Coworkers on Mac Minis
    6:52 Why CLI Tools & Skills Beat Computer-Use Clicking
    10:57 Why This Way of Working Is Genuinely Exciting
    14:47 Smaller Models Crushing It: GPT-5 Mini & Targeted Context
    17:30 Wild Agentic Behavior: Chrome Tab Hijacking & Auto-Retries
    20:10 Security Architecture: Locked-Down Machines & Enterprise Use
    24:01 AI Building Its Own Tools On-The-Fly
    27:08 The Fear & Overwhelm of Rapid Progress
    29:10 2026: The Year of Agent Workers
    31:43 The Challenge of Directing AI Work (Everyone's a Manager Now)
    37:24 Skills Will Take Over: Why MCPs & Atlassian Can't Stop Us
    40:38 Real-World Use Cases: Doctors, Lawyers & Accountants
    46:28 Cost Solutions: Build Workflows Around Cheaper Models
    52:58 Kimi K2.5: Sonnet-Level Performance at 1/10th the Price
    1:00:55 The "1,500 Tool Calls" Claim: Marketing vs Reality
    1:05:23 The Kimi K2.5 Diss Tracks (Opus vs Kimi)
    1:08:08 Demo: Black Hole Simulator & Self-Trolling CRM
    1:12:55 Is SaaS Dead?
    1:14:30 BONUS: Full Kimi K2.5 Diss Tracks

    Thanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. The future is open source, apparently. xoxo

    1 h 20 m
  • The AI Productivity Paradox: Why Doing More Feels Like Burnout: EP99.31
    Jan 23 2026

    Join Simtheory: https://simtheory.ai

    Reserve your seat on the STILL RELEVANT tour: https://simulationtheory.ai/16c0d1db-a8d0-4ac9-bae3-d25074589a80
    ----
    Two episodes in one week? We're either above average or completely unhinged. In this one, we dive deep into the new phenomenon of "AI exhaustion" – that fried feeling you get after multitasking across six agent tabs all day. We share our breakthroughs with AI-assisted presentations (20 minutes vs several hours), why browser-use on your local machine bypasses every anti-scraping technique known to man, and how enterprise context sharing could be the real unlock for organizations. Plus: OpenAI announces ads for ChatGPT (even on paid tiers), their CFO floats taking cuts from drug discoveries (seriously), and Google publicly dunks on them for it. Also – the Still Relevant Australia Tour is coming, and our LinkedIn group hit 200 members (we're basically LinkedIn influencers now too).

    CHAPTERS:

    0:00 Intro - Still Relevant Tour Announcement + LinkedIn Milestone
    2:08 AI Exhaustion: The Cognitive Overload of Multitasking with Agents
    4:14 Why Single-Tasking with AI Beats Parallel Agent Chaos
    7:02 The Problem with "I Spun Up 70,000 Sub-Agents" Twitter Posts
    10:03 Mike's Presentation Workflow: From Hours to 20 Minutes
    14:06 Why Isn't Copilot Doing This Already?
    16:54 Old Models + Great Context = Still Amazing Results
    21:14 What's Actually Changed? It's the Software Layer
    25:22 Enterprise Context Sharing & Organizational IP
    31:22 Skills, Sub-Agents, and Role-Based Knowledge
    35:22 Security Concerns: Can You Hack an Agent with Malicious MD Files?
    38:23 Cloud Providers Have a Bigger Moat Than the Labs
    43:16 Browser Use: The Ultimate Context Gathering Weapon
    48:25 Rethinking SaaS: Software That Actually Thinks
    53:08 Smart Paste, Smart CC – Why Isn't All Software Like This?
    56:32 OpenAI's Desperate Moves: Ads, Age Verification & Drug Royalties
    1:03:03 Google Says "No Plans for Gemini Ads" (Shots Fired)
    1:07:24 Is OpenAI Okay? The Vibes Are Definitely Off
    1:10:35 Capitalism Won't Give You Free Time, Just More Demands
    1:11:20 Outro + Still Relevant Tour Details

    Thanks for listening. Like & Sub. Links below for the Still Relevant Tour signup and Simtheory. xoxo

    1 h 13 m
  • 2026 Existential Crisis, Claude Code Hype & Is SaaS Dead? EP99.30-WIZARDS
    Jan 19 2026

    Join Simtheory: https://simtheory.ai
    ---
    Join the most average AI LinkedIn group: https://www.linkedin.com/groups/16562039/

    It's 2026 and everyone's having an existential crisis. In this episode, we unpack the two camps dominating AI X/Twitter: hype boys claiming "Claude Code can do my washing" vs. software developers doom-scrolling themselves into career panic. We put the agentic hype to the test and discover that no, you can't actually run 8 agents recreating your local business ecosystem while you sleep. Plus, we reflect on why MCP is exhausting, why Gemini 3 Pro is somehow worse than Gemini 2.5 Pro, and why Geoffrey Hinton would rather write his book than answer questions in Tasmania. Also featuring: the $200,000/month enterprise AI problem, why SaaS isn't dead (but it's scared), and our prediction that AI workspaces will become the everything app.

    CHAPTERS:

    00:00 Intro - Unpacking the 2026 AI Vibes
    02:21 Putting Claude Code and Agentic Hype to the Test
    05:57 Why Twitter AI Demos Never Show the Receipts
    07:03 Honest Assessment of Where Frontier Models Are At
    11:19 Building the Everything App with Email, Calendar and Files
    16:47 Collaborative Mode vs Agentic Delegation in Practice
    21:29 The Real Cost of Enterprise AI at Scale
    24:32 Why Cheaper Models Like Haiku and Gemini Flash Matter
    29:25 Is SaaS Actually Dead or Just Disrupted
    38:11 The Future of AI Platforms, SDKs and App Stores
    43:35 The Untapped Opportunity in Paid Proprietary MCPs
    51:21 Geoffrey Hinton Refuses to Take Questions in Tasmania
    55:05 2026 Plans and the Still Relevant Tour Announcement

    Thanks for listening. Like & Sub. xoxox

    1 h 9 m