Episodes

  • On restraining AI development for the sake of safety
    Mar 19 2026

    My take on slowing down AI. Text version here: https://joecarlsmith.com/2026/03/19/on-restraining-ai-development-for-the-sake-of-safety/

    1 h 22 m
  • Building AIs that do human-like philosophy
    Jan 29 2026

    AIs will face philosophical questions humans can't answer for them. Text version here: https://joecarlsmith.com/2026/01/29/building-ais-that-do-human-like-philosophy/

    34 m
  • How human-like do safe AI motivations need to be?
    Nov 12 2025

    AIs with alien motivations can still follow instructions safely on the inputs that matter. Text version here: https://joecarlsmith.com/2025/11/12/how-human-like-do-safe-ai-motivations-need-to-be/

    1 h 24 m
  • Leaving Open Philanthropy, going to Anthropic
    Nov 3 2025

    On a career move, and on AI-safety-focused people working at AI companies. Text version here: https://joecarlsmith.com/2025/11/03/leaving-open-philanthropy-going-to-anthropic/

    32 m
  • Controlling the options AIs can pursue
    Sep 29 2025

    On boxing AIs, and on making deals with them. Text version here: https://joecarlsmith.com/2025/09/29/controlling-the-options-ais-can-pursue

    56 m
  • Giving AIs safe motivations
    Aug 18 2025

    A four-step picture. Text version here: https://joecarlsmith.com/2025/08/18/giving-ais-safe-motivations

    1 h 23 m
  • The stakes of AI moral status
    May 21 2025

    On seeing and not seeing souls. Text version here: https://joecarlsmith.com/2025/05/21/the-stakes-of-ai-moral-status/

    37 m
  • Can we safely automate alignment research?
    Apr 30 2025

    It's really important; we've got a real shot; there are a ton of ways to fail.

    Text version here: https://joecarlsmith.com/2025/04/30/can-we-safely-automate-alignment-research/.

    There's also a video and transcript of a talk I gave on this topic here: https://joecarlsmith.com/2025/04/30/video-and-transcript-of-talk-on-automating-alignment-research/

    1 h 30 m