AI Explained Official Podcast Podcast Por Philip - Host of AI Explained YT arte de portada

AI Explained Official Podcast

AI Explained Official Podcast

De: Philip - Host of AI Explained YT
Escúchala gratis

Acerca de esta escucha

Covering the biggest news of the century - the arrival of smarter-than-human AI. From the author of Simple Bench, which reveals the remaining gap between LLM and human reasoning. Hype-free, and the British accent is a freebie bonus.

© 2025 AI Explained Official Podcast
Ciencias Sociales Desarrollo Personal Política y Gobierno Éxito Personal
Episodios
  • Grok 4 - 10 New Things to Know
    Jul 10 2025

    Grok 4 is here, but did you know these 10 things about the new model? From benchmark caveats to soloing science, $300 a month secrets to Grok 5 promises, here's 10 new things to know in just under 12 minutes.

    AI Insiders ($9!): https://www.patreon.com/AIExplained

    Chapters:
    00:00 - Introduction
    00:22 - Benchmark Results
    02:11 - Benchmark Caveats
    02:59 - ARC-AGI 2
    03:35 - SimpleBench
    04:49 - ‘Humanity’s Last Exam’
    07:20 - SuperGrok Heavy Price
    07:58 - API Price
    08:12 - Grok 5, Gemini 3.0 Beta, GPT-5
    09:12 - System Prompt Change + $1B a month, pollution
    10:20 - Not soloing science, helping you solo code

    Livestream: https://www.youtube.com/watch?v=1tQ_KrlHgfg&t=1s

    Price: https://grok.com/#subscribe
    https://x.com/ArtificialAnlys/status/1943166841150644622

    Gemini DeepThink: https://blog.google/technology/google-deepmind/google-gemini-updates-io-2025/#deep-think

    https://simple-bench.com/

    ARC-AGI 2: https://x.com/arcprize/status/1943168950763950555

    Humanity’s Last Exam: https://agi.safe.ai/

    SmartGPT: https://www.youtube.com/watch?v=hVade_8H8mE

    New Power Plant, 1m GPUs: https://www.tomshardware.com/tech-industry/artificial-intelligence/elon-musk-xai-power-plant-overseas-to-power-1-million-gpus

    Gemini 3.0 beta: https://web.archive.org/web/20250709174548/https://github.com/google-gemini/gemini-cli/blob/b0cce952860b9ff51a0f731fbb8a7649ead23530/packages/cli/src/ui/utils/errorParsing.test.ts

    Pollution: https://www.theguardian.com/technology/2025/apr/24/elon-musk-xai-memphis
    https://www.youtube.com/watch?v=C8rU4dv2w8Q
    https://www.youtube.com/watch?v=3VJT2JeDCyw

    System Prompt: https://github.com/xai-org/grok-prompts/blob/535aa67a6221ce4928761335a38dea8e678d8501/ask_grok_system_prompt.j2

    Burn Rate: https://www.bloomberg.com/news/articles/2025-06-17/musk-s-xai-burning-through-1-billion-a-month-as-costs-pile-up

    Ron Johnson: https://x.com/jdcmedlock/status/1939814516503847259



    Non-hype Newsletter: https://signaltonoise.beehiiv.com/

    Podcast: https://aiexplainedopodcast.buzzsprout.com/

    Más Menos
    12 m
  • When Will AI Models Blackmail You, and Why?
    Jun 24 2025

    In the last few days Anthropic have released an impressive honest account of how all models blackmail, no matter what goal they have, and despite prompt warnings, and other preventions. But do these models *want* this?

    Thanks to Storyblocks for sponsoring this video! Download unlimited stock media at one set price with Storyblocks: storyblocks.com/AIExplained


    AI Insiders ($9!): https://www.patreon.com/AIExplained

    Chapters:
    00:00 - Introduction
    01:20 - What prompts blackmail?
    02:44 - Blackmail walkthrough
    06:04 - ‘American interests’
    08:00 - Inherent desire?
    10:45 - Switching Goals
    11:35 - Murder
    12:22 - Realizing it’s a scenario?
    15:02 - Prompt engineering fix?
    16:27 - Any fixes?
    17:45 - Chekov’s Gun
    19:25 - Job implications
    21:19 - Bonus Details

    Report: https://www.anthropic.com/research/agentic-misalignment
    30 Page Appendices: https://assets.anthropic.com/m/6d46dac66e1a132a/original/Agentic_Misalignment_Appendix.pdf
    Announcement: https://x.com/AnthropicAI/status/1936144602446082431?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Etweet
    OpenAI Files: https://www.openaifiles.org/
    Grok 4 News: https://x.com/RonFilipkowski/status/1936372579607912473
    Claude 4 Report Card: https://www-cdn.anthropic.com/6be99a52cb68eb70eb9572b4cafad13df32ed995.pdf
    New Apollo Research: https://www.apolloresearch.ai/blog/more-capable-models-are-better-at-in-context-scheming
    Interesting Reflections: https://nostalgebraist.tumblr.com/post/785766737747574784/the-void


    Non-hype Newsletter: https://signaltonoise.beehiiv.com/

    Más Menos
    26 m
  • Apple’s ‘AI Can’t Reason’ Claim Seen By 13M+, What You Need to Know
    Jun 12 2025

    What to make of those headlines that AI can’t reason, seen by tens of millions? I cover the paper in layman’s terms, what it means and doesn’t mean, and what’s next.

    Thanks to Storyblocks for sponsoring this video! Download unlimited stock media at one set price with Storyblocks: https://storyblocks.com/AIExplained

    Plus o3-pro and whether it is my current most-recommended model.

    AI Insiders ($9!): https://www.patreon.com/AIExplained

    Chapters:
    00:00 - Introduction
    00:57 - Viral Post + Headlines
    01:42 - Apple Paper Analysis
    08:34 - But they do Hallucinate
    10:43 - Not Supercomputers
    11:18 - o3 Pro and Recommendations


    13.7M Tweet: https://x.com/RubenHssd/status/1931389580105925115

    Apple Paper: https://ml-site.cdn-apple.com/papers/the-illusion-of-thinking.pdf

    Guardian Article: https://www.theguardian.com/technology/2025/jun/09/apple-artificial-intelligence-ai-study-collapse

    Lisan al Gaib post: https://x.com/scaling01/status/1931854370716426246

    Multiplication: https://x.com/yuntiandeng/status/1836114401213989366

    The Illusion of the Illusion of Thinking: https://drive.google.com/file/d/1Zx9ikRj0Enc3SB4wA9HlYIlpmO_8QiUO/view

    Marcus: https://www.theguardian.com/commentisfree/2025/jun/10/billion-dollar-ai-puzzle-break-down

    Prof Rao: https://x.com/rao2z/status/1927707640223719631

    AI Job Headlines: https://www.nytimes.com/2025/06/11/technology/ai-mechanize-jobs.html
    https://www.axios.com/2025/05/28/ai-jobs-white-collar-unemployment-anthropic

    Sky News Story: https://news.sky.com/story/can-we-trust-chatgpt-despite-it-hallucinating-answers-13380975

    Veo 3 Ad: https://x.com/Kalshi/status/1932891608388681791

    Altman Essay: https://blog.samaltman.com/

    o3 Original benchmarks: https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b8b6c44-acd6-43b3-b5c6-1a1d5c6c25e4_2486x1388.png

    https://pbs.twimg.com/media/GfQ0bfcXQAAQt13.jpg

    Alpha Evolve Video: https://www.youtube.com/watch?v=RH4hAgvYSzg

    https://simple-bench.com/


    Non-hype Newsletter: https://signaltonoise.beehiiv.com/

    Más Menos
    14 m
Todavía no hay opiniones