Google AI: Release Notes Podcast Por Google AI arte de portada

Google AI: Release Notes

Google AI: Release Notes

De: Google AI
Escúchala gratis

OFERTA POR TIEMPO LIMITADO. Obtén 3 meses por US$0.99 al mes. Obtén esta oferta.
Ever wondered what it's really like to build the future of AI? Join host Logan Kilpatrick for a deep dive into the world of Google AI, straight from the minds of the builders. We're pulling back the curtain on the latest breakthroughs, sharing the unfiltered stories behind the tech, and answering the questions you've been dying to ask. Whether you're a seasoned developer or an AI enthusiast, this podcast is your backstage pass to the cutting-edge of AI technology. Tune in for: - Exclusive interviews with AI pioneers and industry leaders. - In-depth discussions on the latest AI trends and developments. - Behind-the-scenes stories and anecdotes from the world of AI. - Unfiltered insights and opinions from the people shaping the future. So, if you're ready to go beyond the headlines and get the real scoop on AI, join Logan Kilpatrick on Google AI: Release Notes.2024 Google Ciencia
Episodios
  • How a Moonshot Led to Google DeepMind's Veo 3
    Oct 16 2025

    Dumi Erhan, co-lead of the Veo project at Google DeepMind, joins host Logan Kilpatrick for a deep dive into the evolution of generative video models. They discuss the journey from early research in 2018 to the launch of state-of-the-art Veo 3 model with native audio generation. Learn about the technical hurdles in evaluating and scaling video models, the challenges of long-duration video coherence and how user feedback is shaping the future of AI-powered video creation.

    Chapter:
    0:00 - Intro
    0:47 - Veo project's beginnings
    3:02 - Veo's origins in Google Brain
    5:07 - Video prediction and robotics applications
    7:45 - Early progress and evaluation challenges
    10:30 - Physics-based evaluations and their limitations
    12:18 - The launch of the original Veo model
    14:06 - Scaling challenges for video models
    16:02 - The leap from Veo1 to Veo2
    19:40 - Veo 3’s viral audio moment
    21:17 - User trends shaping Veo's roadmap
    23:49 - Image-to-video vs. text-to-video complexity
    26:00 - New prompting methods and user control
    27:55 - Coherence in long video generation
    31:03 - Genie 3 and world models
    35:54 - The steerability challenge
    41:59 - Capability transfer and image data's role
    47:25 - Closing

    Más Menos
    48 m
  • GDM’s Pushmeet Kohli on solving science's biggest challenges with AI
    Sep 15 2025

    Pushmeet Kohli, Head of Science and Strategic Initiatives at Google DeepMind, joins host Logan Kilpatrick to explore the intersection of AI and scientific discovery. Learn how the team's unique problem-solving framework led to innovations like AlphaFold and AlphaEvolve, and how new tools like AI Co-scientist aim to democratize these types of breakthroughs for everyone.

    Watch on YouTube: https://www.youtube.com/watch?v=o7mdsL6BHsk

    Chapters:
    0:00 - Intro
    1:04 - Recent Alpha launches
    02:15 - Framework for selecting research domains
    06:21 - Scientific, commercial and social impact
    15:00 - Wielding AGI for breakthroughs
    16:48 - Tech transfer and team collaboration
    19:46 - IMO Gold Medal
    21:42 - Evaluating math proofs
    22:55 - From specialized models to Deep Think
    24:22 - Do math skills generalize?
    25:53 - Generalizing the IMO model
    27:43 - Democratizing AI science tools
    30:09 - AI Co-scientist
    35:17 - An API for science?

    Más Menos
    37 m
  • Behind the scenes of Google's state-of-the-art "nano-banana" image model
    Aug 27 2025

    Join host Logan Kilpatrick in discussion with some of the minds behind Google's new state-of-the-art image model, Gemini 2.5 Flash. Product and research leads from the Gemini team break down the technology behind its key capabilities, including interleaved generation for complex edits and new approaches to achieving character consistency and pixel-perfect control. With Nicole Brichtova, Kaushik Shivakumar, Mostafa Dehghani and Robert Riachi.

    Watch on YouTube:

    Chapters:
    0:37 - New model introduction
    1:21 -Demo - Image Editing
    3:44 - Text rendering capabilities
    4:44 Beyond human preference evals
    6:44 - Text rendering as a proxy for quality
    8:38 - Positive transfer between modalities
    11:25 - Demo - Multi-turn, context aware image generation
    13:54 - Pixel-perfect editing and character consistency
    15:51 - Interleaved image generation
    17:59 - Specialized vs. native models
    19:52 - Understanding nuanced prompts
    20:59 - User feedback shaping model development
    22:37 - Improvements in character consistency
    24:17 - More natural looking images from team collaboration
    26:41 - What’s next for image generation models

    Más Menos
    31 m
Todavía no hay opiniones