Reliability Enablers Podcast Por Ash Patel & Sebastian Vietz arte de portada

Reliability Enablers

Reliability Enablers

De: Ash Patel & Sebastian Vietz
Escúchala gratis

Software reliability is a tough topic for engineers in many organizations. The Reliability Enablers (Ash Patel and Sebastian Vietz) know this from experience. Join us as we demystify reliability jargon like SRE, DevOps, and more. We interview experts and share practical insights. Our mission is to help you boost your success in reliability-enabling areas like observability, incident response, release engineering, and more.

read.srepath.comAsh P
Economía
Episodios
  • #67 Why the SRE Book Fails Most Orgs — Lessons from a Google Veteran
    Jul 15 2025

    A new or growing SRE team. A copy of the book. A company that says it cares about reliability. What happens next? Usually… not much.

    In this episode, I sit down with Dave O’Connor, a 16-year Google SRE veteran, to talk about what happens when organizations cargo-cult reliability practices without understanding the context they were born in.

    You might know him for his self-deprecating wit and legendary USENIX blurb about being “complicit in the development of the SRE function.”

    This one’s a treat — less “here’s a shiny new tool” and more “here’s what reliability actually looks like when you’ve seen it all.”

    No vendor plugs from Dave at all, just a good old-fashioned chat about what works and what doesn’t.

    Here’s what we dive into:

    * The adoption trap: Why SRE efforts often fail before they begin—especially when new hires care more about reliability than the org ever intended.

    * The SRE book dilemma: Dave’s take on why following the SRE book chapter-by-chapter is a trap for most companies (and what to do instead).

    * The cost of “caring too much”: How engineers burn out trying to force reliability into places it was never funded to live.

    * You build it, you run it (but should you?): Not everyone’s cut out for incident command—and why pretending otherwise sets teams up to fail.

    * Buying vs. building: The real reason even conservative enterprises are turning into software shops — and the reliability nightmare that follows.

    We also discuss the evolving role of reliability in organizations today, from being mistaken for “just ops” to becoming a strategic investment (when done right).

    Dave's seen the waves come and go in SRE — and he's still optimistic. That alone is worth a listen.



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit read.srepath.com
    Más Menos
    31 m
  • #66 - Unpacking 2025 SRE Report’s Damning Findings
    Jul 1 2025

    I know it’s already six months into 2025, but we recorded this almost three months ago. I’ve been busy with my foray into the world of tech consulting and training —and, well, editing these podcast episodes takes time and care.

    This episode was prompted by the 2025 Catchpoint SRE Report, which dropped some damning but all-too-familiar findings:

    * 53% of orgs still define reliability as uptime only, ignoring degraded experience and hidden toil

    * Manual effort is creeping back in, reversing five years of automation gains

    * 41% of engineers feel pressure to ship fast, even when it undermines long-term stability

    To unpack what this actually means inside organizations, I sat down with Sebastian Vietz, Director of Reliability Engineering at Compass Digital and co-host of the Reliability Enablers podcast.

    Sebastian doesn’t just talk about technical fixes — he focuses on the organizational frictions that stall change, burn out engineers, and leave “reliability” as a slide deck instead of a lived practice.

    We dig into:

    * How SREs get stuck as messengers of inconvenient truths

    * What it really takes to move from advocacy to adoption — without turning your whole org into a cost center

    * Why tech is more like milk than wine (Sebastian explains)

    * And how SREs can strengthen—not compete with—security, risk, and compliance teams

    This one’s for anyone tired of reliability theatrics. No kumbaya around K8s here. Just an exploration of the messy, human work behind making systems and teams more resilient.



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit read.srepath.com
    Más Menos
    30 m
  • #65 - In Critical Systems, 99.9% Isn’t Reliable — It’s a Liability
    Jun 17 2025

    Most teams talk about reliability with a margin for error. “What’s our SLO? What’s our budget for failure?”

    But in the energy sector? There is no acceptable downtime. Not even a little.

    In this episode, I talk with Wade Harris, Director of FAST Engineering in Australia, who’s spent 15+ years designing and rolling out monitoring and control systems for critical energy infrastructure like power stations, solar farms, SCADA networks, you name it.

    What makes this episode different is that Wade isn’t a reliability engineer by title, but it’s baked into everything his team touches. And that matters more than ever as software creeps deeper into operational technology (OT), and the cloud tries to stake its claim in critical systems.

    We cover:

    * Why 100% uptime is the minimum bar, not a stretch goal

    * How the rise of renewables has increased system complexity — and what that means for monitoring

    * Why bespoke integration and SCADA spaghetti are still normal (and here to stay)

    * The reality of cloud risk in critical infrastructure (“the cloud is just someone else’s computer”)

    * What software engineers need to understand if they want their products used in serious environments

    This isn’t about observability dashboards or DevOps rituals. This is reliability when the lights go out and people risk getting hurt if you get it wrong.

    And it’s a reminder: not every system lives in a feature-driven world. Some systems just have to work. Always. No matter what.



    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit read.srepath.com
    Más Menos
    28 m
Todavía no hay opiniones