Episodes

  • Jacob Leverich on Efficiency, Elegance, and the Joy of Not Grepping log files at 2AM
    Apr 22 2025

    This week, Frank sat down with Dr. Jacob Leverich—Stanford PhD, cofounder of Observe, and a veteran of the Google MapReduce team and Splunk. Jacob’s journey, from tinkering with video game code as a kid, to innovating at the cutting edge of distributed systems and energy efficiency, is as inspiring as it is informative.

    Key Takeaways
    • Early Tech Roots: Hear how curiosity with QBasic and classic PCs (think IBM PC XT and Commodore) put Jacob on a path to high-impact data engineering.
    • MapReduce, Dremel, & the Rise of Big Data: Jacob pulls back the curtain on working with some of the most influential data processing tools at Google and how these systems shifted the entire data landscape (hello, BigQuery!).
    • Building Efficient Systems: It’s not just about scale—energy efficiency and performance optimization are the unsung heroes of today’s data infrastructure. Jacob explains why making things “just work” isn’t enough anymore.
    • The Realities of Ops & Observability: Remember the days of grepping logs at 2AM? There’s a better way. Jacob shares how platforms like Observe help teams consolidate, visualize, and act on operational data—turning chaos into actionable insight.
    • Bridging Data & Ops: The lines between data observability and traditional ops are blurring, and Jacob’s unique experience shows how best practices from data warehousing are finally making ops smoother (and less sleepless).
    • Power Concerns & the Future: As data grows, so does energy consumption in data centers. Find out why optimization isn’t just good for performance—it’s key to sustainability.

    Timestamps

    00:00 Interview with Jacob Leverich

    05:59 Journey into Game Programming

    06:43 "Pursuing Fast Video Game Code"

    10:23 Data Processing and Power Efficiency

    16:11 Snowflake's Transformative Database Approach

    19:18 Journey to Data Management Industry

    21:37 Data Products: Solving Core Challenges

    27:07 Early Web Log Analysis Techniques

    28:57 Consolidating Data for Efficiency

    33:23 Specialized Tools and Context Switching

    35:43 Unique Dual-Expertise in Tech

    38:58 User-Centric Business Strategies

    42:13 IP Data Analysis in Cloud

    47:23 Electricity Transport Upsets Local Farms

    48:25 Shift to Parallel Computing

    52:10 Hardware Specialization & Software Optimization

    57:32 "Stay Data Driven"

    58 m
  • István Mészáros on Going from CERN to Startup & the Cat That Launched a Thousand Queries
    Apr 14 2025

    Welcome to another insightful episode of Data Driven! Today, we're diving into the world of warehouse-native analytics with our special guest, István Mészáros, cofounder of Mitzu. Join us as we explore how Mitzu empowers startups and enterprises with a new approach to data analytics. From his beginnings as a CERN physicist to becoming an open-source evangelist and finally a startup founder, István shares his unique journey through the data industry.

    We'll discuss the motivation behind Mitzu's distinct branding, reminiscent of Hello Kitty, and why standing out in today's crowded market is crucial. István also reveals the challenges and strategies of building a data company in Europe, and how Mitzu simplifies analytics by offering a self-service solution without the high costs associated with existing market leaders.

    Timestamps

    00:00 Introducing István Mészáros

    05:30 Shifting Open Source to SaaS

    07:46 Lava-Themed Compliance Solutions Brand

    10:27 Tech Branding and Hello Kitty Insights

    13:46 Optimizing Conversion in Data-Heavy Travel

    16:31 Self-Service Analytics Tool Needed

    19:17 Automated Product Analytics Tool

    23:20 "Budget Constraints and DIY Solutions"

    28:17 Freelancer's Efficient Data Solutions

    29:08 Open Source Tool Productization Plan

    33:13 Navigating Freelance and Startup Challenges

    37:19 Transitioning to Data Engineering

    42:25 Instant Feedback in Hobbies

    43:46 Embracing Feedback in Business Transformation

    49:13 "Hoping AI Takes Over Hiring"

    51:58 Visit Site for Info & Contact

    55:22 "Parenting Boys with Earbuds"

    57:25 "Data Driven: Quantum Podcast Relaunch"

    58 m
  • Barr Moses on How Data Observability Can Save Your Company Millions
    Apr 1 2025

    On this episode of Data Driven, we welcome Barr Moses, CEO and co-founder of Monte Carlo, as she delves into the fascinating world of data observability.

    Join hosts Frank La Vigne and Andy Leonard as they explore how reliable data is crucial for making sound business decisions in today's tech-driven world. Learn why a simple schema change at Unity resulted in a $100 million loss and how Monte Carlo is developing cutting-edge solutions to prevent similar disasters. From discussions on ensuring data integrity to the intriguing potential of AI in anomaly detection, Barr Moses shares insights that might just redefine your understanding of data's role in business.

    Tune in for a podcast that not only uncovers the nuances of data reliability but also touches on the quirky side of tech, like why, according to Google, you should never use superglue to fix slipping cheese on your pizza.

    Moments

    00:00 Monte Carlo: Data Reliability Innovator

    05:45 "Data & AI Observability Engineering"

    09:42 Data Industry's Growing Importance

    12:00 Cereal Supply Chain Data Optimization

    16:03 Data Observability and Lineage

    19:29 GenAI Uncertainties and Latency Concerns

    23:17 "Human Oversight in AI Accuracy"

    24:12 Data Observability and Human Role

    28:01 Adapting to Customer Language

    33:29 Data and Security Management Alignment

    35:20 Data Reliability and Observability Challenges

    38:17 Automated Code Analysis Tool Launch

    42:29 Data-Inspired Childhood

    44:12 Passionate About Impactful Work

    48:52 LinkedIn Security Concerns Highlighted

    53:19 "Data Observability Insights"

    54 m
  • Sanjay Annadate on Data Driven Digital Transformation
    Mar 4 2025

    In this episode, Sanjay joins Frank for a deep dive into the heart of digital transformation and AI-powered automation. Here are some of the key takeaways:

    1. Digital Transformation Evolution: Sanjay reflects on his nearly three-decade journey witnessing the digital shift from infancy to the AI-driven present. He outlines the critical components of digital transformation, including cloud adoption and data prioritization, noting significant changes in business focus over recent years.
    2. Microsoft's Role: Sanjay provides insights into Microsoft's strategic investments in digital transformation technologies, emphasizing their pivotal role in influencing market trends and industry-specific capabilities.
    3. AI-Powered Enhancements: From the widespread adoption of Copilot to the burgeoning concept of agentic AI, Sanjay discusses how AI tools are not replacing but augmenting the productivity of data engineers, offering a glimpse into the future of business processes.
    4. Edge of Innovation: We explore how Microsoft Fabric and other technologies are simplifying complex architectures, allowing businesses to leverage multi-cloud strategies effectively, keeping them at the forefront of innovation.
    5. Real-Life Impact: Sanjay shares compelling examples, like reducing sales briefing preparation time from four days to two minutes, showcasing the transformative power of AI in real business scenarios.

    Whether you're a data engineer, business leader, or just someone fascinated by the data-driven world, this episode is packed with valuable insights.

    Moments

    00:00 Three Decades of Digital Transformation

    05:27 Microsoft's Digital Transformation Dominance

    09:37 Microsoft's Cloud Integration Advantage

    13:22 Red Hat AI's Open Source Approach

    15:33 Microsoft Fabric's Multi-Cloud Integration Strategy

    20:01 "Custom Solutions for Complex Queries"

    21:39 Content Creation Efficiency Unlocked

    26:38 Sales Role Dependency Reduction Tool

    30:06 Agentic AI and Workflow Transformation

    33:29 "Beyond Basic Automation"

    35:05 AI's Impact on Business Expansion

    39:58 Data-Driven Problem Solving Impact

    41:58 Reading Trends in Data Innovation

    45 m
  • Trevor Schulze on How CIOs Can Drive AI Strategy
    Feb 25 2025

    In this episode, Andy Leonard and Frank La Vigne are thrilled to be joined by Trevor Schulze, the Chief Information Officer at Alteryx. Trevor brings an unparalleled perspective on digital transformation, drawing from his impressive tenure at industry giants such as Micron, Cisco, and RingCentral.


    Timestamps

    00:00 "Data Driven: AI & CIO Insights"

    04:32 CIO's Role in AI Evolution

    06:50 CIO's Evolving Role with AI

    11:43 "Embracing Data Democratization"

    16:24 Democratizing Data Access

    19:33 "AI Investment and Optimization Cycle"

    20:55 AI Enhances Tool Configuration Guidance

    24:42 Breaking Free from Vendor Lock-In

    27:41 "Unleashing Shadow AI and Technical Debt"

    31:53 Digital Performance Essential for All Industries

    34:01 Data Privacy Concerns in AI Use

    37:30 AI Democratization Challenges for Enterprises

    42:15 AI Transforming Business Processes

    43:55 Data-Driven Career Journey

    47:13 "Building Trust in Data Analytics"

    52:34 Building Trust in Future Tech

    54 m
  • Lillian Pierson on Revolutionizing Growth Marketing with AI
    Feb 6 2025

    Andy Leonard and Frank La Vigne delve into the exciting world of AI and growth marketing with the renowned Lillian Pierson. Lillian, a globally recognized AI growth strategist and author, shares her unique journey from engineering to data science and her role as a fractional CMO. She provides deep insights into leveraging AI to revolutionize marketing and growth strategies, discusses breaking down the barriers in early data science, and explores the rise of agentic AI.

    This conversation is filled with valuable knowledge, humor, and a reality check on the evolving tech landscape. Tune in to explore how AI and data-driven approaches are transforming industries and why Data Driven is a top pick for AI enthusiasts.

    Moments

    00:00 "Interview with AI Expert Lillian Pierson"

    04:18 Earning a Professional Engineering License

    09:21 Evolution of Data Science Disciplines

    11:08 Career Pivot to Success

    14:01 Data Strategy and AI Insights

    19:19 Marketing's Role in Product Growth

    21:58 Customer Advocacy in Product Development

    26:16 Exploring AI for Content Automation

    28:28 OpenAI Trained on My Style

    30:51 Frank's Podcast Automation Expansion

    33:22 "Delegation vs. Self-Management Discussion"

    37:45 Decoupled, Resilient System Communication

    41:57 Clay-Powered Decision Tech Critique

    45:41 AI Is Essential in Business

    49:09 Debating with ChatGPT's Perspectives

    50:23 Google AI: Generative Podcast Tool

    56:11 Big Data Fallacies Explored

    1 h
  • Dean Guida on AI Insights, Data Analytics, and Business Growth
    Jan 28 2025

    Today, we've got an exciting episode lined up for you. Hosts Frank La Vigne and Bailey dive deep into the tech universe with Dean Guida, the CEO and founder of Infragistics. Dean brings his 35-year journey and expansive experience in technology to the table, reminiscing about the early days of software development and his transition into the data-driven world.

    In this conversation, you'll hear about the evolution of Infragistics from building UI components for Windows to creating sophisticated data analytics and AI tools. Dean also shares insights from his new book, "When Grit is Not Enough," focusing on how entrepreneurs can foster agile, data-driven learning organizations. Whether you're a seasoned developer, a budding entrepreneur, or someone fascinated by the intersection of AI and data, this episode promises a wealth of knowledge and inspiration.

    Join us as we explore technology old and new, from the bygone era of Windows 3.0 to the cutting-edge capabilities of AI today. Plus, hear Dean's personal journey of navigating through various technological and economic shifts over the decades. Make sure to tune in for a discussion that bridges the past, present, and future of tech innovation!

    Show Notes

    00:00 35 Years of UI/UX Innovation

    06:35 "Simplicity, Beauty, and Conversational AI"

    15:29 Enhancing User Trust Through Transparency

    19:52 AI-Driven Learning and OKR Management

    26:20 Kids Reflecting Tech Evolution

    27:12 "AI in Future Work Environments"

    33:14 "Data-Driven Leadership and Team Alignment"

    38:44 Entrepreneurship Beyond Grinding

    48:19 Contextual Understanding in AI Assistants

    51:57 Overprotected Generation's Communication Challenges

    54:55 Generational Impact of Pandemics

    01:00:47 "Data-Driven Podcast: Ranked 38"


    1 h 2 m
  • Arjun Patel on Vector Databases and the Future of Semantic Search
    Jan 21 2025

    Today, we delve into the intriguing world of vector databases, retrieval augmented generation, and a surprising twist—origami.

    Our special guest, Arjun Patel, a developer advocate at Pinecone, will be walking us through his mission to make vector databases and semantic search more accessible. Alongside his impressive technical expertise, Arjun is also a self-taught origami artist with a background in statistics from the University of Chicago. Together with co-host Frank La Vigne, we explore Arjun’s unique journey from making speech coaching accessible with AI at Speeko to detecting AI-generated content at Appen.

    In this episode, get ready to unravel the mysteries of natural language processing, understand the impact of the attention mechanism in transformers, and discover how AI can even assist in the art of paper folding. From discussing the nuances of RAG systems to sharing personal insights on learning and technology, we promise a session that’s both enlightening and entertaining. So sit back, relax, and get ready to fold your way into the fascinating layers of AI with Arjun Patel on Data Driven.


    Show Notes

    00:00 Arjun Patel: Bridging AI & Education

    04:39 Traditional NLP and Geometric Models

    08:40 Co-occurrence and Meaning in Text

    13:14 Masked Language Modeling Success

    16:50 Understanding Tokenization in AI Models

    18:12 "Understanding Large Language Models"

    22:43 Instruction-Following vs Few-Shot Learning

    26:43 "Rel AI: Open Source Data Tool"

    31:14 "Retrieval-Augmented Generation Explained"

    33:58 "Pinecone: Efficient Vector Database"

    37:31 "AI Found Me: Intern to Innovator"

    41:10 "Impact of Code Generation Models"

    45:25 Personalized Learning Path Technology

    46:57 Mathematical Complexity in Origami Design

    50:32 "Data, AI, and Origami Insights"

    52 m