Cloud Native Compass  By  cover art

Cloud Native Compass

By: David Flanagan & Laura Santamaria
  • Summary

  • The podcast that helps you navigate your way through Kubernetes and Cloud Native trends.
    ©️ Rawkode Academy, 2023
    Show more Show less
Episodes
  • InfluxDB 3 & Rust
    Nov 8 2023

    InfluxDB 3.0 Rewrite

    InfluxDB, a time series database, underwent a major rewrite to create InfluxDB 3.0, also known as IOx. The decision to rewrite the database was driven by the need for strict control over memory management and high performance. The project started as a research endeavor and gradually gained traction within the company. The team decided to build around projects under the Apache Foundation, such as Apache Arrow and Apache Data Fusion. In April 2022, InfluxDB 3.0 was officially announced, aiming to improve performance, scalability, and cost-effectiveness for users.

    IOx Database Engine

    The new database engine, IOx, is designed to handle various types of observability and monitoring data, including metrics, traces, and logs. It aims to provide a single store for all these signals, eliminating the need for separate databases. However, querying the data efficiently is still a challenge that the team is working on. The goal is to make IOx the go-to solution for storing and querying observational data, not only for server infrastructure monitoring but also for sensor data use cases.

    Challenges and Considerations

    Working with logs, tracing, and structured events in time series databases poses challenges. The dynamic and inconsistent nature of schemas in logs and tracing use cases can make extracting structured fields difficult. Time series databases also have limitations in handling tracing front ends and require an index to map trace IDs to individual traces. While metrics, logs, and traces are the gold standard for observability, there is room for improvement in terms of usability and performance.

    Flux and Data Fusion

    Flux, a scripting language developed for InfluxDB 2.0, addresses user requests for more complex query logic and integration with third-party systems. InfluxDB 3.0 incorporates a parser in Rust to translate SQL queries into a Data Fusion query plan, benefiting from the performance optimizations of Data Fusion. However, bringing Flux to InfluxDB 3.0 proved challenging due to the large surface area of Flux and limited time and resources. Updating the Flux engine to use the 3.0 native API could potentially resolve these issues.

    InfluxDB Development and Open Source Licensing

    InfluxData is focused on improving the core query engine of InfluxDB and enhancing its capabilities and performance. They have created a separate community fork of Flux to allow collaboration on its development. Paul Dix, the co-founder, believes that true open source should be about freedom and expresses his intention to keep InfluxDB 3 as a permissively licensed project. He discusses the recent license change by HashiCorp and the growing distrust in the developer community towards VC-backed open source projects. Putting InfluxDB into a foundation may not be feasible due to the lack of multiple contributors.

    Show more Show less
    56 mins
  • Trust and Validation in AI
    Nov 1 2023

    Here are 5 key takeaways from this episode that you don't want to miss:

    1️⃣ The People Problem: Laura Santamaria raises an important concern about verifying AI-generated outputs and tackling the challenge of the "people problem" in AI development.

    2️⃣ Verifying Data Authenticity: JJ discusses the challenge of proving that a data blob originated from a specific model and how this issue is being addressed by companies like IBM through pile cleaning and legal penalties.

    3️⃣ AI Misconceptions: We debunk some common misconceptions about AI, including the belief that it is an all-knowing fact machine.

    4️⃣ Trusted AI: IBM's approach to building trusted models, with dedicated engineers responsible for cleaning and verifying data, is explained. Plus, we discover IBM's partnerships with Hugging Face to leverage the open-source ecosystem.

    5️⃣ The Impact of AI: We delve into the potential positive and negative implications of AI, and how the rapid advancement of this technology presents challenges with trust and validation.

    💡 Fun Fact: Did you know that 95% of open-source language models are trained on a data set called "the pile," which contains pirated and copyrighted material? Discover why this has implications for copyright and patent laws!

    As always, the conversation in this episode is engaging and eye-opening. JJ Asghar provides insightful perspectives and sheds light on the future of AI development. Don't miss out on the valuable information shared!

    Questions We Covered

    1. How can the problem of untrusted data in AI models be effectively addressed?
    2. Should companies like OpenAI and Microsoft be required to provide their data sets for verification purposes? Why or why not?
    3. What are the potential risks and challenges associated with using AI technology without proper regulation?
    4. Should AI creations be eligible for copyright protection? Why or why not?
    5. How can we ensure the accuracy and trustworthiness of AI-generated data, especially when it comes to extracting information from sources like PDFs?
    6. What are some potential positive impacts of AI technology, and how can we maximize its benefits while minimizing its negative implications?
    7. How can the rapid advancement of AI technology be balanced with the need for trust and validation?
    8. In what ways do copyright and patent laws need to evolve to accommodate AI technology?
    9. What are the implications of China having its own set of laws and approaches to technology that may differ from other countries?
    10. How can individuals navigate and better understand the AI space in order to make informed decisions and contributions?

    Show more Show less
    43 mins
  • The Magic of eBPF
    Aug 25 2023

    We're back with an exciting new episode of Cloud Native Compass, and this time we're diving deep into the captivating world of eBPF. Join Laura Santamaria, David Flanagan, and special guest Liz Rice as they unravel the mysteries and explore the incredible potential of this powerful technology!

    In this episode, you will learn:

    1. The two parts of eBPF: Discover the kernel program and the user space interaction that make up the magic of eBPF.
    2. Programming with eBPF: Explore the different options for writing eBPF programs, from bytecode form to higher-level languages like Rust.
    3. Compilers and SDKs: Learn which compilers, like clang GCC and the Rust compiler, support eBPF bytecode and how SDKs can make your interaction with eBPF programs easier.
    4. The Evolution of Packet Filtering: Trace the history of packet filtering, from its humble beginnings to the powerful and versatile capabilities of eBPF.
    5. The Widespread Adoption: Uncover the sudden rise in eBPF's popularity, its impact on observability and performance, and the role it plays in modern networking.
    Now, for a fascinating fun fact from the episode: Did you know that eBPF is now considered Turing complete? That's right! With its combination of features, eBPF has surpassed its humble start as a packet filtering tool and has become a full-fledged technology powerhouse.

    Creators & Guests

    • Laura Santamaria - Host
    • David Flanagan - Host
    • Liz Rice - Guest

    • (01:46) - What is eBPF?
    • (06:45) - The Rise of eBPF
    • (09:40) - Why is eBPF Interesting?
    • (16:00) - Who's using eBPF?
    • (19:20) - eBPF for Developers
    • (24:00) - Troubleshooting eBPF
    • (27:11) - Future of eBPF
    Show more Show less
    32 mins

What listeners say about Cloud Native Compass

Average customer ratings

Reviews - Please select the tabs below to change the source of reviews.