In this episode host Tyra Peirce speaks with Scott Stroz, MySQL evangelist about how Data and AI work together. ----------------------------------------------------------- Episode Transcript: 00;00;09;02 - 00;00;34;16 Welcome to the Oracle Academy Tech Chat. This podcast provides educators and students in-depth discussions with thought leaders around computer science, cloud technologies, and software design to help students on their journey to becoming industry ready technology leaders of the future. Let's get started. Welcome to Oracle Academy Tech Chat, where we discuss how Oracle Academy helps prepare our next generation's workforce. 00;00;34;18 - 00;01;01;19 I'm your host, Tyra Peirce. In this episode, I speak with Oracle, my SQL developer advocate Scott Stroz, about how AI uses data, and the database skills students need to have as they work with AI. So, Scott, you're a returning guest for me. Can you give me a little bit about your background and role at Oracle? For those who may not have listened to our previous podcasts. 00;01;01;22 - 00;01;24;27 Sure. So, first and foremost, I consider myself a full stack developer and I've been a full stack developer for longer than the term full stack developer has actually been in existence. And in that time, the only technology in my stack that has remained constant is MySQL. I used it on my first job as a web developer, and I still use it today, and I've used it pretty much every day in between. 00;01;25;00 - 00;01;47;09 A lot of people, even people in the tech industry, they frequently ask, what does it mean that you're a developer advocate? And there's a lot of people I'm friends with and, I've met in developer relations, where we each kind of have like our own elevator pitch, like a quick, you know, 1 or 2 sit ins answer to basically get people to understand what it is that we do. 00;01;47;11 - 00;02;10;12 And the one that I've come up with is my job is to help developers be better at their jobs and I accomplish, or I hope I accomplish this in, different ways by producing content for developers through blog posts or, videos or podcasts such as this. I also, speak at conferences and I do a guest lectures for colleges and schools. 00;02;10;15 - 00;02;29;26 You are such a wonderful resource. And I we work together quite a bit, and I, I think it's good because I think that a lot of times developers don't understand what they're getting into or the different ways that they can develop and change codes. And I love that. I've got an expert that we can call on, and especially about this new topic. 00;02;29;28 - 00;03;00;18 So, kind of segue into that new topic. How does a I use data to learn SQL? It's a kind of a complicated answer, but it's also probably simpler than some people might think. With the recent release of Heat Wave, which is an AI solution in, Heat Wave that's available on Oracle Cloud, I've started playing around a little bit more than I had been, but it hasn't been with like the front end where I asked, I can I chat by questions, and it would give me answers. 00;03;00;21 - 00;03;22;07 It's been more feeding data into an AI engine, which in this case was heat wave Gen I. And while I was tinkering, I came to the realization that AI is not some black magic that just can pull answers out of thin air when you ask the question. It's just all based on math, and it's not even a new kind of math. 00;03;22;07 - 00;03;41;28 The math has actually been around for a long time, and I gotta say, I was a little a little disappointed. I was kind of hoping for some black magic, but for a genuine AI solution to be able to find pertinent data. So, if you ask a question, I put a chat bot the way it needs to find the answers. 00;03;41;28 - 00;04;14;16 And the first thing we need to do is we need to make sure that we get what are called embeddings of the source data. And this is basically what we use to feed in AI. And if we want it to say get embeddings for a collection of PDFs in any solution, we can actually use heat wave AI to run a stored procedure that will fetch the documents out of a bucket, a storage bucket in Oracle Cloud, and then it breaks down each document into a series of tokens. 00;04;14;18 - 00;04;43;24 Now, in the AI world, a token is a small series of a small set of characters. It can be a single word. It can be multiple words, or sometimes it can actually just be a small part of a longer word. And then using some mathematical algorithms, these tokens are converted into what we call a vector. And a vector is nothing more than an array of hexadecimal data, hexadecimal values that represent that token. 00;04;43;27 - 00;05;11;01 And this is our embedding. And those are stored typically in a database. There are other systems. This isn't just unique. The heat wave Gemini, other Gemini systems have a very similar process for ingesting the source data. And one of the most popular uses of AI solutions is what's called a retrieval augmented ...