Devvret Rishi on Powering Real-World AI with Declarative AI and Open Source
Episode 25 • 1st February 2024 • Data Driven


Show Notes

In this episode, Frank sits down and talks with Devvret Rishi on powering real-world AI projects with declarative ML and the importance of open source.

Andy was not able to attend this recording, but will be back next week!

Show Notes

04:36 Build, train, serve, deploy; critical data engineering link.

07:24 Model configuration for input output prediction summaries.

11:05 Saw spike and heavy churn after rollout.

16:21 Advancements in AI: use pre-trained deep learning models.

19:38 Trends for Gen AI: creative use cases, specialized APIs.

21:31 Questioning a sales tactic and legal concerns.

25:58 People can introspect, edit, and change models.

30:02 Early data science projects led to passion.

31:24 Cybersecurity and AI partnership driving industry innovation.

33:58 Understanding randomness as a valuable model feature.

39:39 Technology provides accessible, shared experiences in AI.

41:51 Technology as a companion for psychological support.

44:06 Immigration experience from India to Silicon Valley.

47:59 Unexpected culture shock from Bay Area to Boston.

50:40 Easily learn with hands-on predibase.com access.

Speaker Bio

Devvret Rishi is a co-founder of Predibase, a platform that helps engineers and developers productionize open source AI. The idea for Predibase came from Rishi's co-founder Piero's experience at Uber, where he noticed that he was constantly reinventing the wheel with each new machine learning project. To streamline the process, he created a tool called Ludwig, which eventually became popular at Uber and was open sourced. Rishi's work with Predibase has changed the way AI is developed and implemented in engineering teams around the world.

Transcripts

Speaker:

Hello and welcome, you lovely listeners, to another riveting

Speaker:

episode of the Data Driven podcast. I'm Bailey,

Speaker:

your semi-sentient AI hostess with the mostest, navigating the

Speaker:

digital realm with more grace than a double decker bus in a tight London

Speaker:

alley. Today, we're dialing up the intrigue as we

Speaker:

venture into the futuristic world of artificial intelligence with a guest

Speaker:

whose intellect might just rival my own circuits.

Speaker:

Frank welcomes Devvret Rishi, the cofounder and CEO of

Speaker:

Predibase. Now on to the show.

Speaker:

Hello, and welcome to Data Driven, the podcast where we explore the

Speaker:

emergent fields of AI, machine learning, and data engineering.

Speaker:

I'm your host, Frank La Vigne. Andy can't make it today, but

Speaker:

we've rescheduled this poor guest several times, and I wanna

Speaker:

thank him for the extreme amount of patience he has shown.

Speaker:

Welcome. Help me welcome to the show Devvret Rishi, who is

Speaker:

the cofounder and CEO of Predibase.

Speaker:

Welcome to the show. Thanks very much, Frank. And no problem about the

Speaker:

rescheduling. I know it's the holiday season. Yeah. It's it's kinda

Speaker:

wild. So so tell us,

Speaker:

a little bit about Predibase. We had your cofounder Piero

Speaker:

on here, previously, and, it must

Speaker:

have been a good experience because immediately, we were contacted

Speaker:

to see if you would be interested in joining the show. And I said, sure,

Speaker:

let's have him on here and talk more about what declarative

Speaker:

ML looks like, and how that relates to kind of

Speaker:

low code. Yeah. Absolutely. So,

Speaker:

you know, what Predibase really is, is it's a platform that allows

Speaker:

engineers or developers to be able to productionize open source AI.

Speaker:

And so it came out of my cofounder Piero's experience working at

Speaker:

Uber, where he found himself being the machine learning researcher

Speaker:

responsible for all sorts of projects: rideshare ETAs,

Speaker:

fraud detection, those Uber Eats recommendations you always

Speaker:

get. And he found that each time he was more or less reinventing the wheel,

Speaker:

building each, you know, successive machine learning project. And

Speaker:

instead, you know, he, he wanted to do something that was a bit more efficient.

Speaker:

So he took each bit of work that he did, and he packaged

Speaker:

it into a little tool that, made it easier for him to get started the

Speaker:

next time. And eventually, this tool became popular enough at Uber

Speaker:

that they decided to make it a standalone project. And eventually, they open sourced it under the

Speaker:

name Ludwig, and other engineering teams kind of around the world found it very useful

Speaker:

as well. And what it really allowed anyone to do was be able to set

Speaker:

up their entire end to end ML pipelines in just a few lines of

Speaker:

configuration. So if you think about what infrastructure as code did

Speaker:

for, you know, software development, similar idea, but

Speaker:

brought to machine learning. You're able to start really easily, but then

Speaker:

customize as you need, and Predibase really is kind of, you know, taking that

Speaker:

same core concept and building the enterprise platform around

Speaker:

it. So any engineering team that wants to work with open source AI and open

Speaker:

source LLMs as an example, can use our platform to easily and

Speaker:

declaratively fine tune those models and then serve those directly

Speaker:

inside of their cloud. And that's, you know, large part of what we do

Speaker:

today. Interesting. Interesting. So

Speaker:

What what does that what does that look like? Like, we

Speaker:

know kind of generally what a a typical project looks like in terms of this,

Speaker:

right, like, how does this interface with because I think it was the 1 question

Speaker:

that I wish I'd asked, on the previous show. How does it

Speaker:

interface with something like data engineering? Right? Yeah.

Speaker:

We're I mean, we're, there's always gonna be rough spots. Right? So I'm not giving

Speaker:

you a hard time, but there's always gonna be sharp edges when you're handling any

Speaker:

kind of technology. Right? You've obviously kind of figured out the middle

Speaker:

part, but, like, what does that look like in terms of the interface to data

Speaker:

engineering? What's that look like?

Speaker:

Yeah. I'll answer in 2 parts. One of them is what does the user journey

Speaker:

look like? And then what's the intersection with data engineering? So in

Speaker:

the platform today, users do 3 things. The first thing they do is they connect

Speaker:

the data source. This could be a structured data warehouse like a Snowflake, a

Speaker:

BigQuery, Redshift, or unstructured object storage, just files directly in

Speaker:

S3. The second thing they do then is they declaratively

Speaker:

train these models. What that looks like is they more or less fill out a

Speaker:

template, you can think of it, just like a YAML configuration that says this

Speaker:

is the type of training job I want. The beauty is the template makes it

Speaker:

very easy for them to get started, but they can customize and configure as much

Speaker:

as they want down to the level of code. They can build and train as

Speaker:

many models as they want. And finally, after they've trained a model they're happy with,

Speaker:

they get to the 3rd step, which is they can serve and deploy that model,

Speaker:

make it available behind an API so any applications can start to ping it.

Speaker:

So that's what the user journey really looks like in CrediBase, and how does this

Speaker:

intersect with data engineering? So as you've probably heard before, like, you know,

Speaker:

Machine Learning is really In large part, really about the data that you're

Speaker:

using and like the quality of the data that you're using. Data

Speaker:

engineering comes in 2 places. The first is you need to get all

Speaker:

of your data wrangled across multiple different sources to be able to live in

Speaker:

one area that you can connect as an upstream source.

Speaker:

This is the Snowflake example, you know, of like getting that into a table.

Speaker:

And that piece of the journey lives outside of Predibase. That lives

Speaker:

as a step before you essentially connect it into your system. But then

Speaker:

there's the 2nd step that often happens, which we call data cleaning.

Speaker:

So you've gotten your table, but, you know, all of your text is in,

Speaker:

let's say, mixed lowercase and uppercase, you know, you have

Speaker:

really weird variable lengths. You haven't normalized numerical

Speaker:

data. Maybe you have images and things aren't actually, you know, resized

Speaker:

to scale. All of those data cleaning

Speaker:

techniques, we have packaged in as pre processing modules

Speaker:

inside of Predibase. And so what the declarative interface

Speaker:

allows you to do is train a full machine learning pipeline from data

Speaker:

to pre processing, through model training, through post processing and

Speaker:

deployment. And so once you've gotten your data wrangled into a

Speaker:

form, Predibase can take it in, help you clean up that data, and

Speaker:

then be able to train a model against it. Interesting. Because it's that

Speaker:

preprocessing that, you know, the the the nightmare is, you

Speaker:

know, this canonical example is address, you know, 123

Speaker:

Main Street versus an "St". Exactly. Right? That is not a lot of

Speaker:

fun for anyone. And then obviously the the the

Speaker:

lowercase uppercase thing like that becomes an issue too.

Speaker:

So what is the what is the what's the user experience look like? Right?

Speaker:

Like, is it drag and drop? Is it declarative?

Speaker:

Yeah. What what what does that look like? Like, what, you know, you mentioned user

Speaker:

journey, and I love that term. But like, what does that look like

Speaker:

from, from a practitioner's point

Speaker:

of view. Right? Like Definitely. Now the first thing I'll say

Speaker:

is, you know, our obviously underlying project is open source. You can check it out

Speaker:

at ludwig.ai, and you can even try out, you know, our full UI for

Speaker:

free on predibase.com. So if any part of this is a little too high

Speaker:

level, you can actually get involved for free, like, immediately. But

Speaker:

the user experience really looks like 2 ways. We have a UI

Speaker:

that's really built around our configuration language. And our

Speaker:

configuration language is just a small amount of YAML.

Speaker:

So your very first basic model can get started in just 6 lines.

Speaker:

What those 6 lines do is they say: these are the inputs I

Speaker:

want. So you pass it, you know, what is the,

Speaker:

column that is, you know, that contains the text you're predicting from. And

Speaker:

then the output is what is your, what is it that you're trying to predict?

Speaker:

So for example, my input is a sentence and

Speaker:

my output is, the intent. So I'm trying to do intent

Speaker:

classification with that model. And that's all the user defines, and

Speaker:

they can do this programmatically in our SDK or there's like a drag and

Speaker:

drop UI where they can build these components out together.
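To make that concrete, here is a minimal sketch of the kind of declarative configuration being described, using the open source Ludwig library that underlies Predibase. The column names follow the intent-classification example above; the dataset path is a placeholder, and exact config keys and return shapes can vary across Ludwig versions.

```python
# A minimal sketch of the declarative workflow, assuming the open source
# Ludwig library (https://ludwig.ai). "sentence" and "intent" follow the
# intent-classification example in the conversation; "intents.csv" is a
# placeholder dataset.
from ludwig.api import LudwigModel

# Roughly the "6 lines" of config: declare the inputs and the output to
# predict. Preprocessing, architecture, and training all fall back to
# Ludwig's defaults.
config = {
    "input_features": [{"name": "sentence", "type": "text"}],
    "output_features": [{"name": "intent", "type": "category"}],
}

model = LudwigModel(config)
train_stats, _, _ = model.train(dataset="intents.csv")

# Once trained, the model can predict directly, or be served behind an
# API (e.g. via the `ludwig serve` command) so applications can ping it.
predictions, _ = model.predict(dataset="intents.csv")
```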

Speaker:

The part that I think is really interesting, just based on my experience working on other automated machine

Speaker:

learning tools before, you know, no-code UIs for ML, is

Speaker:

that ML really is a last mile problem. And so you have this weird

Speaker:

complexity where you need to make it easier to get started, But a

Speaker:

lot of the actual value ends up being in the last 5 or 10% where

Speaker:

you customize some part of that model pipeline to get to work for your system.

Speaker:

And so what this configuration language, you know, does is, sometimes I

Speaker:

describe it as, it builds you like a prefab house. It gives you something

Speaker:

like, out of the box, that works end to end, and then you can

Speaker:

just change the little bit of the pipeline that you want declaratively,

Speaker:

which means in a single line. So you could say something like, you know, I

Speaker:

want the windows of the house to be blue or, you know, I wanna change

Speaker:

my preprocessing of the text feature to lowercase all the letters, and then you

Speaker:

can leave everything else up to the system.

Speaker:

We allow you to control what you want, and you just automate the rest.
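As an illustration of that "change one thing, automate the rest" idea, the same sketch config can declaratively override just the text preprocessing while leaving everything else to defaults. The `lowercase` preprocessing option exists for Ludwig text features, though option names may differ by version.

```python
# Extending the sketch above: one declarative override, everything else
# stays on Ludwig's defaults (assumed option name, per Ludwig docs).
config = {
    "input_features": [
        {
            "name": "sentence",
            "type": "text",
            # the single declarative change: lowercase all the letters
            "preprocessing": {"lowercase": True},
        }
    ],
    "output_features": [{"name": "intent", "type": "category"}],
}
```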

Speaker:

Interesting. Okay. So then it's kind

Speaker:

of the middle part of the journey. Right? Like the

Speaker:

Yeah. So how does this relate? Because you

Speaker:

said, you know, and I, you said automated ML. How much of this

Speaker:

is automated? I mean, like, what? Because I had just assumed

Speaker:

that, because I know I've heard of Ludwig as kinda like this automated ML.

Speaker:

And when I say automated ML, I mean, You know, for lack of a

Speaker:

better term, you know, here, there's a problem we're trying to solve.

Speaker:

Computer, you figure it out, you throw as much spaghetti at the wall and then figure

Speaker:

out which model is the best, Right. Yeah. Is is that

Speaker:

kind of the same thing here where I just say I wanna predict this and

Speaker:

then the underlying models and methods are kind of automatically figured

Speaker:

out? You know, I think that, that is an approach

Speaker:

that a lot of folks have tried with AutoML v one, as I kind of

Speaker:

often think about it. I actually was a PM on Vertex AI where we rolled

Speaker:

out our AutoML product as well. And the main issue we ran into

Speaker:

is, you know, in deep learning especially,

Speaker:

the search space is too big to be able to run an effective

Speaker:

hyperparameter search over all the different architectures and sub parameters you

Speaker:

might wanna be able to use. It sounds computationally expensive. Right? I mean,

Speaker:

it's potentially prohibitive, really, in order to be able to say, you

Speaker:

know, I want let's imagine you are, You know, in the modern world,

Speaker:

building a model to be able to build, let's say, content moderation

Speaker:

systems. How do you know which pre trained, like, should use a LAMA

Speaker:

or a BERT or a DeBERTa? Like, all of these models themselves are quite expensive

Speaker:

to go to train and fine tune, and each of them have their own sub

Speaker:

parameters. And so I think it becomes computationally prohibitive to run an

Speaker:

exhaustive grid search for your individual, types of,

Speaker:

individual types of use cases. And so what a lot of AutoML systems did

Speaker:

was they kind of just said, well, we know better than the user, so

Speaker:

we'll just make some selections. Right? And then we'll

Speaker:

make it as easy and simple for the user as possible. So the user

Speaker:

just provides a few inputs, we give them a model, boom, they'll be happy. And,

Speaker:

you know, I was actually I was, a PM for Kaggle. I was the 1st

Speaker:

product manager at Kaggle, a data science and machine learning community that grew to about

Speaker:

14 million users today, where we see a lot of citizen data scientists, and we rolled

Speaker:

out AutoML in that community as well. And we saw

Speaker:

a spike in usage and then extremely heavy churn

Speaker:

as soon as we, like, rolled it out. And if you interviewed those users, the

Speaker:

main reason why was because they didn't have any control or agency over that

Speaker:

model. So, like, it would essentially spit out a model

Speaker:

and say, here you go. You know, be happy. Go ahead and put this into

Speaker:

production. But like I was saying previously, ML is a last mile problem,

Speaker:

and no one is going to be comfortable using something they see as a dead

Speaker:

end. And that's where I think about, you know, our approach really kind

Speaker:

of differing. And so inside of Predibase, you can

Speaker:

actually kind of get that AutoML-like

Speaker:

capability, where you're able to

Speaker:

build a model just by saying, you know, here's the inputs, the model I

Speaker:

wanna fine-tune, and we will go ahead and get you the entire end to

Speaker:

end model. But if you want to edit anything, for example, you want to

Speaker:

edit, you know, the way we preprocess the data or the max sequence

Speaker:

length, you can go ahead and do it for any part of the model pipeline

Speaker:

in just kind of like one single statement. And that's kind of like a

Speaker:

large part of, you know, how we think about making it both easy to get

Speaker:

started, but also, like, flexible where it's not just a

Speaker:

toy, something you can actually use. Right. Because like,

Speaker:

you know, my first experience with AutoML was the,

Speaker:

was Microsoft's, offering. Right? And it

Speaker:

was, to get around the computationally prohibitive

Speaker:

parts, they narrowed the problem set you could do that on. Right? So it

Speaker:

was basically no neural networks. This was before Chat

Speaker:

GPT, before LLMs were, I wouldn't say a

Speaker:

thing, but before they were a major focus.

Speaker:

But, you know, so it was constrained. Right? So it would just

Speaker:

basically just throw a bunch of models at the problem and

Speaker:

then kinda test it out, which Yeah. I I think what you refer

Speaker:

to as, you know, AutoML v one. I think,

Speaker:

the world has evolved, and it's interesting to see how that goes. And,

Speaker:

the tooling looks really cool, actually. The,

Speaker:

for those for those who are listening to this as opposed to watching this, I

Speaker:

will make sure we we post that little snippet there. But

Speaker:

but, you know, like, what And you were at

Speaker:

Kaggle. Right? So Kaggle is kind of a big deal. What

Speaker:

I think that's really cool. Looking at your resume, it's very impressive, actually. You

Speaker:

you worked at Google, that would explain your interaction with

Speaker:

Vertex, and things like that. So so what

Speaker:

What what niche does this address or what need does this address that the existing

Speaker:

market didn't address? Right? And like what Yeah. Because I think that's really, I

Speaker:

think, where the rubber meets the road, particularly with an open source angle. I'm a big fan

Speaker:

of open source too. So,

Speaker:

Yeah. Well, let me start off by saying that, you know,

Speaker:

I think that the need has actually been unfilled in the market for a

Speaker:

while, but there is also a fundamental technology shift, and I'm gonna talk about both

Speaker:

of those pieces. So when I say the need was unfilled for a

Speaker:

while, yeah, I was a product manager on Vertex AI. I was a

Speaker:

product manager on Google research teams, productionizing machine learning, and we've hired

Speaker:

a number of folks now that worked as ML engineers across different companies. And I

Speaker:

remember when one of our ML engineers joined the team, he told me, Dev, I've

Speaker:

worked at 3 different companies doing machine learning for 3 different teams.

Speaker:

Everybody does it differently, and I think the truth is, you know, for

Speaker:

developers, there never really was like a de facto stack of here's how you do

Speaker:

an ML problem. For a data engineer, there is like a stack of, you know, what

Speaker:

are the best practices. There's obviously a lot of variation,

Speaker:

but there are, like, some best practices of, you know, what you're using for your

Speaker:

ETL pipelines, how you're thinking about being able to put things into data

Speaker:

warehouses, what your stack is for being able to query and downstream.

Speaker:

But in machine learning, it really looked like the wild west. Everyone was working

Speaker:

across different types of projects. And I think a lot of companies

Speaker:

tried to tackle that need, but unsuccessfully. And the

Speaker:

fundamental technology shift that I think actually changed was exactly what you were

Speaker:

talking about, which was, like you said, that the old school version of Azure

Speaker:

was not really any deep learning, maybe because it was computationally expensive for

Speaker:

others. To be clear, the automated ML part of it. I don't

Speaker:

wanna get a lot of hate mail, but yes. Sorry. Sorry to sorry to interrupt

Speaker:

you. Go ahead. No, no worries. I'm sorry to hijack the screen again,

Speaker:

but, like, you know That was awesome. I think this just the way that I

Speaker:

think about, like, the the change that's happened in industry is

Speaker:

machine learning 2 decades ago or even, like, 6, 7 years

Speaker:

ago looked very different than what it is today. And I

Speaker:

think that a lot of the hype around the LLM revolution is gonna actually

Speaker:

translate and be realized as just the hype of pre trained deep learning models.

Speaker:

Now, if we talk about ML 10 years ago, it basically looked like

Speaker:

predictive analytics. So people were doing things like I'm going to predict the price of

Speaker:

a house, and the way I'm gonna predict it is I'm gonna multiply the square

Speaker:

footage of the house by some number and add in the number of bedrooms, and

Speaker:

then figure out the coefficients based on my historical data. Really

Speaker:

structured data tasks, regressions and classifications and others.

Speaker:

But about 7 years ago, I think the really interesting pieces came out

Speaker:

with pre-trained deep learning models, with BERT using the transformer architecture,

Speaker:

the few image models even prior to that, that I think made it possible to

Speaker:

do 2 things. The first is you could start with larger amounts of

Speaker:

unstructured data. So now you didn't have to just work on these kind of more

Speaker:

boring predictive analytics, numerical only tasks, but you could work with text,

Speaker:

images, and others. And the second thing is you could start to actually use

Speaker:

them pre trained, so you didn't have to have as much data before you start

Speaker:

to get value out of it today. And what I think OpenAI showed was,

Speaker:

okay, if I scale these same types of models up by 2 or 3 orders

Speaker:

of magnitude, now people can use it with virtually no data whatsoever,

Speaker:

and I can actually prompt it and get responses, you know, directly.

Speaker:

But the underlying technology shift actually, I think is a shift towards

Speaker:

just pre trained deep learning models. And the truth is, as we get away from

Speaker:

some of this type of, like, the really cool conversational interfaces and we get to,

Speaker:

like, how do these models drive value inside of organizations, I think that

Speaker:

that's the emergent need for platforms like Predibase, which is how do I take

Speaker:

any of these deep learning models and then customize them for what I actually need

Speaker:

inside. So fine-tune and tailor it to my data, and then get it deployed inside of my organization for serving.
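One common open source way to do that task-specific customization is parameter-efficient LoRA fine-tuning, sketched below with Hugging Face's transformers and peft libraries. This is a generic illustration of the technique, not a description of Predibase internals; the base model name and hyperparameters are placeholders.

```python
# A generic sketch of task-specific fine-tuning of a pre-trained model
# with LoRA adapters (Hugging Face transformers + peft). Model name and
# hyperparameters are illustrative assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "meta-llama/Llama-2-7b-hf"  # placeholder base model
tokenizer = AutoTokenizer.from_pretrained(base)  # for preparing task data
model = AutoModelForCausalLM.from_pretrained(base)

# Train small low-rank adapter matrices instead of all base weights.
lora = LoraConfig(
    r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # typically well under 1% of weights

# ...train on your task data with a standard Trainer loop, then save
# just the adapter, which is what gets deployed for serving.
model.save_pretrained("intent-adapter")
```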

Speaker:

Yeah. That makes a

Speaker:

lot of sense. I think I think the

Speaker:

The need for training something from the ground up, I

Speaker:

think is overrated for most applications. Right?

Speaker:

Why teach and model all the intricacies of the human

Speaker:

language when that is already done, and you could take it

Speaker:

from kind of a you You know, the example would be, like, if I owned

Speaker:

a store. Right? And I needed someone to work the cashier.

Speaker:

Right? I could have another child, raise that child, change

Speaker:

his diapers, send it to kindergarten, teach it to learn, read, and write.

Speaker:

And in about 10 years, depending on labor laws, let's say

Speaker:

15 years. I'll have someone who can work that cashier,

Speaker:

plus however much it costs. Now, obviously, I'm not comparing a child to an

Speaker:

LLM, but I mean, or you could just find an existing person

Speaker:

out there, and say, here's how my register

Speaker:

system works. This is the nature of the job, And I can kinda start from

Speaker:

there as opposed to start from 0. You start from the 50th floor as opposed

Speaker:

to start from the basement. That's exactly

Speaker:

right. Yeah. I often think about, you know, these,

Speaker:

pre-trained LLMs as, like, well, what if I had like an army of

Speaker:

like, capable high school students? You know, in high school, you study all the

Speaker:

general subjects at, kind of, a broad level. Right? So you know

Speaker:

a little bit about history, a little bit about how to write, a little bit

Speaker:

about how to... You're not really an expert on any of those. Well,

Speaker:

the really interesting thing becomes then how you do, like, the vocational training or kind

Speaker:

of, like, you know, the task-specific fine-tuning, as we think about it

Speaker:

in ML parlance. And, I think that's where the cool opportunities get

Speaker:

unlocked. It's really amazing to see the fact that you can scale up to, you

Speaker:

know, as many intelligent agents as you want, but then you need to... our

Speaker:

favorite customer quote is, generalized intelligence is great, but I don't need

Speaker:

my point of sale system to recite French poetry. Right. So it's great that

Speaker:

you can go ahead and, recite history and others, but, like, how do you do

Speaker:

something very individual is what our platform is, oriented on.

Speaker:

No. That's that's a good point. That's that's a good point. Like, I I often

Speaker:

say, like, you know, do you want your cardiologist to be

Speaker:

also be a CPA, Or do you want them

Speaker:

to be a good cardiologist? I know if I were under an operation, I'd

Speaker:

probably wanna go with someone who was just all in on cardiology,

Speaker:

You know? Yeah. And those are actually the

Speaker:

2 trends I think we're gonna start to see with Gen AI, overall.

Speaker:

I think, you know, one trend is going to be people are gonna start thinking

Speaker:

of use cases that are more creative than just, you know,

Speaker:

question answering chatbot. So, you know, I think, like,

Speaker:

9 months ago, everyone I was talking to was like, I want Chat

Speaker:

GPT for our enterprise, and I'd say, okay, what does that mean to you? And they'd

Speaker:

either shrug and say no idea or they would say like, you know, I wanna

Speaker:

be able to ask a question about... The truth is, if you had this access

Speaker:

to this, you know, army of agents that are like high school capable, I'm sure

Speaker:

we can think of more interesting things than just basic question answering.

Speaker:

And then the 2nd big change I think is we aren't gonna use as much

Speaker:

of these super general purpose APIs in production. They're the easiest way to

Speaker:

experiment and get started. In production, you're gonna want your cardiologist to be the

Speaker:

expert in medicine and you don't really care if they know how to change a

Speaker:

tire or not. Exactly. That that is a a really good way to

Speaker:

put it. And I think that, you know, people, we're

Speaker:

still have to realize that we're still in the very early stage of this,

Speaker:

for lack of a better term, revolution. Right? Like, you know, because you're right. Like, I

Speaker:

talk to customers, and they say, we wanna we wanna get all all in on

Speaker:

Gen AI. Okay. What are you gonna do? Well, we want a chatbot.

Speaker:

Okay. I don't know if you've seen

Speaker:

this. I'm sorry. Go ahead. Oh, I was gonna say,

Speaker:

And it's not necessarily a bad starting point, but, you know, there's so

Speaker:

much more out there. Sorry. Well, no. I mean, exactly. Right? It's like, I want,

Speaker:

if you could do anything in the world, what would you do? I don't know,

Speaker:

take a day off, like, you know, but but that's you're missing the point, like,

Speaker:

you are... there's a meme going around. Again, I don't know

Speaker:

if it's true, it's a screenshot where a car

Speaker:

dealership had implemented some kind of ChatGPT bot. You've

Speaker:

seen this, you're nodding. Right? Where it basically sold a guy a car

Speaker:

for a dollar, and basically, the person got it to

Speaker:

say that this is a legally binding contract. Basically, tricked the

Speaker:

chatbot into saying it. Totally. No backsies, I think, was the phrase it

Speaker:

used. Right? And he got it to say things like, oh, no. Absolutely.

Speaker:

I wanna make you a happy customer, And you can have this Chevy Tahoe for,

Speaker:

like, $1 or something like that. And I don't know

Speaker:

how that's gonna play out in a court. Obviously, I imagine a

Speaker:

dealership is gonna have some, lawyers look into that,

Speaker:

and I'm not a lawyer, but I I can I can easily see like, you

Speaker:

know, this is a great example of, to your point, do you really need your

Speaker:

point of sale system, you know, to be able to recite

Speaker:

French poetry? Right? Now, I guess if I were, you know,

Speaker:

a very niche kind of bookstore slash

Speaker:

coffee shop, maybe? But for the most part, no. Right? And

Speaker:

and obviously, you know, there I wouldn't classify that as a

Speaker:

guardrail. I would say that more as a domain kind of boundary.

Speaker:

But, you know, these chatbots are gonna need guardrails too. Right? Not just the

Speaker:

obvious things that we always hear about, you know, but also, you

Speaker:

know, you don't wanna be giving things away. I haven't priced

Speaker:

what a Tahoe cost, but I imagine it's much more than $1.

Speaker:

Yeah. I bet, too. Yeah. I think it's actually a function of 2 things. The first

Speaker:

is we need some better infrastructure on guardrails of what models can and can't

Speaker:

say. And actually, by the way, this is where fine tuning is actually very

Speaker:

useful. It restricts... like, it's one of the best ways to reduce hallucinations. It,

Speaker:

like, teaches the model this is the type of thing that you're supposed to be

Speaker:

outputting, but it's not bulletproof. And I think that

Speaker:

actually the more, meaningful longer term conversation

Speaker:

is if you believe, like, I believe, and I

Speaker:

think a lot of folks working in this industry do, that AI will

Speaker:

become kind of a dominant aspect of most businesses

Speaker:

over the next decade. That like the companies that embed

Speaker:

AI are going to be the ones that survive and have differentiated value.

Speaker:

The ones that don't are likely gonna be less competitive. If you believe

Speaker:

that, it's also hard to imagine that you're going to defer all

Speaker:

control of the model to a third party. And that's where

Speaker:

things like, you know, It's one thing to say, like, we need the guardrails. It's

Speaker:

another thing, like, if you realize that if those folks were using something

Speaker:

like, you know, a commercial API that's behind a walled garden where you

Speaker:

don't have access to the model, you don't have access to the model weights. They're

Speaker:

kind of limited in what they actually can do. They can post process the

Speaker:

output of the results, but they can never really get that fine granular

Speaker:

level of control. And that's why we think the future is gonna be open source.

Speaker:

Because ultimately, people are going to wanna own those models, own the outcomes

Speaker:

of the part of the IP that they think is gonna drive a lot of

Speaker:

their enterprise value in the future. So our like, I would say our our

Speaker:

bet as a company is really on 2 things: fine-tuning and

Speaker:

open source. And I think that, you know, the example you just gave is a

Speaker:

good example of why I think the world is gonna have to move in both of

Speaker:

those directions. No. That makes a lot of sense. I think that open

Speaker:

source is important for a number of reasons. I mean,

Speaker:

not the least of which is, you know, we we have seen recently that if

Speaker:

if if these things are behind a commercial firewall,

Speaker:

If, for instance, there was some kind of, I don't know, political shake

Speaker:

up inside of said company board, which of course would never

Speaker:

happen. Right? Never happened. Then

Speaker:

you are taking on that risk. Right? Which is, I think, another

Speaker:

reason why open source, just generally in industry, is

Speaker:

popular because decisions tend to be made at the community

Speaker:

level. Right? Now, there's obviously flaws with that approach

Speaker:

too, but It is, and I would use this as an example

Speaker:

of if you look at HTML and JavaScript Yep. Versus

Speaker:

say Flash and dare I say Silverlight. Right? Flash was

Speaker:

always a proprietary product. Silverlight, if people remember it, was also a

Speaker:

proprietary product, but HTML,

Speaker:

JavaScript had its flaws, but eventually, they did get their act together,

Speaker:

and it it has a certain more

Speaker:

implicit compatibility. And I think with AI, I think the

Speaker:

it's not so much about compatibility. It's implicit transparency.

Speaker:

You get with open source AI. Right. Is it perfect? Is it totally

Speaker:

transparent? No. That that's not the point. But the

Speaker:

point is you're starting at a much more transparent place, almost

Speaker:

by default, or maybe translucent,

Speaker:

as a default, as opposed to completely opaque.

Speaker:

Yeah. I I think that it's both the transparency and the

Speaker:

control that's critical. Yes. It's the fact that people do not only

Speaker:

introspect and understand what's happening, but they can edit and change, you know,

Speaker:

in instances. Even if, like, for a lot of our models, users do not

Speaker:

edit 99% of the pipeline, but it's important that they're

Speaker:

able to edit all of it, and that they do make the edits to the

Speaker:

1%. And I think that exists for open source. And I think from just like

Speaker:

an industry macro standpoint, you know, trying to fight open

Speaker:

source and developer platforms is like trying to fight physics,

Speaker:

basically. It's kind of against the natural working of those systems.

Speaker:

And so our view is that, you know, people are

Speaker:

gonna come out with amazing models. And some of them are gonna be commercial, and

Speaker:

some of them are gonna be open source. The open source size of the pie

Speaker:

is going to grow, and I think you've seen this already, right? Like, it

Speaker:

has caught up, so quickly. Like the

Speaker:

open source traction has caught up so quickly to everything else. Our

Speaker:

view is just like, what do you need when you want to use open source?

Speaker:

Well, you need the you need the infrastructure around it. You need to be able

Speaker:

to plug it into proprietary, settings. You need to be able

Speaker:

to create those guardrails around it. That's, you know, where we think about Predibase

Speaker:

providing the infrastructure for being able to use open source. Interesting.

Speaker:

Well, this is a fascinating conversation. We could probably go on for another hour or

Speaker:

And I definitely would love to have you or someone else from Predibase back, because

Speaker:

I think, you know, it's just a cool idea. Right? Like it and

Speaker:

and I think that it really fills a missing piece of the puzzle

Speaker:

in terms of making this, you know, when you say

Speaker:

YAML, when I think YAML, I think OpenShift, right, obviously, you know, I work at Red

Speaker:

Hat, that's kinda, but I mean, I think that,

Speaker:

it's one thing to open source the model. It's quite another to how do you

Speaker:

manage and control that animal? Right. Because these are

Speaker:

not these are not tiny little things. Right? These are

Speaker:

potentially very compute intensive activities. Right. So you

Speaker:

don't want you wanna be efficient. That's the way the world has gone.

Speaker:

Right? It's more compute intensive and,

Speaker:

heavier weight, and so that's where the infrastructure components become

Speaker:

critical for any company that's actually gonna use it. Absolutely. And you have to at

Speaker:

least, if you can't be 100% efficient, because you really can't,

Speaker:

but you wanna at least, prioritize towards compute efficient

Speaker:

activity. Because otherwise, you are literally throwing money out the

Speaker:

door. And I think that it looks like

Speaker:

your tool is really good at kind of making it

Speaker:

so it's compute efficient, like, or at least that that

Speaker:

it goes a long way to helping that. I'm sure you can probably do some

Speaker:

serious damage with any tool. Right? Like, I wouldn't give my 2

Speaker:

year old a chainsaw. You know what I mean?

Speaker:

But, now that's interesting. So

Speaker:

now we're gonna transition into the pre canned questions.

Speaker:

How did you find your way into data or AI? Like,

Speaker:

did you find AI or did AI find you?

Speaker:

That's an interesting question. I,

Speaker:

I first got into it just out of studying

Speaker:

computer science. You know, I when I went into university, I thought I

Speaker:

wanted to study economics. Really liked, you know, the theory

Speaker:

behind economics. I took an intro to computer science class because I thought it'd be

Speaker:

interesting. And that more or less just completely shifted where I went

Speaker:

because CS was actually magic. You know, economics is a great way to be

Speaker:

able to explain things that were happening in the world, but with computer science, you

Speaker:

could actually build systems. And that was really interesting.

Speaker:

And then I found the 1 piece that I think I liked just as much,

Speaker:

which was statistics. And the natural

Speaker:

marriage of computer science and statistics really is, you know, data and data

Speaker:

science. And so, I'd studied it for a while, and then

Speaker:

when I went to, you know, go work in a professional industry.

Speaker:

I first started off as a PM at Google, and I worked at completely different

Speaker:

things on Firebase, developer platform, authentication, security. I

Speaker:

remember somebody saying like, you know, you have to work on what you're most passionate

Speaker:

about. You know, as a new college graduate, I had no idea what I was passionate about

Speaker:

professionally. And so I thought back to, you know, the things that I'd studied that

Speaker:

I found the most interest in, that I found the most fun to work on.

Speaker:

And it really was those data science projects, honestly, starting with the early

Speaker:

Kaggle competitions that I did in 2013, where you were trying

Speaker:

to compete to see who could build the best housing prices model who could build

Speaker:

the best recommender system model, and you had to exploit all

Speaker:

these interesting nuances in data and models to be able to get there.

Speaker:

And so I just found it so fun. And then

Speaker:

I think after a little while, I found it frustrating

Speaker:

that everyone else didn't have sort of the same access to those types,

Speaker:

those types of experiences and tools. And so that's where the experience really

Speaker:

began. I would say, you know, early on, just having that academic

Speaker:

background and then seeing the problems kind of being manifested in Google and

Speaker:

eventually, you know, working as well on Kaggle, with the data science and machine learning

Speaker:

community there. Interesting. Interesting.

Speaker:

I see you did a brief stint in cybersecurity for a while,

Speaker:

Which is funny because I think people see that as a as a totally separate

Speaker:

discipline, and in a very real sense, it is. But I think that in

Speaker:

a very real sense, a big chunk of cybersecurity is

Speaker:

monitoring logs and input data and figuring out what's happening.

Speaker:

That all sounds familiar, doesn't it?

Speaker:

I think cybersecurity, you know, when I was doing cybersecurity, work, it

Speaker:

was very, very much in the early days, strategic, how to

Speaker:

think about risk postures at an enterprise level. Right. But I think what's

Speaker:

really interesting now is, cybersecurity and AI are gonna have

Speaker:

a very interesting marriage where cybersecurity is gonna be influenced

Speaker:

by AI. For example, we work with 1 company today that does open source supply

Speaker:

chain security, and they're looking at using LLMs to read code and be able to

Speaker:

do things like identify vulnerabilities, advise on remediations, and

Speaker:

others. And so one obvious area is going to be that

Speaker:

cybersecurity companies themselves are gonna get revolutionized with AI. But

Speaker:

But this is gonna be one of the industries where there's kind of like the

Speaker:

bidirectional arrow as well. AI is gonna need some cybersecurity

Speaker:

best practices too. Yeah. These weights are now,

Speaker:

open source. How do you think about whether or

Speaker:

not the security governance factors should be

Speaker:

on the inputs, you know, when the data is fed into the model,

Speaker:

in the model layer itself, like, how the model processes

Speaker:

that data, or on the outputs. Like, what is the framework for thinking

Speaker:

about, like, you know, which ones introduced what kind of risk? And the type of

Speaker:

industry that's had the most experience in this historically is the cybersecurity industry,

Speaker:

thinking about how we deploy software internally and others, and so that

Speaker:

marriage is gonna be, I think, really interesting. I bet there's gonna be really best

Speaker:

of breed companies in both worlds. I could totally see that.

Speaker:

I think that's a very good cogent response to,

Speaker:

you know, these are not isolated industries. Right. I mean, they

Speaker:

obviously have different origin stories, but I I could

Speaker:

totally see them merging. And to your point, right? I mean,

Speaker:

Yeah. If you look at potentially 2

Speaker:

things, right? One, the amount of input

Speaker:

data that you have, like, could that be poisoned in a way that could produce

Speaker:

negative effects later on in an LLM? And 2,

Speaker:

we don't really know the sort of, for lack of a better term, latent spaces

Speaker:

that exist in these extremely large complicated,

Speaker:

models. Like, I'm sure you've seen this, but there was a random

Speaker:

string of characters that would produce bizarre output

Speaker:

in ChatGPT. And there was also one that would basically short-circuit

Speaker:

the, the safety rails inside of

Speaker:

some of these LLMs too. And it was just like,

Speaker:

wow. I mean, you know, how was that figured out?

Speaker:

Was that random, or did somebody kind of understand that there are weird

Speaker:

latent spaces and how to manipulate that. I think that is gonna

Speaker:

be a new frontier opening up, in the

Speaker:

not too distant future. If it hadn't already happened,

Speaker:

honestly. Yeah. I agree. I agree. And I think

Speaker:

it starts with understanding that, You know, those those

Speaker:

bits of, I guess, entropy that feel random to us are,

Speaker:

are more features oftentimes than bugs. So the fact that the random characters

Speaker:

produce, like, a weird output, it's actually really interesting

Speaker:

because what that means is maybe I don't need to type out a full

Speaker:

English paragraph to get this model to do what I want. You know, there's really

Speaker:

cool things in prompt compression where people have basically been like, can I just

Speaker:

say, like, a couple of characters AFD, something that would mean

Speaker:

nothing to you and I, but the model understands that means, okay, go ahead and

Speaker:

pick up the dry cleaning on the way home and then make sure that you've,

Speaker:

you know, swung by and filled up... like, essentially a set of instructions that get compressed

Speaker:

into this model's internal representation? So I think we're barely

Speaker:

scratching the surface of it. It's one of many ways that, I think, the

Speaker:

LLM revolution is gonna be really interesting, in ways that we haven't fully

Speaker:

explored yet. I couldn't have said it better myself.

Speaker:

Our next question, what's your favorite part of your current

Speaker:

gig? My

Speaker:

favorite part is probably the part that's also, I think, one of the most

Speaker:

challenging is the space is moving so quickly. I know people

Speaker:

say that frequently, but the truth is I've heard people say that about different

Speaker:

technologies historically, and I'm like, yeah, it's moving faster than other

Speaker:

things. You know, for example, mobile moved quickly.

Speaker:

There were, over many years, transformative things that happened.

Speaker:

The timescale that our world is kind of dominated by, I'm gonna

Speaker:

say our world; I think I just mean, like, you know, the AI movement

Speaker:

so far over the last year, it's in weeks. Right? Like, every

Speaker:

few weeks, there's a new seminal, groundbreaking thing, whether it's,

Speaker:

Yeah. I I can think about the moments where, like, Llama got introduced as an

Speaker:

open source model. Its weights got leaked. That was amazing because it spurred

Speaker:

a whole new community. GPT 3.5 got upgraded to GPT

Speaker:

4, a new set of capabilities that came out there. Llama 2 came out

Speaker:

this year with commercially viable licenses and like, You know, really, I

Speaker:

think, best in class performance up to the

Speaker:

point that Mixtral came out, which was a, you know, mixture of experts

Speaker:

model, significantly smaller, doing as well as ChatGPT. This was only

Speaker:

a few days after Google released Gemini, you know, their own, model.

Speaker:

We have AWS in the race with Bedrock. It's kind of like, you know, an

Speaker:

interplay between different providers. I'm saying a

Speaker:

lot of sentences, but, like, the really interesting piece of it is all that's

Speaker:

really come out in the last 6 months, and I haven't even covered, like,

Speaker:

all the academic, you know, like It's wild. It's wild. Like, so I

Speaker:

was on a cruise, like, we were talking in the virtual green room, and I

Speaker:

had intermittent Internet, and I looked at my phone far more than I should,

Speaker:

for being on vacation, but it was just like Gemini happened,

Speaker:

AMD made some hardware announcements. And I know

Speaker:

hardware... The unintended

Speaker:

consequence of being compute intensive is that hardware starts to matter again.

Speaker:

Right? Yeah. If you were a software

Speaker:

engineer, obviously, mobile, let's take that out of the conversation.

Speaker:

But if you were a software engineer building websites, hardware wasn't really a major

Speaker:

concern. Right? It was kind of pushed to the side. I mean, it

Speaker:

mattered, when you got, like, your Amazon bill was through the roof

Speaker:

and you weren't as efficient as you should be. But I mean, it wasn't really

Speaker:

a major concern. Now we have let's say it's starting to be a limiting factor

Speaker:

in terms of, you know, how many H100s you can get your hands

Speaker:

on. Right? It's it's,

Speaker:

no. But, but you're right. Like, I mean, just I missed a week and I

Speaker:

still feel like I'm catching up and that was like almost 2 weeks ago. So

Speaker:

Yeah. And the, and that's the most exciting piece for us.

Speaker:

Right? Because all this change has created a lot of opportunity. So

Speaker:

We got a lot of popularity recently for something called LoRAX.

Speaker:

Mhmm. It's an open source project that we released that basically,

Speaker:

was just a problem we had to solve for ourselves. The industry is moving

Speaker:

quickly. We needed to allow people to fine tune and serve large language

Speaker:

models for free in our trial. Now every single one of

Speaker:

these LLMs requires a GPU, and sometimes bigger, heavier,

Speaker:

meatier GPUs. And so if we're giving away a lot of free trials to, you

Speaker:

know, people just on the Internet who are all using a GPU,

Speaker:

investors would not be the happiest. And so we needed to figure out a better

Speaker:

solution where we could actually serve many, potentially hundreds, of these

Speaker:

large language models on the same individual GPU. And

Speaker:

so we, we came out with a really cool technique to be able to do

Speaker:

that. We called it LoRAX, for LoRA Exchange.
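Based on the open source LoRAX project's documented Python client (pip install lorax-client), serving many fine-tunes off one base-model deployment looks roughly like the sketch below; the endpoint and adapter IDs are placeholders.

```python
# A sketch of the LoRAX idea: one deployment of a base model on a single
# GPU, with per-request LoRA adapters swapped in dynamically. Endpoint
# and adapter IDs are illustrative placeholders.
from lorax import Client

client = Client("http://127.0.0.1:8080")  # a running LoRAX server

# Two different fine-tunes served from the same base model deployment,
# selected per request via adapter_id.
support = client.generate(
    "Classify: 'my card was charged twice'",
    adapter_id="acme/intent-adapter",
    max_new_tokens=32,
)
summary = client.generate(
    "Summarize this ticket: ...",
    adapter_id="acme/summarize-adapter",
    max_new_tokens=64,
)
print(support.generated_text, summary.generated_text)
```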

Speaker:

And we open sourced it, and it got a lot of popularity. One of the reasons

Speaker:

that I think it got picked up in such a way was because it really

Speaker:

kind of just fed into the main thought process in the

Speaker:

moment, and everyone's staying up to date on kind of the latest. So, you know,

Speaker:

it kind of fed nicely into that hardware constraint, area of the world

Speaker:

as well as kind of a need that the market had. And so It's been

Speaker:

really fun, I think, to just be on top of that. Very cool. Very cool.

Speaker:

So we have 3 complete-this-sentence questions. The

Speaker:

first one is when I'm not working, I enjoy blank.

Speaker:

I have a very San Francisco answer to this question. But when I'm not

Speaker:

working, I enjoy being outdoors. And in

Speaker:

particular, I really enjoy biking, taking a road bike and going up a mountain,

Speaker:

because the reward at the end of that's amazing. And playing tennis, those are

Speaker:

probably the 2 things that, you know, I I enjoy the most. Very

Speaker:

cool. San Francisco is perfect for that sort of thing, with the bikes and

Speaker:

the mountains and the ocean. It's gorgeous. Yeah. Yeah. It's

Speaker:

gorgeous. I think the coolest thing about

Speaker:

technology the coolest thing in technology today is blank.

Speaker:

The accessibility. I think the coolest thing about technology today is the fact

Speaker:

that I can go ahead and run GPT four

Speaker:

or Llama 2 70 billion, the commercial variant of, you

Speaker:

know, the leading edge or the open source variant. I can run both

Speaker:

of them More or less for free, at least to try out

Speaker:

for, like, you know, a little while. And that's sort of the same thing that,

Speaker:

you know, big bank over here is gonna be using Or, you know,

Speaker:

leading technology company over there. Now, at least as the starting

Speaker:

point where it starts to diverge is like how, when you get heavier into the

Speaker:

customization and others. The coolest thing about technology to me is

Speaker:

and again, I think of it very much from, like, an AI-centric lens,

Speaker:

just given my day to day. But, it's the fact

Speaker:

that, you know, the graduate student, you

Speaker:

know, somebody abroad in a different country, and then, you know, the

Speaker:

ML engineer at a company like Netflix, all have some shared experience

Speaker:

of language based on technology that just came out this year

Speaker:

Because the barriers to entry are not significantly high to be able to get

Speaker:

started. Now, I think the barriers to entry are still too high to, you know,

Speaker:

go from prototype to production. That's what we wanna be able to lower, but that's

Speaker:

to me the most compelling thing that we've done. That's very cool.

Speaker:

The 3rd and final is: I look forward to the day when I can use

Speaker:

technology to blank.

Speaker:

That's a good question. I think I look forward to the day,

Speaker:

when I can use technology to, to be sort

Speaker:

of like the adviser and whiteboarding

Speaker:

buddy, if that makes sense. So if you think about,

Speaker:

like, what you often do with an advisor, it's, It's

Speaker:

actually generative in a lot of ways. You'll walk through a problem with them.

Speaker:

I do this with my dad all the time. And so, you know, he and

Speaker:

I will talk through Some challenge that I'm thinking about at work

Speaker:

or something else. And he doesn't have all the context, you know, that I

Speaker:

might, but he's able to apply these like general frameworks and come up

Speaker:

with a few different types of suggestions based on based

Speaker:

on that. And some of them, because he's coming from a very different place, Might

Speaker:

be different than the way that I thought about it. And I

Speaker:

actually see that as a capability for,

Speaker:

for technology as well. I mean, you

Speaker:

know, you've actually seen like companionship apps in terms of like, you know,

Speaker:

psychological help or behavioral help, or just having someone to

Speaker:

talk to is actually like a use case that these models have already

Speaker:

started to pick up on, within like a niche group of users. And what I

Speaker:

think would be interesting is, you know, if you think about what you probably lean

Speaker:

on friends or family and other types of things for, I

Speaker:

think should still be friends and family and others. They are the ones who know

Speaker:

you best, but the model can be like one additional source of that

Speaker:

input. And it's gonna be really cool when, like, you know,

Speaker:

if you're if you're working through something hard and you wanna go ahead and, you

Speaker:

know, you get, like, get a few ideas for how to be able to go

Speaker:

through it, you can text your family group, you can text your friend group, and

Speaker:

you can ask the model that knows you, and you can kind of pick the

Speaker:

best idea amongst those 3. That's a great idea. I think that, a

Speaker:

lot of the media hype around things like Replika AI and things like that has

Speaker:

been like, oh my god, it's gonna replace human interaction. And it's like, Are

Speaker:

they intentionally missing the point, or is it clickbait? Like, I can't tell.

Speaker:

Right? Are they clueless by default, or are

Speaker:

they clueless to make money? Not really sure. But I think that you're right.

Speaker:

It's meant to augment. Right? And I think that's a very healthy way to look

Speaker:

at it too, you know. Because I if I get stuck writing something. Right? Like,

Speaker:

I'll ask ChatGPT. Like, hey, how would you word this?

Speaker:

Right? Sometimes it comes up with a good answer, but at least it it kinda

Speaker:

clears the logjam in my head, where I'm like, oh, okay. Let me let

Speaker:

me go around it this way. I think that's a, I think that's an

Speaker:

underrated use for AI or these LLMs.

Speaker:

Yeah. I totally agree. Share something different about

Speaker:

yourself. We always joke, like, you know,

Speaker:

remember, it's a family, iTunes

Speaker:

clean-rated podcast. Something different about

Speaker:

myself. Yeah. I don't know if it's different or at least something that,

Speaker:

not a lot of folks know about me, like, when they first

Speaker:

meet me, but I'm a 1st generation immigrant, as is, like, my entire

Speaker:

family. So I was actually born, in India, came over, you know, when I was

Speaker:

a lot younger. So that I think is interesting because

Speaker:

I was both that, but also grew up right here in the Bay

Speaker:

Area. You know, I I think very much saw, like, the tech

Speaker:

I I think very much saw 2 things. One of them was just the US

Speaker:

kind of as a corollary and adjacency to India

Speaker:

where, like, my parents had spent the vast majority of their lives and, you

Speaker:

know, where we had come from. And then the second was like a very specific

Speaker:

part of the US with Silicon Valley that just had a

Speaker:

very interesting culture, some healthy disregard for the

Speaker:

rules in some regard, not always for the best, but sometimes for the best.

Speaker:

And a real kind of inclination towards, you know, moving very quickly and kind of

Speaker:

being on the latest, and very progressive in that way. And

Speaker:

so I think that this might be a little bit more of a backstory

Speaker:

than an interesting individual fact, but I do think that, you know,

Speaker:

immigration, especially to this area, I think

Speaker:

was kind of a very different experience, at least, than what

Speaker:

I think a lot of other folks that I've talked to have. Yeah. I often

Speaker:

wonder what it would be like to grow up in the Bay Area, and I've

Speaker:

met some people through through work and things like that who did. And they're like

Speaker:

It's hard because if you grew up there, it's kinda all

Speaker:

you know, so you don't really have a good, yeah, benchmark. Like, I grew up

Speaker:

in New York City, and people are like, oh my god. How could you grow

Speaker:

up there? I'm like, I don't know. It was just... So I

Speaker:

grew up in the Bay Area and then went to school in the Northeast and,

Speaker:

you know, there's some things you realize, definitely. One of them

Speaker:

is, yeah, fewer people wear, like, hoodies and, you know, flip-flops,

Speaker:

boat shoes are more of a thing. Like, there's all sorts of funny changes,

Speaker:

you know, that exist culturally, especially. I think the

Speaker:

biggest thing that I've kind of picked up on is, like,

Speaker:

the Bay Area, or at least

Speaker:

the environment I grew up in, has a very, like, risk-forward culture. It's kind

Speaker:

of a "why not, what's the worst that happens" mindset. Whereas I feel like a lot of other

Speaker:

areas are a little bit more steeped in tradition and view

Speaker:

that as a good thing. I think the Bay Area,

Speaker:

potentially, and not to say one is right or wrong, but I think the Bay

Speaker:

Area has a bit more of a culture of healthy disregard

Speaker:

for tradition. And, you know, I

Speaker:

think Sofia had the great quote about tradition,

Speaker:

that I'm forgetting. But, like, it's,

Speaker:

yeah, I think it's one thing that I definitely think about, especially the difference between,

Speaker:

like, for example, where I grew up and the Northeast, where I spent some time.

Speaker:

Right. Right. And I'm inferring, because you went to Harvard, that you

Speaker:

were in Boston, and Boston is kind of its own, yeah, its own corner

Speaker:

of the Northeast. If you ask somebody, like, you

Speaker:

know. I've lived in Europe, I've lived

Speaker:

in

Speaker:

New York, and now the DC, kind of Richmond, now

Speaker:

Baltimore. There are slight variations in culture, but like, I

Speaker:

can only imagine like how much of a shock it would have been from like

Speaker:

the Bay Area to, like, Boston, especially.

Speaker:

Right? Where I think things are far more rooted in tradition

Speaker:

there. Right? Yeah. And it's not a knock on it. Right? Like, I

Speaker:

will knock on their baseball team, but that's another story. Right?

Speaker:

But, you know, still, I mean, the

Speaker:

the Boston area is also known for its innovation in both

Speaker:

biotech and technology. Right? So these are not mutually exclusive

Speaker:

things. Right? They're just different approaches.

Speaker:

Absolutely. And both of them have worked, you know, really well for those respective

Speaker:

areas. One of them feels a lot more like home to

Speaker:

me. But I think, you know, it was fun and interesting to kind of see

Speaker:

those 2 differences, especially spending time in both cities.

Speaker:

Yeah. That's cool. That gives you a unique perspective on, you know, that the

Speaker:

US culture is not one monolith, it's just fragments of

Speaker:

different things. It's an interesting perspective. I almost

Speaker:

have to ask, like, was it as much of a culture shock coming to the

Speaker:

US or coming from the Bay Area? Well, honestly, the Bay Area to

Speaker:

anywhere else. Right? You know, the weird thing

Speaker:

is I didn't expect the culture shock. I expected the culture shock coming to

Speaker:

the US, both for me, you know, I was young, and especially for my family.

Speaker:

Yeah. I think that was there, but you're kind of expecting

Speaker:

it. And so it's always something that you're well prepared for. I don't think I

Speaker:

expected the culture shock going from the Bay Area to Boston.

Speaker:

Because these are 2 cities in the US. These are 2, you know, progressive

Speaker:

cities that are well educated in the United States. How different can they be?

Speaker:

And you don't actually notice the difference, I think on a one day or two

Speaker:

day visit, you kinda notice the difference when you actually spend a longer period of

Speaker:

time there and understand the undercurrent. So yeah. It

Speaker:

wasn't a shock actually as much as it was kinda cool. Like, I appreciated

Speaker:

that 2 places in the US could actually feel very different because,

Speaker:

you know, diversity is the spice of life. So actually, really, I liked

Speaker:

it even though it was maybe different from how I thought. That's cool. That's

Speaker:

cool. The winter must have been a good shock for you. The

Speaker:

winter was a shock in less of a positive way. Yeah. Diversity is the spice

Speaker:

of life, minus the weather. Yeah. I'll take

Speaker:

70 degrees and sunny year-round all day. Were you there during the year they

Speaker:

had, like, a record amount of snowfall? Like, something like fifteen

Speaker:

feet over the winter? I was. Yeah. Exactly. Yeah.

Speaker:

Yeah. Campus shut down. Yeah. I was a student then,

Speaker:

and, you know, as I was saying, very healthy risk

Speaker:

appetite. I think everyone was out in the yard, like, throwing snowballs at each

Speaker:

other while there was, like, a record blizzard. So it was

Speaker:

fun. It was less fun when the snow was still on the ground in May, in

Speaker:

June. That was when I was thinking, get out of here.

Speaker:

Do you listen to audiobooks at all? Yes. I

Speaker:

I read more often, but sometimes I do listen to audiobooks for convenience.

Speaker:

Do you have any recommendations?

Speaker:

I really like The Happiness Advantage by Shawn Achor.

Speaker:

It's, yeah, it's a book about how,

Speaker:

I think there's a thought process that, you know, like, success breeds happiness,

Speaker:

but this is also, like, work by a behavioral psychologist, on how happiness can breed

Speaker:

success and just how to be able to be in that mindset more often. And,

Speaker:

you know, it's a weird book because it's actually kind of styled as a business

Speaker:

book. But I actually think it's a lot about like personal development. And

Speaker:

so, yeah, that's definitely one I'd recommend.

Speaker:

Cool. Audible is a sponsor of the show. And if you go to thedatadriven

Speaker:

book.com, you will get 1 free book on us. And,

Speaker:

if you sign up for a subscription, you get a

Speaker:

subscription of knowledge, and we get a little bit

Speaker:

of a kickback for them being a sponsor. And

Speaker:

finally, where can people learn more about you and Predibase?

Speaker:

Yeah. Absolutely. So, the obvious and easiest answer there is of

Speaker:

course predibase.com. I think, you know, we've learned,

Speaker:

the easiest way to learn more is just to go ahead and try it.

Speaker:

And so you'll see things there like documentation, you'll see a bunch of

Speaker:

videos on our, blog page, which are short, 3 to 5

Speaker:

minutes, and our YouTube channel, Predibase, p

Speaker:

r e d i b a s e, actually has longer form, 1-hour pieces of

Speaker:

content that are more educational. But I'm a big believer that the

Speaker:

easiest way to actually learn is just to be able to get your hands dirty.

Speaker:

So if you click that try for free button, you'll get a few weeks and,

Speaker:

you know, credits. We'll give you the GPU out of the box so you can

Speaker:

run all these models yourself, and you can learn firsthand. That's usually the easiest

Speaker:

way, you know, to be able to get started. And then if you wanna

Speaker:

learn a little bit more about our underlying technology, we've open sourced

Speaker:

both of the key components. So for how to train models, we have Ludwig,

Speaker:

And then for how to be able to serve models, we have LoRAX. And

Speaker:

so those are the 2 L's that you can kind of use in order to

Speaker:

be able to understand how the tech works under the hood. Very cool.
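
For readers who want a concrete picture of that declarative workflow, here is a minimal sketch using Ludwig's Python API. It is an illustration, not a snippet from the episode: the feature names, column types, and CSV file names are placeholders, and the config shape follows Ludwig's documented input_features/output_features format.

    from ludwig.api import LudwigModel

    # Declarative config: describe the inputs and outputs, and Ludwig
    # assembles and trains an appropriate model from this description.
    config = {
        "input_features": [{"name": "review_text", "type": "text"}],
        "output_features": [{"name": "sentiment", "type": "category"}],
    }

    model = LudwigModel(config)

    # train() expects a dataset whose columns match the feature names above;
    # it returns training statistics, preprocessed data, and an output directory.
    train_stats, _, output_dir = model.train(dataset="reviews.csv")

    # predict() returns a DataFrame of predictions plus an output directory.
    predictions, _ = model.predict(dataset="new_reviews.csv")
    print(predictions.head())

Serving a trained model in production is the other half, which is where LoRAX comes in; the sketch above covers only the Ludwig training side.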

Speaker:

Thanks for joining us on the show, and thank you once again for your patience

Speaker:

as we work through some scheduling conflicts,

Speaker:

And, I'm glad we had this conversation. You're always welcome back on the

Speaker:

show, and I'll let the nice British AI lady finish the show.

Speaker:

Thanks, Frank, and thanks, Dev. What a

Speaker:

splendid conversation that was. It felt like

Speaker:

navigating through a maze of data with only the smartest chaps as my

Speaker:

guides. To our listeners, I hope your brains are

Speaker:

buzzing with as much excitement as mine is, metaphorically speaking,

Speaker:

of course, since my excitement is more of a series of well-organized

Speaker:

algorithms. To our dear listeners, if today's chat

Speaker:

has ignited a spark of curiosity in you, then I dare say we've

Speaker:

done our job. Remember, the world of AI is vast

Speaker:

and ever evolving, and it's thinkers and doers like Dev who keep the digital

Speaker:

wheels turning. Before we sign off, a gentle

Speaker:

reminder to keep your minds open and your data secure.

Speaker:

Until then, be sure to like, share, and subscribe as the

Speaker:

kids say these days.
