HockeyStick #6 - AI-powered Developer
Episode 6 • 6th May 2024 • HockeyStick Show • Miko Pawlikowski
Duration: 00:45:26

Shownotes

Generative AI in Software Development: A Future Without Coders?

In this episode of HockeyStick, Miko Pawlikowski interviews Nathan B. Crocker, CTO at Checkr and author of 'AI-Powered Developer,' exploring the impact of generative AI tools like ChatGPT and Copilot on software development. They discuss the book's insights into using AI as a junior developer, its appeal to different levels of software practitioners, and experiences with generative AI for coding tasks. The conversation covers AI's role in designing, testing, refactoring, and understanding code, addressing job security concerns for software engineers. They also tackle the effectiveness of local LLMs versus online models, the evolving landscape of AI in coding, and future directions for developers using AI tools.

00:00 Welcome to HockeyStick: Exploring Generative AI for Code

00:23 Diving Into AI-Powered Development with Nathan B. Crocker

00:44 The Practical Guide to AI in Coding: Insights and Experiences

02:47 The Revolutionary Impact of AI on Software Development

04:46 ChatGPT: A New Era of Coding Assistance

08:57 The Magic of Copilot in Your IDE

10:40 Navigating the Challenges of Custom Code with AI Tools

14:46 Designing Software with AI: Beyond Just Code

17:45 Refactoring and Upgrading with AI: A New Frontier

20:27 The Quirks of AI: From Training Data to Practical Use

23:34 Exploring the Limits of AI in Software Testing

24:01 Exploring AI in Testing and Development

24:25 Harnessing AI for Software Testing

25:08 AI's Role in Depreciation Calculations and Asset Management

25:59 Understanding and Describing Code with AI

28:47 Security Insights and Ethical Considerations in AI

32:05 AI in Infrastructure and Deployment

37:36 Evaluating Local LLMs and Their Capabilities

42:18 The Future of Coding and AI: Predictions and Perspectives

44:01 Closing Thoughts and Next Steps for the Author

Transcripts

Speaker:

I'm Miko Pawlikowski, and this is HockeyStick.

Speaker:

Today, we're talking about generative AI for code.

Speaker:

you know, whether software engineers should start worrying

Speaker:

about their job security.

Speaker:

We're talking about chatting to LLMs to help you design, build,

Speaker:

test, and understand software, both online and offline, as

Speaker:

well as tools like the Copilot.

Speaker:

I'm joined by Nathan B.

Speaker:

Crocker, the author of AI-Powered Developer published by Manning

Speaker:

and the CTO and co-founder at Checkr, a tokenization startup.

Speaker:

He just finished his book and we're covering his experience with

Speaker:

using AI as a junior developer.

Speaker:

Welcome to this episode and thank you for flying HockeyStick.

Speaker:

So I had the pleasure of reading your book.

Speaker:

I would say that it's a real practitioner's guide to using

Speaker:

AI to work with code.

Speaker:

It's fairly light on details, so nothing to scare people off.

Speaker:

It basically jumps right into how to get value out of artificial

Speaker:

intelligence, if you're working with code.

Speaker:

Would you like to tell us a little bit how you ended up writing this book?

Speaker:

I actually had a number of co-workers and other developers who were telling me,

Speaker:

"Hey, you got to check out this stuff".

Speaker:

It's really something.

Speaker:

So this was November of 2022.

Speaker:

and, I had no idea what it was.

Speaker:

I, I started looking into it.

Speaker:

It piqued my interest, but I needed a deep motivation to actually

Speaker:

dive into it and really incorporate it.

Speaker:

Because if you just use something periodically, lightly, you're

Speaker:

not really engaged with it.

Speaker:

So I pitched Manning the book, they liked the idea, and it

Speaker:

really is my journey through learning how to use these tools.

Speaker:

my journey is really going to mirror the practitioners that are

Speaker:

reading it as they work through it.

Speaker:

So who is it for?

Speaker:

What's the requirement to get value out of this book?

Speaker:

You should have some familiarity with Python.

Speaker:

if I just take a step back, it's really for anyone.

Speaker:

early journey, mid journey, as a software developer, as a software architect.

Speaker:

I suppose as a business analyst, you could derive some value.

Speaker:

All the examples are in Python. There's a couple of microservice chapters, so you

Speaker:

should have some familiarity with that, but it really is about taking you through

Speaker:

things that you may or may not be familiar with, and working with gen AI to really

Speaker:

teach you some of these concepts as well.

Speaker:

Yeah, maybe you graduated from computer science program and you're like fresh out

Speaker:

of college and you want to know how to take your development to the next level.

Speaker:

That would really be the target demo.

Speaker:

I think the next level part of it is actually the keyword here.

Speaker:

One of the first things that people see when they open your book, I

Speaker:

think it's literally like page one, is the silent promotion.

Speaker:

Everybody all of a sudden overnight became an engineering manager.

Speaker:

Who basically can have a pretty good junior working for them for free, with

Speaker:

no labor laws or anything like that.

Speaker:

Why is that such a big deal?

Speaker:

it's such a big deal because it used to be the rubber duck.

Speaker:

Like you'd have a partner that you can work with, that you can tell to

Speaker:

do things that you can bounce ideas off of, you can look to for some

Speaker:

answers, because their thinking is going to be different from yours.

Speaker:

so they might have a different tack and a different approach.

Speaker:

you could farm out a lot of the work that you don't want to do.

Speaker:

The repetitive boilerplate, CRUD operations.

Speaker:

It's all there, but like any junior developer, a super smart junior developer

Speaker:

that is, they can produce some bafflingly poor thinking and wind up with just some

Speaker:

nonsense that isn't necessarily usable.

Speaker:

so you gotta watch them.

Speaker:

I suspect that if we were to partition the body of listeners to this, or maybe even

Speaker:

developers in general, There'll be almost none in the category of I haven't heard

Speaker:

of it or I haven't even played with that.

Speaker:

Other than people who might be on a very long vacation, "Cast Away" style.

Speaker:

I don't see how you can really escape that.

Speaker:

Then you probably have a category of people.

Speaker:

'Okay, I played with it.

Speaker:

I went and talked to ChatGPT, it spat out some code.

Speaker:

I saw roughly what you can do, but I never really got much value out of that'.

Speaker:

And then the category of people who actually go and use it day-to-day.

Speaker:

Because it is helping their job.

Speaker:

There might be some caveats to that.

Speaker:

Obviously, data privacy and not knowing where the code actually goes and not

Speaker:

knowing whether it's going to be trained on and that kind of stuff that can,

Speaker:

throw a wrench in some people's work.

Speaker:

So should we start with the category of people who might have played with

Speaker:

it a little bit, and, they went to ChatGPT, asked the same questions,

Speaker:

"how tall is this building?"

Speaker:

And, "can you search that for me?"

Speaker:

And they stalled there. From a basic development point of

Speaker:

view, what kind of value can we extract just chatting to ChatGPT?

Speaker:

What can it do?

Speaker:

I had an interesting experience very early on when I was doing research

Speaker:

for the book, I had it on my phone, I would carry it around and just

Speaker:

periodically I would hand my phone to someone just to give them a taste.

Speaker:

I remember I was at a party and a woman, she was, an expert, it was

Speaker:

archeology or maybe it was art history.

Speaker:

And she really started asking it some questions.

Speaker:

Some of them were factually incorrect, but over the course of her conversation

Speaker:

with ChatGPT, she became really impressed with the accuracy and justification for

Speaker:

some of the answers it was providing.

Speaker:

She would say like, 'why did you refer to this as the oldest, example of this

Speaker:

architecture or painting style?' And it gave her a fairly convincing reason.

Speaker:

I think there's value in, exploring this technology, even if you're not

Speaker:

going to use it in your everyday development effort, just because it

Speaker:

gives you a sense of the future.

Speaker:

things are going to dramatically change, there are implications that are

Speaker:

rippling all throughout academia now.

Speaker:

I feel that you just should keep yourself informed about what's coming, where

Speaker:

these changes are going to be made, and how it could potentially affect you.

Speaker:

So from the point of actually going and using that, the way that, you described

Speaker:

in your book, what's the experience like at this very lowest level of just

Speaker:

launching ChatGPT, asking some questions.

Speaker:

Because I remember doing that a while back: it would spit out some code and I had to

Speaker:

copy-paste it, and I had to add imports.

Speaker:

how good is it right now?

Speaker:

You're fresh off writing chapters about that.

Speaker:

How useful is it?

Speaker:

It largely depends on whether you're using GPT-3, 3.5, or 4.

Speaker:

4 is much better, exponentially better than 3.5, but it's solid, I would say.

Speaker:

There are a lot of caveats.

Speaker:

You're most likely going to be looking at code that is one to two years old.

Speaker:

So it was trained on data that was potentially from an old version of a library.

Speaker:

They may have had breaking changes.

Speaker:

Especially if it's a fast-moving language. Like, I was trying

Speaker:

to build something in Rust, and I asked ChatGPT to generate

Speaker:

some code and it wouldn't even compile with the newest compiler.

Speaker:

It's good, I would say, but it's not perfect.

Speaker:

It's a long way from perfect.

Speaker:

What are some remarkable limitations that you bumped into?

Speaker:

You must have seen some interesting, funny stuff.

Speaker:

Can you share some of that?

Speaker:

I haven't seen really outrageous things where it was just making up,

Speaker:

libraries or frameworks or anything.

Speaker:

I would say it's unremarkable in its banality.

Speaker:

the Rust example was probably the best one, but it wasn't great.

Speaker:

I wish I had something funny. To jump ahead a little bit:

Speaker:

I had a really hard time in the testing chapter trying to get it to

Speaker:

write good tests. And, you know, I don't even remember if I mentioned it

Speaker:

in the book, but at one point I just gave up and wrote the test myself.

Speaker:

Because it was very hard to get it to understand what the unit under test was,

Speaker:

what I was actually trying to accomplish with my test.

Speaker:

No matter how much context I added, it was always trying to do

Speaker:

something just completely different.

Speaker:

when I was reading that chapter, I was also thinking at the

Speaker:

back of my head, "What does it say about the training data?"

Speaker:

The tests are so poor.

Speaker:

yeah.

Speaker:

Yeah.

Speaker:

Are all those tests just so poorly written that, that's where I end up?

Speaker:

But yeah, let's touch on that in a sec

Speaker:

who needs tests?

Speaker:

We'll test it in production.

Speaker:

It'll be fine.

Speaker:

There you go, testing in production, everybody.

Speaker:

yeah, don't worry.

Speaker:

We can cut that out.

Speaker:

that was a joke.

Speaker:

the most annoying bit was just that you have to chat.

Speaker:

So then obviously you've got things like Copilot that you also cover in your book

Speaker:

or that just plug into your VS Code or whatever.

Speaker:

What's the added value of that?

Speaker:

Is it just that it works as autocompletion and it's more syntactic,

Speaker:

and you don't have to copy the code?

Speaker:

Do you get any other bonuses out of that?

Speaker:

the real value to a tool like, Copilot, again, versus ChatGPT is it

Speaker:

does keep you in the IDE and it can keep you in that flow state, where

Speaker:

it's only you and the code, right?

Speaker:

Whereas you're not having to pull yourself out of the context,

Speaker:

move to a different window.

Speaker:

And for certain projects, the actual code quality for Copilot was better.

Speaker:

Just on a line-by-line basis, or class-by-class basis.

Speaker:

that's almost certainly due to the fact that it was fine

Speaker:

tuned specifically for code.

Speaker:

That's the main benefit I've found. It's always adding helpful suggestions,

Speaker:

sometimes not-so-helpful suggestions too.

Speaker:

Like I don't need it to add a comment about the name of the file that I'm

Speaker:

working on, but, if I can start to define a method and then it gives me a

Speaker:

possible implementation, even if I don't accept it, it's at least, showing me one

Speaker:

possible implementation that I could use.

Speaker:

maybe it's not the exact one, the one I wanted, but having that suggestion

Speaker:

can be very valuable to clarify my thinking or to even, change it.

Speaker:

Maybe it's a better implementation than I was thinking of.

Speaker:

So those are the major advantages I found.

Speaker:

It always works very well in demos, where you've got the usual suspect,

Speaker:

an HTTP server in a popular framework in a popular language, and

Speaker:

you do something that has been done to death a million times on GitHub.

Speaker:

How well does it work with a custom code base?

Speaker:

Oftentimes you find yourself in a situation where your company has a

Speaker:

decent or large amount of code libraries, stuff the model obviously

Speaker:

wasn't trained on, because it's not in the public domain, it's not on GitHub.

Speaker:

How well does it work in these kinds of situations?

Speaker:

You'll face some challenges there if you're working on a very niche

Speaker:

problem. For example, you're trying to write an API gateway or something.

Speaker:

I suppose there are probably good open source examples out there, but if you're

Speaker:

working in a fairly niche industry, everything is going to be closed source.

Speaker:

You'll probably struggle with its suggestions.

Speaker:

I don't think they're going to be particularly helpful.

Speaker:

Although it's going to try, and in that trying, maybe it does inspire you.

Speaker:

maybe it does give you one possible implementation.

Speaker:

it's really good at just generating something.

Speaker:

and if nothing else, It can help you plan your approach.

Speaker:

And you could ask it questions inline and have it answer. One of the more

Speaker:

interesting things that I found as I was working with Copilot specifically,

Speaker:

one of the almost magical things is you type in a question in a comment

Speaker:

and then suddenly you prompt it for an answer and it'll give you one.

Speaker:

You're like, that's fairly interesting.

Speaker:

I wouldn't have thought to take that tack, but then I used it over and over.
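
For readers who haven't seen this trick, here's a minimal, hypothetical Python sketch of the shape of it: the comment is the prompt, and the body below it is the kind of completion Copilot tends to offer. The function itself is an illustrative assumption, not output captured from any model.

```python
# Q: What's an idiomatic way to load a JSON config file in Python?
import json

def load_config(path: str) -> dict:
    # The kind of body a comment-prompt typically elicits:
    # open the file, parse it, and return the resulting dictionary.
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)
```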

Speaker:

yeah.

Speaker:

There are these moments of magic, and a famous quote about

Speaker:

sufficiently advanced technology being indistinguishable from magic.

Speaker:

Yeah.

Speaker:

People get that a lot.

Speaker:

And I agree with that.

Speaker:

but how does it actually work?

Speaker:

So let's say I've got my VS Code open, and I've got some code and it's got

Speaker:

some imports and existing code base.

Speaker:

does it upload the whole thing to OpenAI, to be able to generate the useful things?

Speaker:

What's the context that OpenAI ends up having somewhere in the training data?

Speaker:

Yeah, it is going to encode the context, which will result in a good

Speaker:

portion of that code being uploaded.

Speaker:

OpenAI promises, pinky swear, that it's not being saved anywhere.

Speaker:

I don't think we have any way to assess whether that's true or

Speaker:

not, not to delve into conspiratorial thinking, but you should be careful.

Speaker:

Certainly if you're working on proprietary software. But that's why there are

Speaker:

other alternatives that are entirely offline, that you can delve into if

Speaker:

you're really very privacy concerned.

Speaker:

cause yeah, frankly, we don't know how it's being used

Speaker:

once it leaves our machine.

Speaker:

We'll definitely touch base on Llama and other alternatives that

Speaker:

you discuss in your book. But

Speaker:

is there a way to control it, or at least tell it, 'okay, only upload this

Speaker:

folder', or is it just fully automatic?

Speaker:

It just decides by itself what it sends.

Speaker:

you can tell it, but is it going to honor that?

Speaker:

I don't really have a good answer.

Speaker:

Okay.

Speaker:

So something to check

Speaker:

yeah.

Speaker:

Something to check.

Speaker:

for anybody who's now browsing manning.com, there is a live version

Speaker:

where you can see elements of the book.

Speaker:

Figure 2.18 is a nice summary.

Speaker:

There are a bunch of figures: there's a circle for unsupported, a triangle for

Speaker:

supported, and a square for exclusively supported. It's comparing ChatGPT, just being used

Speaker:

by itself, to Copilot and CodeWhisperer.

Speaker:

and it's summarizing whether it can generate methods, classes,

Speaker:

projects, generate documentation, switch languages and stuff like that.

Speaker:

So for anybody who wants to delve a little bit more into the

Speaker:

details, I think that's very handy.

Speaker:

For anybody who might be beyond that, so they went to ChatGPT, they

Speaker:

spoke to it, and they got some code, and it was a generally pleasant

Speaker:

experience, and they want more.

Speaker:

They use Copilot.

Speaker:

what's the next kind of checkpoint?

Speaker:

Where do they go from there?

Speaker:

How do they start designing software at a bit higher level

Speaker:

than just snippets of code?

Speaker:

How useful is the AI here?

Speaker:

For example, I was designing something yesterday and I was working through

Speaker:

a conversation with ChatGPT. That is one of the key things that ChatGPT

Speaker:

excels at, helping you to design the software, to really underscore that.

Speaker:

Not just to design your application.

Speaker:

Not just lines of code, though it's perfectly capable of that.

Speaker:

Not even just the classes, but, like, here are the patterns that you want to apply.

Speaker:

Here's the architecture.

Speaker:

Have it generate some of the documents in a text format,

Speaker:

so PlantUML or Mermaid.

Speaker:

Those are really good, useful things, because then you can always

Speaker:

take those, save those, and pass them back to ChatGPT to refresh the context.
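
As a concrete illustration of the kind of text-based artifact being described, here's a minimal Mermaid class diagram (the class names are hypothetical) that you could save alongside the code and paste back into a chat later to restore context:

```mermaid
classDiagram
    class Asset {
        +string name
        +float purchase_price
        +current_value(years) float
    }
    class DepreciationStrategy {
        <<interface>>
        +depreciate(price, years) float
    }
    Asset --> DepreciationStrategy : uses
```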

Speaker:

So yeah, as a co-founder and CTO of a startup, I found it

Speaker:

really invaluable, as a partner to help me design that software.

Speaker:

I think one of the things that really opened my eyes was that I never thought

Speaker:

to talk to ChatGPT about open source alternatives, and maybe trying to

Speaker:

select a database and talking about the different properties, like it

Speaker:

was just second nature for me to open the different docs and just start

Speaker:

comparing features and stuff like that.

Speaker:

And it never occurred to me that I can just go and ask ChatGPT because it's

Speaker:

got quite a lot of knowledge about that.

Speaker:

Yeah.

Speaker:

I think in the book you're talking about open source alternatives

Speaker:

to what you're writing, which is

Speaker:

an IT asset management system. Actually, I don't know if this part's going to work,

Speaker:

just so you're aware, or just be advised.

Speaker:

I got a lot of feedback that it was really boring, right?

Speaker:

That people didn't like the actual project that you have

Speaker:

to work on throughout the book.

Speaker:

But I wanted it to be like a boring book on a boring topic, a boring

Speaker:

application, because most of what we write is not interesting.

Speaker:

We pick up data and we shuffle it and we move it around, right?

Speaker:

A lot of what we do is not exciting.

Speaker:

it was definitely intentional.

Speaker:

but, again, maybe something to fix in this in a second edition, if it's coming.

Speaker:

But one of the more interesting things about my engagement model with these

Speaker:

tools as I worked with them, to pick up on what you were saying about learning

Speaker:

more about a database, or having it help select a database, or selecting

Speaker:

open source projects, is that very early on

Speaker:

I was being extremely prescriptive.

Speaker:

I would say, create software that's using this library, in this framework,

Speaker:

and this language, and all of that.

Speaker:

but later on, and even to this day, when I have a problem, I

Speaker:

feed in the business requirements.

Speaker:

And then I ask it to make recommendations for me.

Speaker:

and then I can assess those.

Speaker:

but at the very least it starts the process.

Speaker:

it gets it going.

Speaker:

so hopefully, that answered your question or was at least in the neighborhood.

Speaker:

Yeah, definitely in the neighborhood, same district.

Speaker:

Same zip code.

Speaker:

Yeah.

Speaker:

the way my mind works is that when I hear the idea of a free, junior available 24/7

Speaker:

my mind wanders to things like already mentioned docs, We hinted at tests coming

Speaker:

a bit later, but I think one of the things that are painful in more than one

Speaker:

way and people never want to do them is refactoring and upgrading to a new version

Speaker:

of something or maybe changing language, which is surprisingly labor-intensive.

Speaker:

It always ends up being more work than it looked initially.

Speaker:

How good, is the AI at the moment in this kind of things?

Speaker:

Refactor, rewriting in a different language, upgrade a library.

Speaker:

Can you just say:

Speaker:

' Hey, this is a library with a breaking change.

Speaker:

Give me the new version of the library and updated tests and everything'?

Speaker:

If the post-breaking-change version was in the training

Speaker:

data, you should be fine.

Speaker:

if not, you're going to have a more involved conversation.

Speaker:

but more generally, it does really well in translating from one language to another,

Speaker:

specifically programming languages.

Speaker:

I couldn't

Speaker:

assess its quality on English to French or something like that.

Speaker:

But I can tell you, there was a few examples where I was working in Python

Speaker:

and then I said, 'Oh, what would this look like in Go?' And it gave me

Speaker:

just a literal translation into Go.

Speaker:

And I was like, 'this doesn't feel very idiomatic, make it idiomatic.'

Speaker:

And it would be as good as or better than I would have written it myself.

Speaker:

so it does surprisingly well in going from one language to another.

Speaker:

And then, on to refactoring: you can ask it for certain patterns that

Speaker:

you may want to apply as you refactor, different design schemes. Like, maybe

Speaker:

I need to pull out an interface.

Speaker:

Maybe I need some kind of parent class.

Speaker:

Maybe this needs to be an adapter, or you take your pick from the Gang

Speaker:

of Four, and it knows them and can provide examples in any language

Speaker:

you can think of that it was trained on.

Speaker:

So it can take a lot of that drudgery away and

Speaker:

a lot of that anxiety away.
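
To make "pull out an interface" concrete, here's a minimal Python sketch of the kind of refactor you might ask for; the names are hypothetical, not from the book:

```python
from typing import Protocol

# Before the refactor, callers depended on one concrete notifier class.
# Pulling out an interface (a Protocol here) lets implementations be swapped.
class Notifier(Protocol):
    def send(self, message: str) -> None: ...

class EmailNotifier:
    def send(self, message: str) -> None:
        print(f"emailing: {message}")

class SlackNotifier:
    def send(self, message: str) -> None:
        print(f"posting to Slack: {message}")

def alert(notifier: Notifier, message: str) -> None:
    # Depends only on the interface, not on any concrete class.
    notifier.send(message)

alert(EmailNotifier(), "disk almost full")
alert(SlackNotifier(), "disk almost full")
```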

Speaker:

One of the most important benefits that we can derive from at least their

Speaker:

current implementation of these tools and of genAI is to just keep us going,

Speaker:

to keep us motivated, to keep us engaged, to keep us building software.

Speaker:

It can be really mentally taxing, and this can help ease some of that

Speaker:

intellectual heavy lifting. Not that we should just subordinate our thinking

Speaker:

to it entirely, but it can help.

Speaker:

Did you notice any discrepancies between quality, in different languages?

Speaker:

Because what I'm picturing is that the body of training data

Speaker:

came from somewhere like GitHub,

Speaker:

probably.

Speaker:

And if you look at GitHub, there's going to be a disproportionate

Speaker:

amount of JavaScript, of questionable quality too, but you're probably going to

Speaker:

have an increasing and quite significant amount of Go as well.

Speaker:

but you might not have too much...

Speaker:

Haskell?

Speaker:

Yeah, Haskell,

Speaker:

I don't know, SQL, whatever it is.

Speaker:

did you notice anything funny about that?

Speaker:

I would say that is

Speaker:

roughly in line with what I observed, and I wouldn't necessarily have

Speaker:

dug too deep into very niche languages.

Speaker:

But definitely the examples that you're gonna find, if you're working in

Speaker:

Python, Go, JavaScript, or TypeScript.

Speaker:

like those are going to be more voluminous and likely higher quality.

Speaker:

the one time I tried to use it to write, Rust, it failed spectacularly.

Speaker:

It was beautiful.

Speaker:

It was glorious.

Speaker:

I was trying to, throw together an API gateway.

Speaker:

Just see how, just how difficult was this going to be.

Speaker:

and in Rust, I wanted something high-performance.

Speaker:

And I just, I asked it to start writing some code and it created a number of

Speaker:

files, and none of it worked well together and it wouldn't compile.

Speaker:

Although it's Rust, so it would take a while to convince the

Speaker:

compiler that it's good enough.

Speaker:

but yeah, it was, not the most pleasant experience.

Speaker:

But also, to be fair, at the time, I only spent a few hours learning

Speaker:

the basic syntax of Rust, so I don't know really what I was expecting.

Speaker:

So was it ChatGPT, or was it me, or was it a mixture of both?

Speaker:

probably the latter.

Speaker:

Yeah, I think we all occasionally bump into those weird restrictions

Speaker:

based on the training data.

Speaker:

One that I keep remembering was when I wanted Midjourney to generate

Speaker:

for me a picture of Triceratops.

Speaker:

And it would give me any other dinosaur when I was asking

Speaker:

for it, but not this one.

Speaker:

It was all T-Rex and T-Rex.

Speaker:

then I started throwing random names, give me Brontosaurus, and

Speaker:

it just gave me a Brontosaurus.

Speaker:

So I was very upset at the time, I made peace with that.

Speaker:

And there are some things that just weren't in the training set and

Speaker:

they didn't emerge from training.

Speaker:

come on, a triceratops?

Speaker:

They're like the best.

Speaker:

Yeah.

Speaker:

You would think so, right?

Speaker:

Very weird.

Speaker:

Yeah.

Speaker:

And about listening about this from midjourney, this still needs fixing.

Speaker:

This is months later and you still can't get a decent triceratops.

Speaker:

I was using DALL-E and I asked it for a pug-a-pegacorn.

Speaker:

So that's a pug, a unicorn and a Pegasus.

Speaker:

And I got a pretty good one, pretty good representation.

Speaker:

And then I said, make it cute.

Speaker:

And it was the most adorable thing I've ever seen.

Speaker:

Wow.

Speaker:

but I.

Speaker:

Did not try a triceratops

Speaker:

I know what I'm going to do after this.

Speaker:

Yes, I encourage everyone to go create their own Pug-a-peg-a-corn

Speaker:

exactly.

Speaker:

Let's move to testing software.

Speaker:

so you already said a little bit about how difficult it actually is.

Speaker:

Can you give a more concrete example?

Speaker:

What is wrong with the tests it's generating some of the time?

Speaker:

In this case it was really struggling to figure out what I was actually

Speaker:

trying to do with the test.

Speaker:

specifically, it was an integration test.

Speaker:

And so I was trying to go mostly end to end in terms of serving data over REST.

Speaker:

It was largely missing the point of the actual test,

Speaker:

which was very strange.

Speaker:

it was in Python, so there should've been a number of

Speaker:

instances in the training data

Speaker:

to cover this.

Speaker:

Did you just say, 'I want an integration test, test everything', or did you

Speaker:

describe more, end to end, 'I would like the data to flow through the whole thing'?

Speaker:

Yeah, I felt it was fairly comprehensive.

Speaker:

I think at that point, specifically,

Speaker:

I was having Copilot write the test, and I believe I even went to ChatGPT and

Speaker:

asked it, 'how would I write a prompt to get Copilot to do an integration

Speaker:

test, an end-to-end test, for FastAPI,

Speaker:

and the payload would look like this?'
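
For orientation, here's a minimal sketch of the kind of end-to-end test being described, using FastAPI's built-in TestClient; the app module and the /assets endpoint are hypothetical, not the book's actual project:

```python
from fastapi.testclient import TestClient
from main import app  # hypothetical module exposing a FastAPI app

client = TestClient(app)

def test_create_and_fetch_asset():
    # Post a payload end to end, then read it back through the API.
    payload = {"name": "laptop", "purchase_price": 1500.0}
    created = client.post("/assets", json=payload)
    assert created.status_code == 201

    asset_id = created.json()["id"]
    fetched = client.get(f"/assets/{asset_id}")
    assert fetched.status_code == 200
    assert fetched.json()["name"] == "laptop"
```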

Speaker:

I eventually started having ChatGPT write my prompts for me,

Speaker:

which it did surprisingly well.

Speaker:

And it's meta.

Speaker:

that's very meta.

Speaker:

Okay.

Speaker:

Was there a really good use case in terms of testing?

Speaker:

Unit tests it was perfectly fine at, even in some cases where I felt

Speaker:

it should have gotten stuck.

Speaker:

So, again, not to get too specific about the actual ITAM,

Speaker:

the IT asset management project that runs all throughout the corpus of the book:

Speaker:

in accounting, assets depreciate at a certain rate, and generally

Speaker:

accepted accounting principles outline a few different ways that you can do it.

Speaker:

And so I used a strategy pattern, and I had a number of different

Speaker:

ways, each of them, to calculate that depreciation.

Speaker:

So the depreciation of the asset, maybe it's straight line.

Speaker:

So it's like over five years.

Speaker:

So one fifth of the value is lost every year and you can write part of that off,

Speaker:

but again, I'm not an accountant, so this does not count as, financial advice, but,

Speaker:

or...

Speaker:

Or medical, yeah, I'm not a doctor. But it works surprisingly

Speaker:

well, I was pleasantly surprised.
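
For readers who want to see the shape of it, here's a minimal sketch of the strategy pattern described, with a straight-line strategy over five years; the class and field names are hypothetical, not the book's actual code:

```python
from typing import Protocol

class DepreciationStrategy(Protocol):
    def depreciate(self, purchase_price: float, years_in_service: int) -> float: ...

class StraightLineDepreciation:
    """Lose an equal fraction of the value each year over the useful life."""
    def __init__(self, useful_life_years: int = 5) -> None:
        self.useful_life_years = useful_life_years

    def depreciate(self, purchase_price: float, years_in_service: int) -> float:
        yearly_loss = purchase_price / self.useful_life_years
        return max(purchase_price - yearly_loss * years_in_service, 0.0)

class Asset:
    def __init__(self, name: str, purchase_price: float,
                 strategy: DepreciationStrategy) -> None:
        self.name = name
        self.purchase_price = purchase_price
        self.strategy = strategy

    def current_value(self, years_in_service: int) -> float:
        # Delegate to whichever accounting strategy was injected.
        return self.strategy.depreciate(self.purchase_price, years_in_service)

laptop = Asset("laptop", 1500.0, StraightLineDepreciation(useful_life_years=5))
print(laptop.current_value(2))  # 900.0: one fifth (300.0) lost per year
```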

Speaker:

Fair enough.

Speaker:

So we've written some code, we've designed some software.

Speaker:

Let's say that we tested it for the most part.

Speaker:

but the reality of it is that we're probably going to spend more time

Speaker:

reading code and understanding code.

Speaker:

perhaps the code that we wrote a couple of years back.

Speaker:

Yeah.

Speaker:

How well does the describing-existing-code part actually work at the moment?

Speaker:

Yeah, it works surprisingly well at translating the code that you wrote

Speaker:

into very simplified answers, descriptions of

Speaker:

here's how it functions.

Speaker:

Here's how it's working.

Speaker:

here's what it expects.

Speaker:

You can even have it describe an entire system to you.

Speaker:

I did not, though, attempt to do what is probably one of the

Speaker:

hardest things, within that space.

Speaker:

And that is, I didn't feed it a Perl program and ask it what it actually did.

Speaker:

I have this feeling it probably would have broken ChatGPT.

Speaker:

Sorry.

Speaker:

Taking pot shots at Perl.

Speaker:

you should have given it a regular expression in

Speaker:

Oof.

Speaker:

and tried to see what happens.

Speaker:

And then next thing you know, OpenAI's knocking at your door, kicking you out.

Speaker:

That sounds about right.

Speaker:

Or a T-1000 is just kicking in the door.

Speaker:

I guess in my mind there's this limitation on the context

Speaker:

length that you can feed it, right?

Speaker:

So if your code base becomes significantly large,

Speaker:

is that not going to be a problem in getting it to even describe it?

Speaker:

to describe your entire codebase, yes.

Speaker:

but you can start to chunk it up.

Speaker:

you can work around that limitation by sending it only pieces.

Speaker:

And you're probably not going to get the full context there but

Speaker:

it can help guide your intuition.

Speaker:

That's why if you have some kind of class diagram or some architectural

Speaker:

diagram that's text-based,

Speaker:

like PlantUML, then you can distill your entire code

Speaker:

base into a single document.

Speaker:

Now, again, if it's a code base of thousands of classes,

Speaker:

you could still hit those limitations, but it's going to be your best bet

Speaker:

to get a distillation in natural language of what your classes or what

Speaker:

your code is attempting to do.

Speaker:

It really does excel at method-by-method descriptions of what the code does.
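
In practice that chunking workflow is just a loop over pieces of the code base. Here's a hedged sketch using the OpenAI Python SDK; the model name and chunk size are assumptions, and real code would want to split on function boundaries rather than raw character counts:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def summarize_code(source: str, chunk_size: int = 4000) -> list[str]:
    # Naive chunking by character count; splitting on function or class
    # boundaries would preserve more meaning per chunk.
    chunks = [source[i:i + chunk_size] for i in range(0, len(source), chunk_size)]
    summaries = []
    for chunk in chunks:
        resp = client.chat.completions.create(
            model="gpt-4",  # hypothetical choice
            messages=[{
                "role": "user",
                "content": "Describe in plain English what this code does:\n" + chunk,
            }],
        )
        summaries.append(resp.choices[0].message.content)
    return summaries
```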

Speaker:

Between manually browsing through the code trying to understand the intent, and

Speaker:

Jarvis, it's halfway there, right?

Speaker:

It's not quite, here's the intent and here's what it imported and

Speaker:

here's my recommendations, Mr.

Speaker:

Stark.

Speaker:

It's more, here, I can ask about this method.

Speaker:

Can't be bothered to read it.

Speaker:

It's 2,000 lines, and it can give me the gist of it.

Speaker:

Exactly.

Speaker:

Exactly.

Speaker:

there's also the security aspect that, you're discussing in one of the chapters.

Speaker:

Can you talk about that a little bit?

Speaker:

Yeah, and actually there's a funny story that's embedded in that too.

Speaker:

It's good at, picking up on what we were just talking about, the non-

Speaker:

exclusive path: it can explain ways that your code might be exploited.

Speaker:

It's not the same as having a security expert on your team.

Speaker:

it will miss things.

Speaker:

but it's definitely better than nothing.

Speaker:

And it can make some pretty great recommendations in terms of

Speaker:

how you can structure your code.

Speaker:

One of the funny things: I really wanted an example of

Speaker:

a SQL injection in the book.

Speaker:

So I actually asked, ChatGPT to give me an example of a SQL injection.

Speaker:

but it wouldn't.

Speaker:

No matter how I tried to coerce it. No, I swear

Speaker:

I'm not doing this for evil.

Speaker:

This is just for illustrative purposes only.

Speaker:

And it just would not give me a valid SQL injection exploit

Speaker:

that I could include in the book.

Speaker:

so do with that as you will.

Speaker:

Yeah.

Speaker:

that's a very interesting ethical, discussion about that.

Speaker:

There's probably gonna be some way you can, I don't know if you heard about

Speaker:

that exploit where, if you asked it to do something nefarious, it would say no, but

Speaker:

if you asked it to do something and that something equals ASCII art of something

Speaker:

nefarious, there was no problem at all.

Speaker:

So I suspect, like it's very hard to Limit a model like that because there's endless

Speaker:

opportunities to express it differently and you only need one of them to work.

Speaker:

So an interesting one.

Speaker:

So it won't let you do a SQL injection even when you pinky

Speaker:

swear it's for the good.

Speaker:

And I tried to do that, it was an exploit a little bit earlier on

Speaker:

where you could give it a persona of DAN, and DAN is allowed

Speaker:

to do things that ChatGPT isn't.

Speaker:

And still, it wouldn't let me do it, as, I think it

Speaker:

was DAN, and it was an acronym.

Speaker:

but yeah, similar thing, but maybe I should have tried out ASCII art next time.

Speaker:

But listeners, do not intentionally put SQL injection exploits in your code.

Speaker:

Oh yeah, that needed to be said.
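
Since the model wouldn't produce one for the book, here, for illustration only, is the textbook shape of the vulnerability, and more importantly the fix, using Python's built-in sqlite3 module:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT, role TEXT)")
conn.execute("INSERT INTO users VALUES ('alice', 'admin')")

user_input = "nobody' OR '1'='1"  # attacker-controlled string

# Vulnerable: user input is spliced straight into the SQL text.
rows = conn.execute(
    f"SELECT role FROM users WHERE name = '{user_input}'"
).fetchall()
print(rows)  # [('admin',)] -- the OR '1'='1' clause matched every row

# Safe: a parameterized query treats the input as data, not as SQL.
rows = conn.execute(
    "SELECT role FROM users WHERE name = ?", (user_input,)
).fetchall()
print(rows)  # [] -- no user is literally named "nobody' OR '1'='1"
```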

Speaker:

What are some of the examples of what it was able to figure

Speaker:

out from your code, in terms of security holes and stuff like that?

Speaker:

Do you have any interesting examples of success?

Speaker:

What did it actually find?

Speaker:

I would have to go back and consult the book.

Speaker:

My code is just so good that there were no exploits to be made.

Speaker:

No, that's not...

Speaker:

And there you go.

Speaker:

So Nathan doesn't want to share too much

Speaker:

about the book.

Speaker:

You're going to have to go and buy it.

Speaker:

Yeah, one of the things I should have mentioned up front is,

Speaker:

this was the first time that I had ever built a true application in Python.

Speaker:

I had used it for scripts previously, just to do something, text

Speaker:

modification, things like that.

Speaker:

But I never built an actual application.

Speaker:

It actually helped me learn how to build applications while using it.

Speaker:

There's a book called

Speaker:

'Octopus, My Teacher'. I guess for you, it's more like 'ChatGPT, My Teacher'.

Speaker:

All right.

Speaker:

that's good.

Speaker:

we've covered, I think most of the big chunks other than

Speaker:

actually running the software.

Speaker:

Let's say that it runs, we package that.

Speaker:

And then we've got things like Docker, Terraform, the YAML hell that comes

Speaker:

with Kubernetes. On one hand, I would expect that this is fairly repetitive,

Speaker:

so ChatGPT would excel.

Speaker:

It's not a very tricky language.

Speaker:

It's just very verbose, and the white spaces make your life miserable.

Speaker:

How good is it with that kind of stuff?

Speaker:

It was actually really good with working out YAML and making different

Speaker:

scripts, helping build out dev pipelines through GitHub Actions, things like that.

Speaker:

it did really well.

Speaker:

One of the very interesting things that I discovered, though,

Speaker:

was that CodeWhisperer, the AWS

Speaker:

generative AI large language model, actually doesn't support

Speaker:

anything but programming languages.

Speaker:

So it didn't even understand how to do, like, Terraform

Speaker:

infrastructure as code, which you'd think it would be very good at.

Speaker:

that was a bit surprising.

Speaker:

but it's by design, it's intentional.

Speaker:

it's hard to see it as a limitation

Speaker:

Curious.

Speaker:

So do they have another tool for the YAMLs of the world?

Speaker:

Or did they just out-of-scope it?

Speaker:

Yeah, just out-of-scoped it. I didn't try the, what is it, Cloud-something, not

Speaker:

CloudFront, but whatever their deployment-based,

Speaker:

infrastructure-as-code-specific thing is.

Speaker:

I didn't try that.

Speaker:

maybe I should have.

Speaker:

But I was shocked.

Speaker:

I think I even mentioned that in the book, that I had originally

Speaker:

intended that chapter to be written using CodeWhisperer.

Speaker:

Let's say, for example, you want a quick Dockerfile. You can write it.

Speaker:

It's not too hard, but why do it if you can get it for free?

Speaker:

So you open a Dockerfile and you write in a comment what you want it to

Speaker:

do and you hit tab and the magic happens.

Speaker:

That's roughly what you need to do.

Speaker:

Yeah, roughly.

Speaker:

add a prompt as it were in a comment.

Speaker:

it doesn't have to be a comment.

Speaker:

you can just add the prompt and then later delete it.

Speaker:

Have it generate the Dockerfile for you, or the Kubernetes file.
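
Concretely, the workflow looks something like this: a hedged sketch of a Dockerfile where the leading comment plays the role of the prompt (the app, port, and base image are hypothetical), and the lines below it are the kind of completion you'd expect:

```dockerfile
# Prompt-as-comment: "Create a Dockerfile for a Python 3.11 FastAPI app
# served by uvicorn on port 8000."

FROM python:3.11-slim
WORKDIR /app
COPY requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt
COPY . .
EXPOSE 8000
CMD ["uvicorn", "main:app", "--host", "0.0.0.0", "--port", "8000"]
```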

Speaker:

I don't know if I tried using patterns in Terraform, just

Speaker:

to ease some of the repetition,

Speaker:

to not necessarily have the sprawling mess that Terraform can become.

Speaker:

but, I'm sure it could accommodate that as well.

Speaker:

Both Copilot and ChatGPT did seem to have extensive knowledge

Speaker:

of Terraform syntax and features.

Speaker:

So that all together adds up to a pretty competent junior developer, like you

Speaker:

described at the beginning, one that you need to supervise, but it can do a lot of

Speaker:

the legwork for you and much faster too.

Speaker:

Did you follow Devin, the supposedly first AI-driven coworker?

Speaker:

no, that's interesting.

Speaker:

Tell me more.

Speaker:

it was a few weeks ago, they made this big announcement.

Speaker:

There was a video with a demo showing basically doing the

Speaker:

whole thing from scratch.

Speaker:

So not only did it do the Copilot stuff, but it bootstrapped the whole project,

Speaker:

generated all the files, and had, like, a browser window that it had access to as well,

Speaker:

to actually go and verify that it works.

Speaker:

Obviously, it was doing something that it always does in these demos, which

Speaker:

is, an HTTP server with a REST API.

Speaker:

Which is cheating if you ask me.

Speaker:

a lot of people were very impressed and there was a lot of, angst among people

Speaker:

on the internet arguing over whether this is the end of software engineering

Speaker:

as we know it, or whether it's a scam.

Speaker:

And then, a few days ago, there was a critique that resurfaced about Devin and

Speaker:

that entire project, and whether it was a little bit polished up in the demo.

Speaker:

In other words, just a typical software demo.

Speaker:

Yeah, that's just a typical software demo.

Speaker:

So I think we have a similar problem, maybe with different stakes, to

Speaker:

self-driving cars: it can't be, like, 95% good without supervision, it has

Speaker:

to be, I don't know, 99% good

Speaker:

before we can leave it without our supervision. And sure, a badly written API,

Speaker:

Probably most of the time it's not gonna get anybody killed, fingers crossed, but,

Speaker:

you still need that supervision, right?

Speaker:

and Devin was supposed to do away with that.

Speaker:

So I'm looking forward to seeing how that story develops and

Speaker:

how they answer the critique.

Speaker:

And I guess when people can actually go and play with it, we'll

Speaker:

find out whether it was all fluff.

Speaker:

It's another...

Speaker:

Yeah, no, that's interesting.

Speaker:

I have been following, there was a story not long ago that, because of

Speaker:

the proliferation of ChatGPT and Copilot and the like, software

Speaker:

has been getting less and less secure.

Speaker:

Because it is easy for bugs to introduce themselves if you're

Speaker:

really just copying and pasting.

Speaker:

So that's why, we're not at that stage yet.

Speaker:

We may never be.

Speaker:

Where it's just, there's no human in the loop, right?

Speaker:

For these things that it can just generate code on its own

Speaker:

and, push it to production.

Speaker:

the role of a professional developer is here to stay for the foreseeable future.

Speaker:

We're just going to be better at what we do.

Speaker:

again, if we're mindful and not allowing these bugs to just creep in.

Speaker:

I like the optimistic point of view here, but yeah, a lot of people

Speaker:

I think would agree with you that this is like it was going to happen.

Speaker:

Although when you look at some of the software, you do wonder how much of

Speaker:

that was actually supervised and how much was just dumped automatically.

Speaker:

But that's for another story altogether.

Speaker:

Let's talk about the local LLMs, and how good they are by comparison, because

Speaker:

I've poked both, but I've never really done like a side by side comparison

Speaker:

to really tell how good they are.

Speaker:

You used Llama 2, I think, and OpenOrca, and did some side-by-side comparisons.

Speaker:

So how good are they compared to what you get with Copilot?

Speaker:

Actually, this was probably my favorite chapter, to write

Speaker:

and to do the research on.

Speaker:

It was just super fun.

Speaker:

It was an old Llama 2 model.

Speaker:

It's generations old at this point.

Speaker:

so I've been meaning to revisit it.

Speaker:

It produced competent text,

Speaker:

natural language processing, 'give me a description of this'.

Speaker:

the code quality was not great.

Speaker:

But again, this was six months ago or more.

Speaker:

So I'm sure that the model is 10 times better now.

Speaker:

So definitely worth revisiting.

Speaker:

Yeah, I would say on balance, most of the models that I was

Speaker:

running locally did not perform as well.

Speaker:

but they performed competently.

Speaker:

so if you were in a pinch and you didn't have access to the internet and you'd had

Speaker:

some foresight and downloaded these models prior, you could still get the job done.

Speaker:

but they wouldn't necessarily be my go to.

Speaker:

Although I did, just recently,

Speaker:

download a new model, and it does seem to be much better at this point.

Speaker:

I think it was Mistral.
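
For anyone who wants to reproduce that fully offline setup, here's a hedged sketch using the llama-cpp-python bindings; the GGUF file name is an assumption, and you'd substitute whatever quantized model you downloaded ahead of time:

```python
from llama_cpp import Llama

# Load a locally downloaded, quantized model; nothing leaves the machine.
llm = Llama(model_path="./models/mistral-7b-instruct.Q4_K_M.gguf")

out = llm(
    "Write a Python function that returns the nth Fibonacci number.",
    max_tokens=256,
)
print(out["choices"][0]["text"])
```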

Speaker:

that's one of the more interesting areas, in my mind, because that helps

Speaker:

get around some of the unknowns.

Speaker:

because it, I did, I turned off my wifi.

Speaker:

I pulled the network cable.

Speaker:

I made sure that I was entirely off the network, prior to using them

Speaker:

because I wanted to make sure no context was leaving my computer.

Speaker:

If privacy of your code, of personal data, is a primary concern,

Speaker:

it's probably the best option out there today.

Speaker:

I think this is a really good argument for that.

Speaker:

A lot of people will be in a situation where their employers are just not

Speaker:

comfortable with the code just going somewhere,

Speaker:

no matter what pinky swears you got.

Speaker:

And I think that this opens, like the remainder of the

Speaker:

market that really matters.

Speaker:

And I think we're all waiting for Llama 3 to drop, any week now.

Speaker:

I'm just worried that it might be too big to run comfortably on your M3.

Speaker:

Even with quantization. But let's see, it might actually not increase

Speaker:

in size and yet become more competent.

Speaker:

There are other models too, like SantaCoder, which I think at some

Speaker:

point was very popular as well.

Speaker:

Did you manage to get a workflow that's more like Copilot and

Speaker:

less like chatting to ChatGPT?

Speaker:

that would be a really good challenge.

Speaker:

To try to turn one of these into more of a Copilot model.

Speaker:

It was ultimately one that I just couldn't get done in time.

Speaker:

So, no.

Speaker:

hopefully you knew what you were getting yourself into, but you

Speaker:

wrote an AI book, so you have to update it every three months now.

Speaker:

that is true.

Speaker:

One of my favorite titles of late was another Manning book, but it

Speaker:

was the completely out of date, or the complete, yeah, obsolete.

Speaker:

Exactly.

Speaker:

A book on, generative AI.

Speaker:

I thought that was really clever.

Speaker:

lean into it,

Speaker:

yeah, that's a book by David Clinton.

Speaker:

We had him on the podcast a couple of weeks ago, and I think it's

Speaker:

called 'The Complete Obsolete Guide to Generative AI'. It's hilarious.

Speaker:

I had so much fun actually just reading that.

Speaker:

The humor is at a nice level there.

Speaker:

Definitely the highest possible recommendation, buy that book over mine.

Speaker:

Probably worth mentioning, when the Devin thing was going on, there was

Speaker:

some kind of open source response.

Speaker:

I think it was SWE, OpenSWE, something like that.

Speaker:

It's probably easily googleable, and I haven't gotten to actually

Speaker:

testing it out, but I was supposed to.

Speaker:

I think what we really need to get to is to get one of those open models

Speaker:

to behave, 85% as well as Copilot.

Speaker:

And then it's basically game over.

Speaker:

If you can run it on your laptop, there's no subscription, there's no data leaving.

Speaker:

it's a no brainer at that stage.

Speaker:

what are we doing with the CPU cycles and the GPU cycles on the

Speaker:

MacBook when we're developing?

Speaker:

Anyway.

Speaker:

Yeah.

Speaker:

Yeah.

Speaker:

And it just makes sense from a corporate

Speaker:

decision-making view: you could host it, not necessarily centralized,

Speaker:

host it trained off your own data.

Speaker:

like that's, yeah, that's really game over.

Speaker:

although, yeah,

Speaker:

Another way of saying that:

Speaker:

It's just the beginning of the fun.

Speaker:

yeah.

Speaker:

Beginning of the arms race.

Speaker:

Yeah.

Speaker:

What's the next thing here?

Speaker:

What do you expect to come in the coming months and years, Obviously

Speaker:

wild predictions, all the usual disclaimers, but what's your take?

Speaker:

What's next in coding and AI?

Speaker:

I'm gonna give a boring answer.

Speaker:

I think it's just gonna be incremental improvement.

Speaker:

AGI, if it's possible, is a long...

Speaker:

No, what is it, what are they calling it?

Speaker:

Artificial General Intelligence, yeah, AGI.

Speaker:

Yeah.

Speaker:

AGI, yeah.

Speaker:

I think that's a ways off, if at all, if it's even feasible. We'll see incremental

Speaker:

improvements, where the models, I don't want to say hallucinate less, because

Speaker:

that's a feature, not a bug, but where they get more and more

Speaker:

refined, the output becomes more timely, we're starting to see where

Speaker:

it can actually connect live to the internet. So I just think there's going

Speaker:

to be incremental advances like that.

Speaker:

Until there's a real breakthrough, and that will change the game, in

Speaker:

the same way that the transformer changed the way that we did natural

Speaker:

language processing and text generation.

Speaker:

And until there's something like that, it's just going to

Speaker:

be incremental improvement.

Speaker:

So life goes on, NVIDIA makes more chips, they make faster chips, they become even

Speaker:

bigger and they leave the gamers behind even further and we make bigger models, we

Speaker:

train them better and we get a little bit closer to a superstar junior developer.

Speaker:

Is that roughly what we're talking about here?

Speaker:

that's what I would predict.

Speaker:

I'm happy to be wrong, unless it's completely detrimental to all of

Speaker:

our fine men and women out there, giving their blood, sweat and tears

Speaker:

every day in developing software.

Speaker:

Not investment advice, by the way.

Speaker:

Yes, exactly.

Speaker:

Another disclaimer for our US-based clientele.

Speaker:

And what's next for you, Nathan? Do you have an eye on the next book?

Speaker:

I have a couple of ideas brewing.

Speaker:

I really am going to avoid doing just a second edition of this.

Speaker:

Even though I've alluded to it several times. But I have a couple of ideas

Speaker:

that are really brewing, in terms of: okay, so now we know how to use them.

Speaker:

We've had some practice, so now let's apply it to very

Speaker:

specific, very niche problems.

Speaker:

and are there ways that we can extend it?

Speaker:

Are there ways that we can, train it on our own data, things like that.

Speaker:

that's where I would see the next logical, area for me to move into.

Speaker:

but I would still want something very practical.

Speaker:

so yeah, I'll probably just wind up doing a second edition.

Speaker:

the book once again is called "AI-Powered Developer".

Speaker:

It's published by Manning, which means that it has been available in

Speaker:

the early access for a while now.

Speaker:

So if you go to manning.com, you can get immediate access and start reading that.

Speaker:

It's currently in production, which means that it's going to take a little bit of

Speaker:

time before it actually hits places like Amazon in a physical copy and before

Speaker:

Nathan can have a party to celebrate that.

Speaker:

my guest was Nathan B.

Speaker:

Crocker, the co-founder and CTO at Checkr.

Speaker:

Thank you very much, Nathan.

Speaker:

I'll see you

Speaker:

Thank you.

Speaker:

It was a pleasure.

Speaker:

Be well.
