Artwork for podcast Data Driven
*Live* Tis the Season for SSIS
Episode 1524th December 2024 • Data Driven • Data Driven
00:00:00 01:34:39

Share Episode

Shownotes

In this livestream, Frank and Andy discuss the timeless nature of backend enterprise tech, that, much like a Christmas special from decades ago, is still very much celebrated.

Moments

00:00 Exploring SSIS future in a festive episode.

08:28 Data engineering evolved from business intelligence systems.

10:57 Social networks project before Facebook's popularity.

19:19 SSIS training informed data engineering concepts teaching.

24:56 Bill Gates moved project to immature Microsoft tooling.

29:10 Data engineering possible in 2024 using T-SQL.

35:23 Huge cloud companies surpass previous brick-and-mortar giants.

40:10 Old technologies endure; misconceptions about their age.

46:03 Evaluate change benefits: technical ease, business growth.

52:30 Cloud departure interests rise, SSIS assistance sought.

55:47 Big government agency utilizing diverse cloud platforms.

01:00:59 Security is crucial; clients' preferences vary.

01:08:56 Certification issues hinder software updates and compliance.

01:10:02 People stick with older systems for reasons.

01:15:15 Proper GPU driver drastically improved loading time.

01:22:16 Repost increased engagement and communication with author.

01:25:45 Data scientists should learn SQL for simplicity.

01:31:06 Obsolete systems cause issues without quotes.

Transcripts

Speaker:

In this special holiday themed episode, we're diving into a topic

Speaker:

that's as classic as Christmas Carols, but just as divisive as fruitcake.

Speaker:

And that topic is the future of SQL Server Integration

Speaker:

Services, SSIS. But wait, there's a

Speaker:

twist. This episode was recorded live, so if you notice

Speaker:

a different vibe, some festive banter, and maybe even a change in

Speaker:

our usual musical interludes, that's why. Think of it as

Speaker:

the holiday party version of our usual data driven discussions.

Speaker:

Together, we'll explore why SSIS, despite its vintage

Speaker:

status, remains a cornerstone of data engineering and why

Speaker:

dismissing it might just be a data driven mistake. So grab your

Speaker:

cocoa, settle in by the fire or your nearest CPU,

Speaker:

and let's get festive with some data talk.

Speaker:

Well, hello, and welcome to franksworld.comstream.

Speaker:

And, with me today is Andy, and I'm

Speaker:

looking for the lower third that has us both. There we

Speaker:

go. Frank and Andy Frank Lavinia and Andy Leonard, host of Data

Speaker:

Driven, which I might turn this into a podcast. I might take the

Speaker:

audio and and turn it into a podcast. What do you think about that? That'd

Speaker:

be good kind of festive stream and also kind

Speaker:

of up to date on things. And it gives me some more time that to

Speaker:

put together another episode that I had a really great

Speaker:

conversation with a guy who does red teaming for LLMs.

Speaker:

Nice. So which I think is a growth industry

Speaker:

and certainly a wise career move.

Speaker:

Speaking of career moves. Good thing.

Speaker:

Oh, we have a first comment. SQL dev d b a.

Speaker:

Hey, SQL dev. Awesome. So

Speaker:

this is this may be the first time we've done this. This feature's been around

Speaker:

for a while. No. We did it once or twice before. Did we do it

Speaker:

on recent? Like, months. Yeah. That we've done. So we're sharing

Speaker:

our so Frank's audience, people that are connected to Frank,

Speaker:

they're seeing this. People connected to me are seeing this. It's like it'll

Speaker:

because it told me Frank started this, and, then he sent me

Speaker:

the link. And as I joined in, it it said, hey. You can

Speaker:

share this with with your on your channels as well. So I

Speaker:

was like, oh, yeah. Click that. Oh, you know what it is? We did it

Speaker:

the other way. You were the main, and then I shared it on my channels.

Speaker:

That's what happened. That's what happened. Yeah. Yeah.

Speaker:

Well, it's cool, though. If you've never met me before hello?

Speaker:

That's Frank. Frank digs data on the socials and,

Speaker:

franksworld.com, datadriven.tv, which hopefully you know about that,

Speaker:

and impactquantum.com. So that's me.

Speaker:

And, yeah. So back to the segue.

Speaker:

Yeah. I was talking about how security and AI is a

Speaker:

good career move. And we were talking about, speaking of

Speaker:

career moves, 'tis the season for SSIS is the title of the

Speaker:

stream. And this kind of goes,

Speaker:

I'm sorry. Come on, man. It's fun. Right? It is.

Speaker:

It's awesome. So so and I had kind of done,

Speaker:

2 livestreams on this already, but one of them for, like, 10 minutes, I didn't

Speaker:

catch the fact that I had no audio. And then yesterday, I did one for

Speaker:

2 minutes, so I didn't catch the fact that I didn't do the audio. So

Speaker:

I figured I'd bring the troublemaker himself onto here. Although, strictly

Speaker:

speaking, you're not the original troublemaker on this. Well,

Speaker:

I participated in it. I'll I'll own my my part of the

Speaker:

trouble. You'll own your part of the trouble. So so I definitely will. Yeah.

Speaker:

What's the background here? Well and and I'll

Speaker:

I'll do a plug for, for andylehner.blog.

Speaker:

And if you go there, you can sign up for my newsletter over on the

Speaker:

right side. It's kinda hard to read because the widget is a little

Speaker:

narrower than it needs to be. But if you if you do that or if

Speaker:

you just look up engineer of data, I think it's

Speaker:

engineer of data dot substack.com.

Speaker:

But I I put a newsletter out today kinda talking about it. Yeah.

Speaker:

There's the site. Thanks, Frank. No problem. And, you see the subscribe

Speaker:

to my newsletter down there on the right, and there's a box on

Speaker:

the left where you type your email address and then on the right, you click

Speaker:

it's free. And it'll it should take you, right over to

Speaker:

Subsec, which by the way, I started using this year. And so far,

Speaker:

I'm pretty impressed. It's it's been a a

Speaker:

really interesting, experience for me. So the

Speaker:

trouble here here's, here's where the

Speaker:

trouble happened. I I have been, reading. I

Speaker:

caught a couple of articles just every now here and then, mostly

Speaker:

on LinkedIn, where people

Speaker:

would express an opinion about, you know,

Speaker:

SSIS stinks. I don't like it. It's old. It's was

Speaker:

so much trouble. And, you know, and they would just

Speaker:

kind of kind of poo poo share their their negative thoughts about

Speaker:

Azure sorry. About SSIS. And

Speaker:

I've, of course, I've worked in SSIS since

Speaker:

really before it came out, I got to work on that Rocks book project

Speaker:

with Brian Knight and I remember that book. Yeah. Yeah. 10

Speaker:

of us. Yes. Back when Rocks would put your picture on the cover of the

Speaker:

book. And have a copy around here somewhere. Yeah. That

Speaker:

yeah. Thank you, Frank. You know, it just but it's

Speaker:

so I got yes. I got kind of a boost out of my career,

Speaker:

and I did an awful lot in SSIS for a long time. And

Speaker:

every now and then, I still do. I used to

Speaker:

deliver training, as part of solid,

Speaker:

solid quality learning is what it was called when I joined it. Solid

Speaker:

queue. After that, I worked with them for a few years and I delivered

Speaker:

training developed by Eric Veerman and

Speaker:

also did consulting gigs. And I learned a lot,

Speaker:

about both data engineering and SSIS while I was

Speaker:

doing both those things. When I left solid q, I

Speaker:

think I put about a year or 2 between me and,

Speaker:

you know, and the business. Actually, it was about two and a half years because

Speaker:

I went to work for Unisys then as a ETL architect. I remember

Speaker:

that. You're up in Reston quite a bit because that's where it was. Oh, yeah.

Speaker:

Yeah. Frank. Now an apartment complex now, that building. Oh, is

Speaker:

it? Okay. I think so. Yeah. Okay. So Frank and I

Speaker:

have been friends since the before times, even before SSIS came

Speaker:

out. And, Well, no. I think you had just written the book at the

Speaker:

time. I I'm trying to remember. So Just moved to Richmond just when I

Speaker:

met. It was November of 2005. December 2005.

Speaker:

Yeah. Yeah. November 2005 is when we met. And,

Speaker:

another mutual friend that I won't name, but we're all still friends now.

Speaker:

And the book actually was published in

Speaker:

January, I think, of 2006.

Speaker:

Yes. That's right. So it wasn't it wasn't quite

Speaker:

ready for for prime time. But oh, sorry. The the

Speaker:

book wasn't out. It was going through the process, and it takes a couple of

Speaker:

months from the from the time all of the drafts are finished

Speaker:

until they they make a book out of it. It was my very first,

Speaker:

book project. And, yeah, I I'm pretty sure I was I was so

Speaker:

excited. I was telling everybody, I worked on a book. Oh, yeah. Yeah. Because it

Speaker:

was for the Richmond Code Camp, which was in May, April of

Speaker:

Yep. 2006. Yeah. 2000

Speaker:

Yeah. It was 2006. You and I. Where I did A team. A team.

Speaker:

Developers on a plane, and I had the guy I photoshopped the

Speaker:

guy carrying your book. That's right. I do remember that.

Speaker:

Yeah. I have to find that picture somewhere. I've been I've been using

Speaker:

SSIS for a a long time. I would say I

Speaker:

learned more about data engineering, the

Speaker:

field and did more projects probably

Speaker:

in, in data warehousing where I used SSIS

Speaker:

for for the data engineering, data integration. I think it's important to to

Speaker:

to, 1, explain for those who may not know what SSIS

Speaker:

is, and 2, explain that data engineering was not always seen as a

Speaker:

discrete,

Speaker:

profession or or Yeah. It's a data engineering's a

Speaker:

relatively new word to describe what we do. It was called

Speaker:

the part of business intelligence.

Speaker:

Back even before all that, I think the first term I

Speaker:

heard was data acquisition,

Speaker:

and it was in it was sometimes that was that phrase

Speaker:

was used standalone. The most often,

Speaker:

at the time when I and this is what got me into databases

Speaker:

was doing system control and data acquisition or SCADA

Speaker:

systems. These were manufacturing systems where you collected data from

Speaker:

instruments on the floor. You gotta remember, IoT

Speaker:

was, you know, still somebody's dream back, you know, in

Speaker:

the 19 nineties. IoT. It was just OT back then. It just was

Speaker:

OT. You're right. It's funny. Yeah. But

Speaker:

but we still did acquire, data from,

Speaker:

plant floors and instruments that were mounted all over, but they weren't

Speaker:

Internet enabled at that time. They were, most of them were hardwired. A

Speaker:

few were using wireless. And so that's kinda what led me

Speaker:

into this whole this whole field. And the idea of the

Speaker:

field, is of data engineering, data

Speaker:

integration as we called it back then, is we do that data acquisition part.

Speaker:

We go find wherever the data lives, we go find it there.

Speaker:

And sometimes the data is a very static

Speaker:

list. It it could be even a text

Speaker:

document, created in notepad that

Speaker:

is tab separated or, you know, delimited

Speaker:

by character position or something like that. And a lot of old

Speaker:

old lookups, lookup data was that way. And I'm not making

Speaker:

that up. It was maintained in a a text EDI.

Speaker:

EDI. Yeah. So electronic data interchange. Yeah.

Speaker:

So, yeah, EDI is I have an interesting stories about EDI, but but one of

Speaker:

the things that really kept me away from the data space for a long time

Speaker:

was I didn't wanna be DBA. And this work, I think, had traditionally

Speaker:

been kind of merged with DBAs. Oh, absolutely.

Speaker:

But at some point, I don't know exactly when it really

Speaker:

evolved into its own discipline. And I remember.

Speaker:

Go ahead. Because I remember I tried to get you a job at a particular

Speaker:

company. I remember that. And what do they do? And what was

Speaker:

it? Why do we need a DBA? You don't need a DBA.

Speaker:

Right. And I think that I'm not DBA. That

Speaker:

was the funny part. Well, that was the fun. Well, we clearly did because at

Speaker:

the time there was a project going on,

Speaker:

and I think the term data architect is what you just said. You were I'm

Speaker:

not a DBM data architect. And then that fell

Speaker:

on deaf ears. And, ironically, like,

Speaker:

not like a couple months later, there was a project that we worked on that,

Speaker:

so many stories, and I'm just trying to protect the innocent and the guilty,

Speaker:

and myself, from from from libels. But, basically,

Speaker:

there was a project going on that when it was basically kind of

Speaker:

behavioral analysis of social networks. Right? This is before

Speaker:

Facebook. I think Myspace was around that sort of thing. But it was basically the

Speaker:

idea of organizational networking as a discipline. And it turned out that the

Speaker:

the software that we bought off the shelf would actually query the

Speaker:

database, bring everything back in from the database,

Speaker:

and then run through the filtering on the C Sharp

Speaker:

components on the web server. Gotcha. So

Speaker:

long story short, there was 0 optimization, hardly an

Speaker:

index. I mean, it was just a mess. A data architect

Speaker:

will use the terms of the day, would have slot spotted this right away. We

Speaker:

didn't. And it was just a massive disaster. And it's kind of one of those

Speaker:

things where there were a number of projects that that company was taking

Speaker:

on. Basically, one of their one of their core

Speaker:

business models was was brilliant actually was software maintenance. So you have an

Speaker:

existing application offshore or outsource it outsource it to

Speaker:

us, and we'll take care of it for you. And,

Speaker:

you know, it was really like an an an education

Speaker:

in kind of Jenga programming. Right? Where you had they wanted updates

Speaker:

to this stuff, but they didn't wanna pay to redo it. So you kinda, like,

Speaker:

had to replace rip and replace stuff. And there's one particular

Speaker:

instance where there was a SQL query that took like 14

Speaker:

minutes to bring back an answer. And I'm like, it's only like

Speaker:

like 30,000 records. Like, what what's the deal here?

Speaker:

And turns out there was no indexes, no nothing.

Speaker:

Well, you know, those indexes take up space. Right? Exactly.

Speaker:

Exactly. I mean, this is like why you should save space.

Speaker:

Joke. That's a joke. For one reason or the other, like, there was there was

Speaker:

no index. And I was like, well, let's add indexes. And like, no, no, no.

Speaker:

We can't change the schema. Okay. So what I end up what I end

Speaker:

up doing was creating temporary tables with indexes

Speaker:

and then copying all the data, and I still got it down to 2 minutes.

Speaker:

Nice. Which 1 minute and 59 seconds was copying the data,

Speaker:

and then one second was actually changing. Yeah. So, like, it was it was kind

Speaker:

of like what I call Jenga programming or Jenga architecture. You had to like they

Speaker:

wanted updates, couldn't touch too much, couldn't change anything,

Speaker:

couldn't improve anything because it was just it was a

Speaker:

time in my career that I think back of and I've kind of learned

Speaker:

many lessons, both hard lessons and soft skill

Speaker:

lessons. But Sure. But we digress. But, I'm just

Speaker:

gonna I'm just gonna take that answering your question in my usual

Speaker:

long winded way. SQL Server Integration

Speaker:

Services, came along, and it was probably

Speaker:

the, again, it was the thing that

Speaker:

spanned the longest part of my career. Before that, I worked with something called

Speaker:

data mirror. That was the first, I'd I'd

Speaker:

say the the first system like that. First bit of software that

Speaker:

way. Before that, I was writing my own. So I

Speaker:

was reading from these plant networks and writing to all

Speaker:

sorts of stuff. And I got into SQL Server because

Speaker:

I crashed access back in the nineties. So I ran,

Speaker:

I collected a 1,000 points of data every second for a long

Speaker:

weekend. And I wanna say the access file grew to about 4 gigs.

Speaker:

When I went to open it and start doing some analysis on it, it turned

Speaker:

out it wouldn't open. So 4 gigs is nothing now. Right? You

Speaker:

can do that on a smartwatch. But back then, a server

Speaker:

struggled, to open the file system. If you go back far

Speaker:

enough, would have freaked out or anything over certain size unless it was, like, NTFS

Speaker:

or something like that. Right? Yeah. And this this

Speaker:

wasn't. This was, one of the other OSs. But

Speaker:

so, you know, I went I went to, altavista.digital.com

Speaker:

and typed in Microsoft database, and I saw this

Speaker:

listing for something called SQL Server, and that's how it all started.

Speaker:

Well, then I I got got in as, working

Speaker:

on a data warehouse, and part of my job moved

Speaker:

into the database part of it. I actually was hired to do the reporting piece

Speaker:

of it, and lots of cool lessons learned there as

Speaker:

well. But on the database side, they use Data Mirror. I

Speaker:

think that company is still around. I'm not sure. But this is like 25 years

Speaker:

ago. And it was it was so cool,

Speaker:

and I was fascinated that somebody had built software to

Speaker:

orchestrate this collection of data. I was like, wow.

Speaker:

That is a good idea. You know, it always makes me feel better, Frank, when

Speaker:

smart people come up with an idea that I've also come up with independently.

Speaker:

It makes me feel like, okay. Maybe I'm onto something. Go through all of

Speaker:

that, data transformation services or DTS, and then finally, SSIS

Speaker:

and this big block. And

Speaker:

what I've noticed and I kinda noticed this trend started

Speaker:

maybe 4 or 5 years ago. I people complained about SSIS before

Speaker:

that. Don't get me wrong. And a lot of it is

Speaker:

because are you sitting down? It's

Speaker:

hard. We're not making it up.

Speaker:

Comparatively, though, like, I I remember when I was at

Speaker:

barnesandnoble.com and which just goes

Speaker:

back a ways. So if you bought a magazine at Barnes and

Speaker:

Noble between 1996 and probably about

Speaker:

2012, 13, you

Speaker:

you interacted with the system I wrote, nice in the late

Speaker:

nineties or at least part of it anyway. So, you

Speaker:

know, that's how I learned EDI, right? Because we get these feeds

Speaker:

from publishers, literally a mainframe would dial up another

Speaker:

mainframe, download the file over a modem.

Speaker:

And, and this is how it worked. And what we did was we pulled down

Speaker:

the raw EDI files and I parsed it

Speaker:

and I had to do that and drop it into an informix database. So it

Speaker:

was a cool writing for GL scripts to to to take

Speaker:

that data in text format and then dump it

Speaker:

into an actual. You were doing data engineering. I was

Speaker:

doing data engineering, which is kind of funny. But like, you know, data engineering as

Speaker:

a discipline is not easy. Right? So SSIS being hard.

Speaker:

I mean, you know, brain surgery brain surgery is hard too. Right?

Speaker:

You you make a good point about it. And it, you know, it took me

Speaker:

a while, especially teaching it. And I would do

Speaker:

4 or 5 day course, originally with solid q and then

Speaker:

eventually on my own. I I wrote my own course. I

Speaker:

found myself adding to Eric's content when I would deliver the

Speaker:

material here. And don't get me wrong. Eric

Speaker:

is still a genius. He was then and he still

Speaker:

is. I just I I had a way of approaching

Speaker:

some, demos and examples that I felt kinda added

Speaker:

to the clarity of the information we were sharing. I

Speaker:

kind of expanded that out and wrote all my own material, my own I

Speaker:

use my own data, that I collect as part of my, weather station

Speaker:

here. And to this day, there are students that

Speaker:

are going through, recordings of that class.

Speaker:

The last recordings I made were back in December 2020,

Speaker:

and I recorded 3 courses on SSIS. The 4

Speaker:

day from 0 to SSIS course was, you know, will take you

Speaker:

from if you can spell SSIS to being a

Speaker:

functional, advanced beginner, low

Speaker:

end intermediate developer. And it was built for

Speaker:

that. It's got labs 13 12, 13 labs

Speaker:

that you do in 2 days, of that course. And then it talks

Speaker:

it kinda changes gears and goes to the care and feeding of SSIS

Speaker:

and ancillary topics. So

Speaker:

I learned a ton about the concepts

Speaker:

of data engineering on while as

Speaker:

while doing SSIS training and consulting and

Speaker:

development. So when I teach it, Frank,

Speaker:

I share these concepts that

Speaker:

I learned. Because you gotta keep in mind, this all came out around the same

Speaker:

time as a data warehouse toolkit book, by,

Speaker:

Kimball and his crew. And the in

Speaker:

fact, I don't know what the relationship was between Microsoft

Speaker:

and Kimbell, but I do know from the horse's mouth

Speaker:

that the, data flow task in SSIS

Speaker:

was modeled to load, Kimball data

Speaker:

warehouses. There's just a lot of functionality baked right in

Speaker:

that, you know, targets those star schemas, and, you know,

Speaker:

it's it's built to do that. There's so, you know, there

Speaker:

was that aspect of it. So at the same time, I'm reading

Speaker:

and learning and, you know, and then going out and teaching

Speaker:

and, you know, and and consulting. There's

Speaker:

this nice amalgam going on. I'm getting information from books.

Speaker:

I'm applying that information on consulting gigs. I'm

Speaker:

figuring out new ways to solve, you know, problems I hadn't

Speaker:

seen before, And then I'm training. So I'm just

Speaker:

rolling all that together. When I do the training, I'm sharing with people, hey. Here's

Speaker:

some first principles, if you will Right. Of data engineering.

Speaker:

And we call it data integration and BI back then.

Speaker:

And star schemas and why you use them and how they work and, you

Speaker:

know, kind of the trade offs that you get. Data explodes a little

Speaker:

bit. Talking about concepts like staging, data,

Speaker:

the benefits of it, why you like, how

Speaker:

you would wanna build your staging tables.

Speaker:

If you're reading from a flat file, everything in

Speaker:

that file is text. Now the text may be

Speaker:

numbers. It may be dates, but it's really just text.

Speaker:

So you built the stage tables with and bar charts.

Speaker:

So at, you know, stuff like that because you wanna get in and get out

Speaker:

just quickly. Memory than the way to do it would be in memory and then,

Speaker:

like, do validation as you do the insert and things like that. There's a there's

Speaker:

a 100 different ways to slice that. Yeah. There really are.

Speaker:

But, you know, when you did, that was that was just pieces and parts of

Speaker:

saying, okay. You know, Tim, I'm teaching you how to use this mechanism,

Speaker:

if you will. Right. SSIS. But I'm also sharing with

Speaker:

you how you would use it and then why you would

Speaker:

use it that way. And, you know, so there's more to

Speaker:

it than just the data engineering. And the point I wanted to make thanks,

Speaker:

Hector. Merry Christmas to you too, Hector. The data

Speaker:

engineering all by itself, just that world,

Speaker:

that's hard all by itself. Yeah. Absolutely. And then the tool

Speaker:

itself was extremely

Speaker:

flexible. And, you know, from the years that you and I have been sharing

Speaker:

about stuff, anytime you say it's it's flexible, you're

Speaker:

also saying, the the it's a sonic way of

Speaker:

saying it's complex. Right. And if

Speaker:

it wasn't flexible, people would say that it's too simple. And, like, it's just

Speaker:

one of those things where now that I'm in a job where I am in

Speaker:

a on the product group, what Microsoft would call PG, a product group or or

Speaker:

team. Yeah. We call it a BU. I I understand. Like, there's

Speaker:

only so many hours in a day that you have engineers and

Speaker:

there's time to market. You have to kind of make these trade offs.

Speaker:

And, you know. That's it. I mean, that's that that I mean,

Speaker:

I had this real eye opening moment with with I think suspect was the guy

Speaker:

who introduced us, who was an evangelist at Microsoft back in the day.

Speaker:

And, you know, I wanted some new shiny feature in Visual

Speaker:

Studio 2005. And, you know, I was complaining about it.

Speaker:

And he kind of pointed out like, look, even Microsoft has limited

Speaker:

resources in terms of people, time and testing and material and

Speaker:

things like that. And I was like, you know, I mean, my god, if Microsoft

Speaker:

has that problem, then I guess everyone has that problem. You know? It turns out

Speaker:

they're just a bunch of software developers just like the rest of us. Turns out

Speaker:

they're all humans. Although maybe now it's mostly AI. Who knows? But,

Speaker:

it's getting there. So so so, you know, I think we both kind

Speaker:

of set the stage for the controversy here. Right. SSI has

Speaker:

been around for at least 20 years,

Speaker:

maybe 25 and SQL Server itself. Let's let's remind folks the

Speaker:

history of SQL Server. It was originally who was it a

Speaker:

partnership with? Sybase? Yes. I believe it was a

Speaker:

Sybase product, completely. And I don't know if it was like And it was like

Speaker:

version 6. Got into the mix, and there was a collaboration

Speaker:

or something, and then they ended up with it,

Speaker:

owning it. That's my best guess on it. I actually

Speaker:

I I know I haven't spoken to her in a while, but I was I'm

Speaker:

friends with and and have co worked with, with

Speaker:

Caitlin Delaney. And she was

Speaker:

with Sybase. Oh, okay. Yep.

Speaker:

So, you know and did we have her as a guest on the show? I

Speaker:

know we wanted to. We we totally need to because that would be Yeah. Interesting

Speaker:

story because I first heard of SQL Server when I was at Barnes and Noble

Speaker:

because at the time we were ready to launch in 19 this is why I

Speaker:

left Barnes and Noble. We're ready to launch by Christmas of 96 with a

Speaker:

Yeah. Linux or Unix based based system based on Spark, Oracle,

Speaker:

and a few other things. No. I'm sorry. 4 g l. It was Ultimate

Speaker:

Formics. And, you know, we had the hardware. We had

Speaker:

everything set up. And then as the story goes, Bill

Speaker:

Gates and, one of the Riggio brothers who was the CEOs

Speaker:

of kind of co CEOs of Barnes and Noble at the time.

Speaker:

Bill Gates had kind of I don't know what he'd done, Jedi Mind Trick.

Speaker:

In August, September of, like, 96,

Speaker:

basically said, no, we're ripping everything we've built so far and we're moving

Speaker:

it over to Microsoft tooling, which at the time was not really mature. I

Speaker:

mean, it was this is like inter dev. I think we had a beta version

Speaker:

of visual inter dev. Yeah. Yeah. Which

Speaker:

was not the best product at the time. Right? It was, you know You know,

Speaker:

I used it At the time. At the time.

Speaker:

Yeah. I I used it, and if you came

Speaker:

from, like, cold fusion or some other development platform.

Speaker:

Yes. Was also awful. Yes. But yeah. So

Speaker:

So I started on inter dev. In fact, that was the first tool

Speaker:

that I I remember downloading for, Visual

Speaker:

Studio. I don't think I downloaded it. I think I went somewhere and bought a

Speaker:

CD or something. Yeah. Yeah. I think I found it in her dev 97 CD,

Speaker:

which was the the second or third version. But, I mean, I we we had

Speaker:

everything written in per on CGI Pearl scripts. Like, we had everything,

Speaker:

and it was just a very different era. But my

Speaker:

take was and this was my I was at the meeting with the CEO and

Speaker:

everyone else. Like, if we don't launch by this Christmas,

Speaker:

people are not going to use us as a habit. Amazon will

Speaker:

take the mindshare and this and that. And then then the

Speaker:

CEO said, sit down, s t f u. Basically, you don't know

Speaker:

how to sell books. You may know technology, but you don't know how

Speaker:

to sell books. Now we can look back at Jeff

Speaker:

Bezos' super yacht and his, you know, moon

Speaker:

missions and all that. These guys have super yachts and moon missions.

Speaker:

Right? They do not, actually. Oh. And my well, I

Speaker:

mean, I'm pretty sure they live in an oceanfront thing in Long Island. But,

Speaker:

he didn't know anything about selling books online either. So I can kinda I

Speaker:

can sit back here, you know, some, you know, good God almost 30 years

Speaker:

later and kind of be smug about it. Right? Right. But

Speaker:

it's just it's just funny. Right? Like, so so what's interesting is and I think

Speaker:

this really cuts to the bone of what this controversy is. And I

Speaker:

have the thing queued up. I can kind of show the screen where you posted

Speaker:

it, where

Speaker:

the fundamentals haven't really changed. Not at all.

Speaker:

Right. Yeah. Binary is still binary.

Speaker:

The debates about schema optimization and things like that are still

Speaker:

very much the same today as they were

Speaker:

20 years now. The numbers are bigger. The stakes are arguably bigger.

Speaker:

But for the most part, the fundamentals haven't changed. And and

Speaker:

I would say this is really where it kind of boiled down to. And this

Speaker:

is this is where the controversy starts. So buckle up, kids.

Speaker:

Let's see. I will share the screen. There's actually

Speaker:

2. I think you talked about one of them. The choices? I'm only

Speaker:

aware of 1. This is the this is the one post,

Speaker:

and I'll drop you the link, to to one of

Speaker:

the others. Right? Yeah. I'll put it in a chat.

Speaker:

I'll I'll send that to you here. Just a second. Along can can understand. So

Speaker:

this is what I saw. And it was basically Kendra

Speaker:

Little, who was a I would say legendary. Scary

Speaker:

smart. She's legendary in in in in the sequel

Speaker:

kind of family, right? Hashtag sequel family. Is that still a thing? I

Speaker:

think so. She's legendary. She used to work at Redgate. I think she worked at

Speaker:

Microsoft, too, at a time.

Speaker:

I think so. But I'm not positive. Well, we can look at LinkedIn. If only

Speaker:

we had that information. But anyway,

Speaker:

so you basically so if you read this and she says

Speaker:

so it says strong disagree. Don't run after every shiny

Speaker:

thing. Again, that is good advice. But, Lord, I would assume

Speaker:

that is her saying. But Lord, don't learn SQL Server and

Speaker:

SSIS if you want to be a data engineer. That's 2 decades too

Speaker:

out of date. Sincerely, a SQL Server expert. I think that's

Speaker:

a bit harsh. She's right about this part. Don't try to change chase out

Speaker:

there if you show anything. So apparently, I can't, and I

Speaker:

can't select a thing. So I read that,

Speaker:

and and I know there's more controversies that are in there as I as

Speaker:

I look at the thing. And you said I humbly submit

Speaker:

data engineering may be accomplished even in the year of our Lord, 2024,

Speaker:

using T SQL, this foul year of our

Speaker:

Lord, 2024. To borrow a phrase from Hunter Thompson,

Speaker:

T SQL, SIS, ADF, Fabric Data Factory

Speaker:

and other technologies supported by Microsoft, which I thought

Speaker:

clearly Microsoft's not going anywhere. Right? Yeah.

Speaker:

And so I basically said

Speaker:

fundamentals never grow out of style. Then I think I wrote again somewhere

Speaker:

like when I looked at the context of it because that's

Speaker:

not what you're supposed to do apparently in social media. You're supposed to react right

Speaker:

away. I did that, by the way, Frank.

Speaker:

I'm guilty. I did not go look at the context. So this is the

Speaker:

original context. I well and, you know,

Speaker:

you pointed that out and I'll I'll be honest, I I'm still

Speaker:

running on second hand information. I have not yet clicked it and gone back

Speaker:

to, to our guest post. Now I can see

Speaker:

it. Now you can see. So so this is what struck me is, well. This

Speaker:

is what struck me as odd. And I know we had talked about it and

Speaker:

I had talked about it. You talked about it. We talked to each other about

Speaker:

it. You know, we talked to our dogs about it. I don't know. Like, but

Speaker:

like, it was kind of like so so when I read the thing, it gets

Speaker:

even stranger. Right? So Yeah. He was talking to

Speaker:

someone, and I guess strictly speaking, even this is secondhand knowledge. Right?

Speaker:

But, so that's the data

Speaker:

scientist in there. Like, well, strictly speaking, this data is also all right. So

Speaker:

so look looking to someone to get a job as a data engineer. Okay?

Speaker:

Right. Unfortunately, he was learning about LLMs and other ML stuff.

Speaker:

I'm like, that's not data engineering. That's a

Speaker:

AI engineering or data science type work. That's more like I think he's he's trying

Speaker:

to set him straight from that. He's like, you're learning the wrong things. That's how

Speaker:

I read those two sentences. I mean, I would say you're learning the right things

Speaker:

if you wanna be an AI practitioner.

Speaker:

Yeah. But I wouldn't call I wouldn't, you know, read up on Langchain,

Speaker:

you know, Ollama and anything LLM and all that stuff

Speaker:

and then call myself a data engineer. I mean, that's Yeah.

Speaker:

That's like a cardiologist cutting up you know, doing your taxes. You

Speaker:

know what I mean? Like Sure. Or or cutting open your brain. Like,

Speaker:

I mean, I suppose there's some similarities, but it's not

Speaker:

the same. Well, I I do like bullet number 1.

Speaker:

Yeah. You know, let's see that. Yeah. This is something I think

Speaker:

that you point out quite a bit. So when you give your talks, either on,

Speaker:

SSIS or ADF, you ask

Speaker:

people, like, how many people here have workloads running in the

Speaker:

in the cloud or right? And then only a quarter of the hands

Speaker:

go up. Well, it's it grew to about

Speaker:

40% the last time I did it, but it's been over a year

Speaker:

since I since I spoke live and asked that question, ran

Speaker:

that little survey. There's a slide usually hidden in,

Speaker:

all of my presentations that has survey up near the very

Speaker:

top. Right. You know, it just and that's that's what the survey is about.

Speaker:

And often, especially

Speaker:

say the last, I said it's been over a year. So let's say from a

Speaker:

year ago and then back maybe 4 years of asking that

Speaker:

question. Almost every time I did that and people

Speaker:

didn't see everyone else's hand go up with theirs,

Speaker:

the those people would come up to me at the end. And usually, their

Speaker:

first comment was, I didn't know

Speaker:

that it was most of the people here were not doing

Speaker:

production jobs in the cloud at this point point with data. I thought we

Speaker:

were way behind and we're the only ones. And my response would be

Speaker:

2 fold. The first would be, that's because Microsoft

Speaker:

marketing is doing an astounding job. That is not a swipe

Speaker:

at Microsoft Marketing. If anything, they deserve a raise

Speaker:

because they were so effective at communicating

Speaker:

how cool this is Right. And how these larger

Speaker:

companies are doing it. You all of the big shows, keynotes,

Speaker:

There's some list of big companies, and they're almost all of them or

Speaker:

companies that you'd wanna work for because it's prestigious.

Speaker:

That's so I don't know if you want it on my personal market. Seem like

Speaker:

everybody's doing it. And I I know I know for a fact it's not always

Speaker:

true because when I worked in the sales for Microsoft, we

Speaker:

would encounter them and there was a pejorative term used internally called server

Speaker:

huggers. Okay. Right. Because like, oh, they're

Speaker:

server huggers. They'll never go to Azure. Right.

Speaker:

So so now, you know, I used to see that it's server hugger as a

Speaker:

pejorative. Now in light of kind of maturity and

Speaker:

working, with more customers and being

Speaker:

more aligned in the open source kind of realm and dealing with

Speaker:

international customers who have very real regulatory concerns.

Speaker:

You're right. Call them smart. Right. It's not, you know,

Speaker:

I didn't so much drink the Kool Aid is I became one with

Speaker:

the Kool Aid. You couldn't tell where I ended and where it began, where I

Speaker:

kind of had this deep programing experience

Speaker:

of. Yeah. That's not always the answer. Right. And I

Speaker:

think that dealing with LLMs and AI and things like

Speaker:

that, I think really makes that more obvious.

Speaker:

Right? Yeah. I totally agree with that. And, you know,

Speaker:

to be fair, and I wanna start with, you know, with being as

Speaker:

positive as I can about this. If I was It's not a negative on any

Speaker:

from scratch. Wasn't. No. I I'm just saying. But if I'm starting

Speaker:

today, day 1, and I wanna go, be a

Speaker:

starter company and and work with data, I it

Speaker:

would be foolish. Foolish to start

Speaker:

today and not go to the cloud. Absolutely.

Speaker:

So and and the reasons are numerous. Yeah. Here's the

Speaker:

thing. The companies there are a handful of

Speaker:

companies, really large companies, mind you,

Speaker:

that have started sent in the cloud age. Let's just call it

Speaker:

that, or the Internet age. There's a small number of them

Speaker:

that have gone on to be huge, but they are really huge.

Speaker:

They're overpowering, oversized. They're larger than the

Speaker:

companies that are previous to the Internet age companies

Speaker:

that have made their way into the Internet. And that's that's

Speaker:

not an accident. However, those

Speaker:

companies, the brick and mortar companies, are the companies calling consultants like

Speaker:

me and asking me to help them either

Speaker:

transition from a purely on premises

Speaker:

environment, managing their data into a cloud environment

Speaker:

or the and back before that, in 20 years ago, when I was first

Speaker:

getting called to do this kind of work, they were just trying to figure out

Speaker:

how to collect their data and then analyze it. And

Speaker:

so, you know, SSIS was a great way to do that. T

Speaker:

SQL was everywhere. Azure Data Factory didn't exist. Yes.

Speaker:

Much less Fabric Data Factory. And so we were just trying to solve

Speaker:

this business problem. And I was trying to couch couch my responses,

Speaker:

especially there was a thread that that got combative, I

Speaker:

would say. And, you know, as we went went down

Speaker:

through that, and I kept trying to say, and

Speaker:

I did. I said over and over again that, you know,

Speaker:

my job is to go help solve these business problems.

Speaker:

And what I meant by that opening line,

Speaker:

that T SQL, SSIS, Azure Data Factory, Fabric Data

Speaker:

Factory, even in 2024 of viable ways to accomplish

Speaker:

data engineering. I I meant that, and I'm not back backing

Speaker:

off that for one minute. I I misunderstood the context of the question,

Speaker:

and I didn't really understand until I listened to your stream

Speaker:

last night where you had gone back and done what I should have done and

Speaker:

read the original post. And you said, yeah. It's kind of a mixed mesh

Speaker:

post. The guy's talking about data engineer, but he's also talking about LLMs

Speaker:

and machine learning. And in the middle of that, he

Speaker:

throws out, you know, this comment about SSIS,

Speaker:

how 90 99.5% of the companies are still using. I

Speaker:

think that estimate is high. I I think it was more of

Speaker:

a, let's make this point that there's still a lot of companies out there

Speaker:

using, T SQL and SSIS to

Speaker:

accomplish this. And this is something that I can't find the comment that I put

Speaker:

in there. I'm looking for it now, but. Yeah. Some of the

Speaker:

comments I can't get to anymore. I don't know why. Maybe they were

Speaker:

reported or maybe they're. Who knows? Right. I mean, social media

Speaker:

does weird things to people psychology. But the

Speaker:

point that I think that I wanna say

Speaker:

that Kendra overlooks. I think everyone overlooks it.

Speaker:

Data and back end systems have a

Speaker:

longer shelf life. And I say

Speaker:

this as someone who was, what, 10, 15 years ago,

Speaker:

strongly ensconced in client development. Right? Whether it was your

Speaker:

Windows, Windows Phone, or other types of Windows

Speaker:

based devices. Right. Or web development. Right.

Speaker:

Those technologies turn over pretty quickly.

Speaker:

Right. You know, you're likely to get

Speaker:

multiple updates per year on a device phone, like an app on

Speaker:

a device, but you're likely to never see,

Speaker:

a radical change or redesign. You'll you'll see a

Speaker:

radical change or web redesign of a website or portions of a website

Speaker:

couple times a year maybe. Right? But you're never gonna see

Speaker:

a radical redesign of a data back end system,

Speaker:

but once or twice a decade. And It's true.

Speaker:

Yeah. And mostly what drives that is scale,

Speaker:

not features. Right. Not features. It's just date or just tend yeah.

Speaker:

Exactly. Right? So if you a 100 x and who could who could accounted for

Speaker:

that, you know, going to the project started. It's a problem. Still a

Speaker:

wonderful problem to have, but but a problem nonetheless. Well, and there's

Speaker:

also the fact that, you know, it's 2,000 whatever now, and

Speaker:

there's still mainframes running. Right? There are still not not not to to

Speaker:

knock on IBM too hard because they are the company of Red Hat. But,

Speaker:

d b 2 is still around, still getting updates. Still backbone of

Speaker:

many Fortune 100 companies that also share the stage with Satya

Speaker:

at these big Microsoft events too. Right? Like Which was mind blowing

Speaker:

for people from the old days of Microsoft. Right? Well,

Speaker:

that's a whole other thing. But, like, you know but, I

Speaker:

mean, it it really boils down to, like, these technologies have a longer shelf

Speaker:

life. So if something is 20 I think we get

Speaker:

hung up. 1 of the threads sub threads in here gets hung up on, you

Speaker:

know, 30, 20 year old technology. We're thinking that, well, you know,

Speaker:

there's a meme of the the little monkey puppet, like, you know,

Speaker:

giving a side eye and then goes like a cringe face, like, and a side

Speaker:

eye where it's like, oh, Windows is, you know, I don't know, 40 year old

Speaker:

technology. And I'm thinking, like, some, you know, Unix people or Linux slash

Speaker:

Linux people are, like, 40 years old is old. You

Speaker:

know? I mean, this stuff goes back much further. So it's but

Speaker:

it's still like and that's not a knock. It's just

Speaker:

No. It's just now that we're in this

Speaker:

industry now for as long as we've been in it and the

Speaker:

industry's been around longer this long, there's just

Speaker:

stuff that is gonna just start aging out, but it doesn't age out as

Speaker:

quickly as we think it does. It's not like it's not like the iPhone. Sure.

Speaker:

Right? Where you the iPhone I don't know what number they up

Speaker:

to. 16, 17. Right? Oh, well, suddenly my iPhone 15 looks bad, and that

Speaker:

happens every year or 2. You this you don't see that in

Speaker:

database systems. Right? The only impetus to really move, say, from, like, SQL Server

Speaker:

2,005 to 2019 is updates stop going. Right?

Speaker:

And that's a whole big project. Yeah. The maintenance cycle. So it goes out

Speaker:

of maintenance. And then you worry that if something crazy happens,

Speaker:

you can't get support for it. And that's

Speaker:

kinda like, you know, it's it it's sort of it I'll say this.

Speaker:

It's analogous to your phone starting to run slow for some unknown

Speaker:

reason. That's funny. Something something on SQL.

Speaker:

Yeah. Something something. Sybase something. Well, and you think about all the

Speaker:

I mean, I mean, and contrary to this, contrary to that statement of these things

Speaker:

have long life shelf lives. Yeah. Is the fact that I mentioned

Speaker:

Informix earlier. Raise your hand if you heard of Informix. Right?

Speaker:

So I've heard of Informix. You've heard of Informix? I mean, we don't count.

Speaker:

But but, like no. But, like, I remember my first

Speaker:

experience with Informix was because some alum of Fordham had

Speaker:

because it was a big shot at Informix. And, and I think we had somebody

Speaker:

who was also a big shot at Silicon Graphics. So we had SGI machines

Speaker:

running at Formix. Right. So I remember my first UNIX I used was an

Speaker:

IRIX system. Right. Which most people today

Speaker:

wouldn't even know what what that means. Right. And, you know, but Informix is

Speaker:

out of business. Sybase is gone.

Speaker:

I can't even think of other names. I know there's more.

Speaker:

Right. But really, the only things that it those have probably been

Speaker:

migrated to SQL Server or Oracle. Well,

Speaker:

or some form of Postgres or something like that. And I

Speaker:

I hear you. You know, there's there's an argument to be made

Speaker:

for, you know, the the cost of maintaining

Speaker:

old software. Right. There there definitely is.

Speaker:

I'll say this about SSIS. I if you learned

Speaker:

SSIS in probably in 2,005 era, between

Speaker:

2,005 and 2,008, that engine

Speaker:

I I don't know how many lines of code were

Speaker:

changed before it was upgraded to 2008

Speaker:

or r two, but it changed. There were some performance tweaks in there. It was

Speaker:

obviously, faster. And then again, that happened in

Speaker:

the 2012, error when we saw

Speaker:

I love that SQL tab.

Speaker:

You know, it's dead. Long live Crystal. So was Crystal ever database or was it

Speaker:

just Crystal Reports? Reports is all I knew. I didn't know about it as a

Speaker:

database. That's all I I I use Crystal Reports and

Speaker:

my favorite thing was it filling up the drive because

Speaker:

it kept caching things. But I

Speaker:

remember the whole idea of just because you place it somewhere, it doesn't

Speaker:

mean it's actually gonna end up there. Like, the whole thing is

Speaker:

but sorry to cut off. That's okay. But SSIS in general,

Speaker:

if you learned it even in, you know, 2006, came out

Speaker:

in November, I think, of 2,005. Even if

Speaker:

you learned it then, it at at a fundamental level, it

Speaker:

hasn't changed that much. And whereas you'll see other software

Speaker:

you Visual Studio is, you know, a software development platform

Speaker:

that allows you to do c sharp and v v and, you know, all of

Speaker:

the stuff. And it allows it still supports for us. I know. I haven't tried

Speaker:

v v in 9 years now. So

Speaker:

it's been a while. But if you look at

Speaker:

how much most software changes from a developer

Speaker:

perspective, and SSIS is software development. So as your

Speaker:

data factory, and any data engineering, that software development,

Speaker:

SSIS is probably in the 95%

Speaker:

of what it was. If if you knew the fundamentals

Speaker:

in 2006, you know those fundamentals in

Speaker:

2024. And Right. Part of the decision

Speaker:

to go make the upgrade, we talked about, you know, maintenance wonders and stuff, and

Speaker:

I I get it. And it's not the same as your phone slowing down. I

Speaker:

said that, but that's a bad analogy. But Well, it is also

Speaker:

it's also, I think, also very relevant to Windows 10. Right? If you're

Speaker:

on Windows 10, your updates are gonna stop in October. That's

Speaker:

true. I don't wanna get on that soapbox and rant. Sure. No. But I

Speaker:

I mean, there's I get reasons for that as well. I don't

Speaker:

like that it's gonna change because I like Windows 10. But,

Speaker:

but yeah. Well, there's there's I'm gonna join you in not

Speaker:

going down that road. But I'll say this. Hey,

Speaker:

Maddie. How are you? The, the

Speaker:

just the fundamentals of data engineering haven't changed. And the

Speaker:

tool itself, you know, if you knew it back then.

Speaker:

And it's, you know, you know it now. And so if you

Speaker:

learned it now, you could go back then and still work

Speaker:

in the previous versions of it with, very little headache. And

Speaker:

that speaks a lot speaks volumes to the,

Speaker:

the team that designed and built that. And

Speaker:

so in addition to the technical reasons for doing this, the business

Speaker:

reasons, kind of revolve around one of my favorite

Speaker:

phrases. I mentioned this in today's newsletters. It's a compelling

Speaker:

reason. Do you have a compelling reason to make this change?

Speaker:

And business people think about this all day every day because

Speaker:

the amount of money that they make, the profit is based directly

Speaker:

on the amount of money that they, you know, they spend and are they getting

Speaker:

this value for it. So if they can improve the performance of

Speaker:

something, say 10 times, and a result of

Speaker:

that is they get, 5 times as many customers,

Speaker:

then that's not a bad investment. That'll just work. But

Speaker:

if you're coming to me and I'm a business that existed

Speaker:

before the Internet, if you're coming to me and saying,

Speaker:

I want you to change to this completely different model,

Speaker:

where, you know, and and the way it's presented

Speaker:

often is you can save money. And that's

Speaker:

true because if I start a new business today, I'm I

Speaker:

couldn't even compete. I'm not gonna be able to stand up the

Speaker:

servers, you know, take that time and buy that hardware and float

Speaker:

that that inventory that I need to manage all that. Whereas,

Speaker:

I can I can pay rent essentially every month

Speaker:

on that service? Right? Right. I I always like to say I

Speaker:

always like to say if I wanted to start a bookstore today,

Speaker:

right, versus 1996,

Speaker:

right, or 1995, depending on when you wanna say when they started.

Speaker:

I mean, Barnes and Noble spent a ton of money. I don't have the exact

Speaker:

number, but I can there's probably tens of 1,000,000 of dollars, probably closer to a

Speaker:

$100,000,000 to just before they had their first customer.

Speaker:

Right? Wow. And that but that was the heyday of the

Speaker:

dotcom. Right? Because they were you know? But then

Speaker:

but if you wanted to start a bookstore today, whether or not it's a good

Speaker:

idea, let's let's just suspend our disbelief for a

Speaker:

second. You can probably do it on a on an average credit

Speaker:

card limit. Because

Speaker:

because your IT is enabled. Right. And and you pay like I

Speaker:

said, you pay fractions of what you would have to do in the brick and

Speaker:

mortar. And most of the initial spend isn't gonna be your servers or hardware.

Speaker:

It's gonna be in development and marketing. Right? Getting the word out

Speaker:

because it's such a noisy market. It did the the market has radically changed.

Speaker:

And I also think imagine go ahead. I'm sorry.

Speaker:

What? Imagine? I was gonna say imagine that you've built

Speaker:

Right. This infrastructure on premises already. You've got all of this done.

Speaker:

It's a sunk cost. We can debate about how to feel about sunk cost.

Speaker:

Right. But it's there. You spent the money and it's there. And

Speaker:

you're not gonna get that 5 x income boost when you move to

Speaker:

the cloud. In fact, in some cases, not

Speaker:

all by by any stretch, but in enough cases, you

Speaker:

move to the cloud and it costs you money. Because when

Speaker:

you're getting the presentation about starting using the metrics of this

Speaker:

new company being started today Right. You're, you know, you're told the

Speaker:

truth. You're not being lied to at all in in any of this.

Speaker:

But often, systems that were designed software

Speaker:

and front end back end systems that were designed, you know, from

Speaker:

the nineties through the mid early 2000.

Speaker:

Those systems were architected in a whole different mindset

Speaker:

of what's the prevalent mindset for today. And as a result of

Speaker:

that Yeah. Yeah. As a result of that, one of the things missing from

Speaker:

the spreadsheet calculation that you're gonna get the ROI

Speaker:

from moving off your on premises servers to the cloud is

Speaker:

that couple of $1,000,000 and about 18

Speaker:

months, of the hit that you're gonna have to

Speaker:

spend rearchitecting Yep. All of your systems so that they

Speaker:

now fit today's paradigm. And frankly,

Speaker:

if you are interested in doing that, you you could go do that

Speaker:

at any you could have done that at any time in the last 10 years

Speaker:

and made that shift. But people didn't do

Speaker:

it because the business people didn't do it because the ROI was not dead. There

Speaker:

was not enough return on that investment. If they wanted to, they

Speaker:

would have spent that money then, but it wasn't gonna improve the bottom line.

Speaker:

In fact, it was gonna hurt the bottom line. And so you see

Speaker:

companies now make this move into the cloud and

Speaker:

then, yeah. Yes. That is a

Speaker:

that that's an astute question to ask. So for those who may be

Speaker:

listening and not viewing this, it says is SQL SQL dev

Speaker:

d b a says, I use Brent's and I'm assuming Brent Ozarks. Brent

Speaker:

Ozars. Problem are you trying to solve by changing this for justifying

Speaker:

upgrades? Brilliant. That's that is brilliant. And he's

Speaker:

right. And the you know, but it's compelling to hear and

Speaker:

read the case studies of of companies that, you

Speaker:

know, were able to do use to to access

Speaker:

$10,000,000 worth of hardware, like you said, on a credit card. And think,

Speaker:

wow, what would that do for us? And the answer is sometimes, yeah, it'll

Speaker:

revolutionize your business. You'll 10 x coming out of this. But other

Speaker:

times, it's like, no. You'll point 8 x.

Speaker:

You know, this isn't as compelling. So it's interesting because, like, I think

Speaker:

there's a number of and I found the article. I'll pull it up. But but

Speaker:

one of the examples, it was either Dropbox or Box. I forget which company it

Speaker:

was. But but they had basically started off, I think, in

Speaker:

AWS. Mhmm. And then they got to a certain size. They actually

Speaker:

figured out it's cheaper for them to design their own servers that are optimized for

Speaker:

mass storage Mhmm. Than doing it. So they started building their own hardware and their

Speaker:

own stuff. But I could tell you, if they were a startup and they went

Speaker:

to a VC saying, we want to start with this on prem, they would have

Speaker:

been laughed out of the building. Yep. Today. Yes. Today they would

Speaker:

have. They mean, you know, and it's

Speaker:

it just shows that the the shifting economics of cloud versus on

Speaker:

prem and and other types of things that I don't think people really have figured

Speaker:

out yet. So this was a really interesting I'm gonna share this tab if

Speaker:

I can show it on the screen. Sources. But that that

Speaker:

use case is you can't, you know, having the compelling

Speaker:

reason to migrate to the cloud, and you can do that upfront.

Speaker:

It's harder. But exactly what you're showing there, you're

Speaker:

sharing that that idea of leaving the cloud, that's

Speaker:

growing. And it it's growing across the board. And I

Speaker:

one of the, metrics for that that's

Speaker:

directly related to what we're talking about here today with

Speaker:

with data engineering, is that there

Speaker:

there's been an increase in 2024 in the

Speaker:

number of, people that reach out to me to talk

Speaker:

about, SSIS help with their systems.

Speaker:

And, I mean, I do consulting in, you know, ADF and fabric, and

Speaker:

most of my consulting has been in ADF. When SSIS was involved,

Speaker:

it was in lifting and shifting SSIS into an Azure SSIS,

Speaker:

integration runtime. But all of a sudden, after

Speaker:

2, 3, 4 years of that, that shifted this

Speaker:

year. And people started reaching out to me with SSIS on

Speaker:

premises consulting things, and I kept up with it. So I was

Speaker:

able to do it. And but there's other

Speaker:

evidence that I will not share. I probably I may be able to, but I'm

Speaker:

just not going to. But it's even better evidence than my

Speaker:

anecdotes about people more people reaching out to me. Right. That

Speaker:

the amount of SSIS being executed in

Speaker:

the world has increased, and it's a double digit percentage

Speaker:

increase just in the past few months.

Speaker:

And I I think I now this is where I start speculating, and I

Speaker:

don't know the answer to that. But we have a our mutual

Speaker:

friend that we, another mutual friend you and I connected with

Speaker:

in November of 2025. Sorry. 2005. Like in the

Speaker:

future. Recently recently worked for a

Speaker:

year and a half, 2 years for this large agency that's

Speaker:

not part of the government, but does money supply stuff.

Speaker:

Oh, okay. I know. I know. Yeah. After getting his MBA from Sloan, you

Speaker:

know, which no. Sloan. No. Sloan is

Speaker:

important. The, you know MIT. School with MIT.

Speaker:

Right. He's a graduate with that. Super smart.

Speaker:

He shares with me when I'm telling him this story, I give him that stat,

Speaker:

and he says, here's what's going on. Economically,

Speaker:

money is more expensive today than it was. And

Speaker:

so he said he said that as he's telling me this as a

Speaker:

cautionary tale because he says it's gonna change. It's good. Money's gonna get

Speaker:

cheap again, and people are gonna flock back to the cloud. That's his

Speaker:

theory that it's all being driven by money, and I don't think he's wrong,

Speaker:

especially I think that's one level. I think that's one lever. I think there's

Speaker:

more than one lever. That is certainly a big one. But I you

Speaker:

know, as someone who I you know, my previous role at Red

Speaker:

Hat and my current role at Red Hat, I have to think globally. Right? And

Speaker:

we don't again, not a commercial for Red Hat even though the

Speaker:

fedora is there. You know, one of the things we do

Speaker:

is we basically provide a data platform end to end that

Speaker:

can run-in any cloud on

Speaker:

prem or, you know, one of the hyperscale. Or hybrid. Yeah. Yeah. Or

Speaker:

hybrid. Right? Where and there was one customer that I spoke with

Speaker:

before I won opportunity to leave. They were, big government agency.

Speaker:

And this big government agency, you know, they have

Speaker:

their own data centers, even though there was a push to

Speaker:

get rid of them all. But they also have because of way contract

Speaker:

government contracts work in the US, they had, foot

Speaker:

you know, money to spend in AWS, money to spend on Azure, and I think

Speaker:

even money to spend on Google Cloud. So the one

Speaker:

advantage that we had that the other ones couldn't is that the he called them

Speaker:

the soft costs of training people how to do he'd do the same

Speaker:

thing to do linear regression in SageMaker and

Speaker:

push them out of production in SageMaker and Azure and in Google.

Speaker:

Right? Yeah. And this was one tool. You learn it

Speaker:

once. You administer it once. The same glass.

Speaker:

It was the same thing. I think those environments are very real. Now those are

Speaker:

probably limited to large customers or kind of the government

Speaker:

agencies that have these kind of contracts and things like that.

Speaker:

Yeah. But also Mhmm. You have a number of

Speaker:

countries that it's just not a good look to move

Speaker:

your data out of country. Right now, in the

Speaker:

US and Canada, we don't have this issue because there's plenty of all the hyperscales

Speaker:

have footprints in Canada and the US. But if you're in

Speaker:

Latin America, which is this where this example comes from, right, there's only

Speaker:

at least as Red Hat defines Latin America, includes Mexico, and basically all the

Speaker:

way down to Antarctica. Mhmm. And only just,

Speaker:

like, 30 some odd countries. Right? Someone's gonna write me hate mail saying that this

Speaker:

is the exact number, but let's just keep the math simple. It's 30. We

Speaker:

love those mail. We do. We love we love the mail. We learn things every

Speaker:

time. Right. I talk about them personally when I get corrected at

Speaker:

at the dinner table because I wanna share that with No. I mean, it's it's

Speaker:

good. I'm not saying don't do it. I'm just trying to keep the math simple

Speaker:

because it's Friday before, you know, basically, we're sure holidays. Sorry, Frank.

Speaker:

I I derailed you. No. That's fine. Only 3 countries

Speaker:

from Mexico down to Antarctica have hyperscaler presences.

Speaker:

Now, the 4th one in the but out of 30.

Speaker:

Right. So it's actually 10% or less realistically. I think it's like

Speaker:

37 countries. I asked Wikipedia and stuff like that.

Speaker:

So less than 10%. Right? Right. If you're in a

Speaker:

country that doesn't have a footprint. If you're in that 90%.

Speaker:

You have to ship it out as a country. You have to be okay with

Speaker:

that or do roll your own solution on a thing. So there was a

Speaker:

government we we won a big contract because they

Speaker:

wanted to do advanced AI and they wanted

Speaker:

to keep it in country. Right? Doesn't necessarily have to be on prem. Could just

Speaker:

be, like, you know, an Equinox data center down the street or something like that.

Speaker:

Right. But within their thing. And it was a government agency, so it wasn't

Speaker:

computer science or even data science. It was political science that really kinda was the

Speaker:

driver there. Right? Because if I'm in country x and I have to move

Speaker:

my and I'm a government agency in country x, I have to move my data

Speaker:

to a sovereign country y. Not a good look.

Speaker:

Yeah. Right? And,

Speaker:

you know, would it really matter? I don't think so. Like, in a but

Speaker:

from a legal point of view, it kinda does. Like, where the data resides in

Speaker:

the residency. And I think if you go to the Azure website

Speaker:

now, they'll actually tell you where the data resides. And they actually interestingly

Speaker:

enough, they get down to granular, at least on the US side to the

Speaker:

state. Right? So, like, it'll say, you know, Virginia and

Speaker:

stuff like that. We have a comment from, or not. I'm gonna hide my

Speaker:

screen so I can look up the Azure map and kind of

Speaker:

demonstrate that. And then I have to figure out where the sources are. There we

Speaker:

go. So you wanna read the comment while I do that while I'm distracted? Sure.

Speaker:

So so many comments, but there are clients who are

Speaker:

considering the technology used in the software package, and they

Speaker:

may escape when they see the old school,

Speaker:

stuffs. I'm so maybe.

Speaker:

And and that may be, you know, I hate to be that guy that says,

Speaker:

oh, that use case is invalid. I don't think so. I'm not aware of it.

Speaker:

That doesn't mean it's invalid. I'm not aware of a lot of things.

Speaker:

But but maybe. And, you know, definitely

Speaker:

have mixed emotions, about that. If you're buying

Speaker:

because the the if you're a client and you're

Speaker:

buying from, some company and you

Speaker:

decide to go to a different company because of the

Speaker:

technology stack that's being used behind there, I

Speaker:

don't know. I think that says more about you as a client than it does

Speaker:

about the company. If they're delivering the service and it's, you know,

Speaker:

the the the rules of data engineering are you get accurate

Speaker:

data as fast as possible, and those priorities

Speaker:

are in that order. Yeah. I don't think the old school stuff second.

Speaker:

The only risk of the old school stuff is it's still maintained or there's still

Speaker:

security packages. That would I mean, if I were in that

Speaker:

position, I would be like, oh, I mean I mean, if you're still using, say,

Speaker:

Sybase 6. Right? You know, like Yeah.

Speaker:

You know, got a little there's definitely a line there, and it's drawn

Speaker:

based on it's drawn different places first. And some of the

Speaker:

reasons, that it is drawn in different places is

Speaker:

security is huge these days. I mean, that's gotta be your number one

Speaker:

concern. And and, you know, it it

Speaker:

goes from there. But if you're delivering the service securely,

Speaker:

I'll just pick that one, then I would say

Speaker:

that, you know, that if if you

Speaker:

lose a client because you're not using the new shiny,

Speaker:

I don't know what you can do about that. I'm trying to think I'm not

Speaker:

gonna say the client's wrong for feeling that way. They probably have valid reasons

Speaker:

for feeling that way. But if, you know, if they wanna if they

Speaker:

wanna make that decision based on that. And I'm looking at Frank's graphic here

Speaker:

of the is that the data centers? This is the one I was telling you

Speaker:

about. Right? So this is this is just Azure, but I would say it's a

Speaker:

pretty good proxy for the other hyperscalers. Right? Mhmm. I would

Speaker:

say Azure at one point had more. I I haven't kept up.

Speaker:

Well, that was one of our talking points when I worked at Microsoft. Right? We

Speaker:

have more than to be honest. Right? But I would say if it's not exactly

Speaker:

more, it's close enough. Right? So there's Mexico. You see the United

Speaker:

States is pretty well covered. So is Canada. Right? So if you were a Canadian

Speaker:

company, you had to keep it in Canada. You had an option. Right?

Speaker:

Yeah. If you're an American company, you have pretty good choices.

Speaker:

If you're Mexico, yep. But if you're in any of these countries in Latin America,

Speaker:

down through South Wales. We only had okay. So now there's Chile.

Speaker:

Okay. Gotcha. Right? So I'm sorry. Now there's

Speaker:

4.

Speaker:

So my math is going to get more complicated right away. Right. So, there's

Speaker:

Brazil. Actually, no, there's still 3.

Speaker:

So there is no footprint for Azure in Argentina.

Speaker:

These little blue things you see, that is Colombia. Those are networking

Speaker:

pops. So basically, from a

Speaker:

networking point of view, if computer science were the only thing that would matter,

Speaker:

then that be that would be acceptable. But data residency is

Speaker:

the issue. So if I go here, the US East 1.

Speaker:

Right? It'll tell you that its

Speaker:

location is Virginia and it's stored at rest in the United States.

Speaker:

Like like here.

Speaker:

Is there any more details? There isn't. US.

Speaker:

Yeah. They used to. Yeah. They don't talk much about that,

Speaker:

about where they are. But one of the

Speaker:

US East Georgia. 1 of the Yep. The US

Speaker:

East 2 is down in Danville, isn't it?

Speaker:

They are or Mecklenburg. I forget which. Well, essentially, they chose a

Speaker:

picture of Richmond. Right.

Speaker:

I mean, these are, you know, with

Speaker:

we kinda touched briefly on on politics in a geopolitical

Speaker:

slash, sovereignty strategic way.

Speaker:

Right. These are huge. And I I know I'm not the one, thinking

Speaker:

about it. But Look at this. Yeah. There's one in Israel. Those satellites, Frank,

Speaker:

are getting it, by the way. Those those

Speaker:

satellites on the graphic, those things are moving at way faster than

Speaker:

normal satellites. Oh, yeah. Yeah. I mean, satellites are going to

Speaker:

change things, but, like, in terms of where the data sits at rest is really

Speaker:

where because ultimately, I think it really boils down to when are the

Speaker:

local police going to barge down a door with a with a court hopefully

Speaker:

with a court order and basically copy everything. Right. That's really what

Speaker:

matters. Right. That turns out that's really what ended up mattering. But

Speaker:

if you look at the Middle East, right, like 1, 2,

Speaker:

3 countries have it.

Speaker:

Right. And that entire region, you

Speaker:

know, obviously with geopolitical tensions being what they have been for a number of

Speaker:

years. Yeah. Moving your data center to any one of these countries may be an

Speaker:

issue for you for your organization or your

Speaker:

regulatory. Right? Europe is kind of the same thing, right, where, you

Speaker:

know, there's Switzerland, there's Italy. And I

Speaker:

know that there's different kind of things in terms of Germany.

Speaker:

It was actually I don't know if it still is now, but it

Speaker:

might have been a, I used to live in

Speaker:

Frankfurt, actually. Yeah. There was actually what they call a

Speaker:

sovereign cloud because there was concern that if it was a US company owning

Speaker:

a data center, that US courts would have jurisdiction there, which is a

Speaker:

brilliant move by a a past administration. I say

Speaker:

brilliant sarcastically in case you're didn't get pick up on that.

Speaker:

Where they thought that they could basically issue a a court order to

Speaker:

demand something from here in Ireland. And

Speaker:

Microsoft fought that because they realized, like, wait a minute. That would mess up our

Speaker:

entire that would cause a lot of problems. Yeah.

Speaker:

And, ultimately, they dropped the case before it was finally

Speaker:

decided. But in order that they could, thing, at one point,

Speaker:

anyway, this is actually owned by a German

Speaker:

company, managed by a German company, and it's leased to Microsoft to to

Speaker:

to have that concern. I think China also operates the same way.

Speaker:

Frank, I was commenting on the, satellites on the graphic

Speaker:

there. Oh, that they were moving around. Yeah. Yeah. They are moving very, very

Speaker:

fast. And it keeps they're moving with us.

Speaker:

But, I mean, keep in mind, though, like, keep in mind, though, that

Speaker:

we're just talking about data residency. There's other things that if you're building a real

Speaker:

solutions, other things to consider. Yeah. Right? Like And there's

Speaker:

a whole lot to that. And Yeah. Yeah. Oh, absolutely. And,

Speaker:

you know, part of the part of it is,

Speaker:

part of what what happens when you start kinda going back to

Speaker:

the data engineering, platforms and stuff that you use.

Speaker:

There are sound business reasons for not making a change,

Speaker:

and there are some unsound business reasons that will

Speaker:

confine you to not making a change. And I I think about this. I'll put

Speaker:

it in context of, of SQL Server.

Speaker:

Companies will come up and they this has happened, and I still have clients

Speaker:

running applications on old servers

Speaker:

because the company that so they

Speaker:

they serve, their clients include

Speaker:

enterprises that care an awful lot about

Speaker:

checking boxes and auditability and all of that stuff. Regulatory

Speaker:

type things, which is not bad. It's just the

Speaker:

way that it is. That's their their business demands something

Speaker:

like this. These companies were formed. They were stood up, and they've got SQL

Speaker:

Server 2,005 running or 2,008 or stuff that's been

Speaker:

out of the maintenance cycle at Microsoft for a long,

Speaker:

long time. And unknown it's also not a well known

Speaker:

fact that if you don't wanna upgrade to version x or

Speaker:

y, you can pay extra money, and Microsoft will maintain

Speaker:

and provide you patches. Right? There there's rumors that

Speaker:

there's at least as of a few years ago, there were still Windows 95 systems

Speaker:

that were, you know and that sounds absurd.

Speaker:

Yeah. But Well, you walked down the, entry to,

Speaker:

Delta flight, and there was one that's what was it? One is 97, I

Speaker:

think, sitting there. 98. Yeah. 98, was it? Yeah. One is 98.

Speaker:

Sorry. Yeah. One is 98, boxes sitting there for the longest time. They're still

Speaker:

I believe they're 1 to 7 now. Still. I I

Speaker:

saw XP. XP. You're right. It is XP. Yeah.

Speaker:

So, you know, you just I kinda noticed this, like, wow. I hadn't seen

Speaker:

that in a while. But it's it's not about

Speaker:

will the new technology run that

Speaker:

SQL Server 2,005 database. The answer is clearly yes.

Speaker:

Well, you can always virtualize something. Right? Like, that's something that, like I

Speaker:

mean, there's that compatibility levels help. Right. There's a number of things that

Speaker:

do it. But here's the kicker. If the application is

Speaker:

not certified to run on that

Speaker:

and you're for you can change it, and you be maybe you have changed

Speaker:

it and tested it and go, yeah. It works. We'll just move it to, you

Speaker:

know, SQL Server 2019 or 2022. We

Speaker:

know it works. But the people you're serving,

Speaker:

people who care way more about checking all the boxes and the regulations

Speaker:

being a 100% and auditable, they won't

Speaker:

allow you to. And it gets even more complex when that company that

Speaker:

originally sold you that software 20 years ago is no longer in

Speaker:

business. Right. So you have no path forward.

Speaker:

I mean, the only Or they get bought by another company that you

Speaker:

don't really like. Exactly. That's happened too.

Speaker:

Exactly. Then A lot of mainframe companies were brought up by I don't wanna

Speaker:

name names, but, like, were brought up, and they it really was, like,

Speaker:

ironically, because they what they do, they they knew they had them. And

Speaker:

ironically, a lot of mainframe migrations happened because of

Speaker:

that. Like, it was And so you've got, you know, you've got that

Speaker:

angle where people are sticking with older systems for whatever reason.

Speaker:

And it's, you know, it goes like I'm saying, my point is that this goes

Speaker:

beyond just the data engineering realm. There are

Speaker:

there are compelling reasons to use,

Speaker:

older software. It may not be anybody's, you

Speaker:

know, satisfactory answer, but it is, you know, those

Speaker:

reasons exist. And if you're the, you know, if you're a

Speaker:

developer who likes using the new shiny and learning the new

Speaker:

stuff, I'm one of those. That's why I'm teaching courses on fabric data

Speaker:

factory right now and watching as it kind of some

Speaker:

days it works and some days it doesn't. We've had that happen on a

Speaker:

number of deliveries this year, with that.

Speaker:

So if you read what I wrote about this and

Speaker:

you come away with Andy's against the new stuff, well, you're just

Speaker:

as wrong as wrong can be. That's not the case at all.

Speaker:

You know, it's an

Speaker:

interesting I mean, so back to the lecture at hand, what kind of kicked this

Speaker:

all off and inspired the stream was

Speaker:

this post where I think the short

Speaker:

answer is everybody's a little right. Everybody's a little

Speaker:

wrong. And as a consultant, you can appreciate these two

Speaker:

words. It depends.

Speaker:

Right? Because, like, you may want to upgrade to the new shiny. I know every

Speaker:

developer wants to do that. And I think the comment for some reason I can't

Speaker:

say is like basically hiring managers will put in a job description. All

Speaker:

there's also the other matter of job descriptions and, you know, job requirements

Speaker:

are. They're always a 100% accurate. Disconnecting

Speaker:

from consensus reality. Yes. I like to say.

Speaker:

But they may want someone with, like, say, ADF

Speaker:

and and and this, but then actually have them working on systems and SSIS

Speaker:

because the hiring manager knows that he or she may not have that open req

Speaker:

for a while and has a in the back of

Speaker:

the mind the idea of moving to that someday.

Speaker:

Sure. But realistically, for the next 2 years, you're gonna work in this site.

Speaker:

Yeah. I mean And there are still large

Speaker:

consulting companies out there that develop brand new

Speaker:

applications in SSIS, brand brand new data

Speaker:

engineering data warehouse. One worse than that or one better depending on your

Speaker:

point of view. A few years ago, I think it was on dotnetrocks. They

Speaker:

were talking about telemetry from Visual Studio. And this back

Speaker:

when I cared about Windows client development.

Speaker:

Basically, WPAF came out in 2006,

Speaker:

2007. Right? XAML.

Speaker:

No. Not XAML. Metro or modern

Speaker:

applications, UWP came out in 2011,

Speaker:

2012. Right? So there's been multiple

Speaker:

frameworks to write when and it's been a number since. But, again, don't really care

Speaker:

about those client development anymore.

Speaker:

Windows Forms is still the number one of

Speaker:

all those, like, ways to develop Windows applications that run on Windows

Speaker:

Mac. Windows Form is still accounts for 80% of development.

Speaker:

Wow. Something someone's going to like, please email me in

Speaker:

hate with hate mail, not hate mail, but like tell me the exact number. But

Speaker:

it was still. He's off. Well, and they kept saying in Visual

Speaker:

Studio 2005 came out. They said, this is it. This is the end of the

Speaker:

line for Windows Forms. We're not updating. We're not adding anything.

Speaker:

And the future is from now on. Right?

Speaker:

Until the future became something else. And then when

Speaker:

I last installed, I think it was Visual

Speaker:

Studio 2016, 2019.

Speaker:

There was improved. They added stuff to Windows Forms, which is kind of funny because

Speaker:

they said they never would. Yeah. But it just because they.

Speaker:

Demand. Right? Customer demand. Ultimately, customer demand

Speaker:

is what pays the bills. So you've got to be very mindful of that.

Speaker:

And, you know, if you if it's 2024 and you're writing

Speaker:

a Windows Forms app, I have questions.

Speaker:

You know, I'm not saying I disagree, but I have many questions.

Speaker:

Arguably, you could say the same thing for UWP or

Speaker:

WPF. Right? You know,

Speaker:

But, again, it really depends. Like, in the last time I had worked with

Speaker:

WPF professionally was it was for when I

Speaker:

was at a, between my stints

Speaker:

at Microsoft, and there was a, you know, there was

Speaker:

a customer who was a mortgage company, and they basically had their mortgage

Speaker:

intake form written in WPF. Right?

Speaker:

And they were having performance issues with it. And I I looked at the code,

Speaker:

and I was like, this is a good lesson, I think, is

Speaker:

that, you know, they loaded up, like, some 6, 700 controls

Speaker:

all at once. Right? Wow. Because there were a lot of

Speaker:

fields and but they were all collapsed and things like that. And I was like,

Speaker:

well, I'm looking at this, and I'm, like, testing it. And I'm like, oh,

Speaker:

dear god. This is gonna be a nightmare because it's 600 controls. You could do

Speaker:

lazy loading and things like that, but then Sure. There could be unintended

Speaker:

consequences there. And then then I happen to notice when I load the

Speaker:

app, the CPU spikes, but the GPU was

Speaker:

hardly touched, which the whole promise of WPF was

Speaker:

that it would offload as much of the rendering

Speaker:

Yeah. Over to the GPU as possible because it was basically built on

Speaker:

XNA, which was a gaming framework. But that, again,

Speaker:

different different sidetrack, and different lifetime ago.

Speaker:

So I'm like, what the heck is going on? So then I'm like, I looked

Speaker:

at the some of the machines. I'm like, they had the generic

Speaker:

GPU driver. So I'm like, just

Speaker:

for grins, let me see if I

Speaker:

can get the proper driver for this device.

Speaker:

All of a sudden the 30, 40 seconds it took to load that initial

Speaker:

screen went down to 5 seconds.

Speaker:

Wow. And that's a big jump. That was a

Speaker:

big jump. And that was like, and I said to the guy,

Speaker:

it's like, look, just installing this driver, you get

Speaker:

a, you know, massive increase in it. Right? Do you

Speaker:

really wanna architect it or are you trying to just you want this to be

Speaker:

faster? Like, what's considered acceptable? And he said, well, under 10 seconds would be

Speaker:

acceptable. And I was like, I could do you better. How about 5 and a

Speaker:

half? Right. And I showed him

Speaker:

and he goes, well, not everybody has a GPU in their device. And I was

Speaker:

like, well, like, you know, this GPU costs

Speaker:

about, I think, 189. For some reason, 189 is stuck

Speaker:

under $200. Yeah. And I'm like,

Speaker:

so you'd have to install it. You have to think about the labor of installing,

Speaker:

like, this cheap GPU. Right? And this, he goes, that's

Speaker:

fine. He goes, look. The cost of redeveloping and

Speaker:

rearchitecting and retesting this versus $200 per

Speaker:

box, plus whatever it takes for somebody to go in with a screwdriver

Speaker:

and update the drivers. Right. It was so like we

Speaker:

ended up not having to touch the code at all. Right? Yeah. It was just

Speaker:

a matter of a driver update, which nice. Because it was like a fixed

Speaker:

price kind of support contract. I was the hero because I

Speaker:

solved the problem with about 3 out 3 to

Speaker:

5 hours of work. Nice. And the customer was happy because

Speaker:

they didn't have to re architect anything. Everybody wins.

Speaker:

Everybody wins. I love it. Happens. But but it it's just it

Speaker:

just goes to show you, like, sometimes the most cost

Speaker:

effective approach isn't is to not

Speaker:

touch anything. You know, and it's although it was a

Speaker:

relatively small amount of money, and and,

Speaker:

manageable amount of time for the client,

Speaker:

sometimes in if you look at the, you know, kind of the the

Speaker:

big, performance tuning

Speaker:

picture. And I I ran into some of this, in the past where

Speaker:

Well, it's not always a happy ending. As as an engineer,

Speaker:

I want to, you know, to fix the the thing that I'm engineering. And if

Speaker:

it's software, then I wanna make the software perform better. And

Speaker:

if it's, you know so I went through one of those experiences and

Speaker:

then, number of circumstances where

Speaker:

but the end result was we we

Speaker:

threw money at it and

Speaker:

bought, better disks.

Speaker:

And I remember telling me about this. I remember you telling me this. Oh, it's

Speaker:

a specific long story, but, yeah, it's back from about 12 years ago. Yeah.

Speaker:

You were, like, just a long SSDs, and it got fast

Speaker:

enough. And so when I did the math on that was the

Speaker:

enterprise level project, there were dozens of people

Speaker:

tuning on a tiger team, and we did make it

Speaker:

go faster. And as a test, what I did was I

Speaker:

rolled the code back to where it was when we

Speaker:

started. I will say now though, tiger the term tiger

Speaker:

team after my Nobody knows what I mean. Sorry. I know what you mean. I

Speaker:

know what you mean. Of individuals, dozens of individuals focused on solving

Speaker:

a particular problem. It's it's like, Although in some cases

Speaker:

slurring. Focused on not solving a particular problem, but that's a story

Speaker:

for another day. Different different story. You're right. But, yeah, we

Speaker:

we did and and I rolled everything back and ran the old

Speaker:

code that we had optimized. You know, it ran super

Speaker:

fast on the SSDs, and it was running okay. It was acceptable,

Speaker:

barely, on, the spindles. But when we rolled it back, it

Speaker:

was, it was the same difference. We actually got a touch more

Speaker:

performance just off the SSDs. And when it you know, you do the math on

Speaker:

that. At that time, SSDs were rather new, and the amount

Speaker:

that I wanted was, not trivial.

Speaker:

I asked for it on a lark, and I was surprised when it showed

Speaker:

up. So she did work for Microsoft. Okay. I'm not

Speaker:

surprised. Kendra is, scary as much. She did consulting. We

Speaker:

Brent's name came up a a few times. She worked with Brent for a while.

Speaker:

She worked at Redgate. Not everyone knows what Redgate is, but they're kind of a

Speaker:

big deal in this in this situation. Yeah. Yeah. They're a big deal. So, like,

Speaker:

clearly, like, I I just find I'd love to get her, like, initial

Speaker:

opinion after factoring in all kind of all of this is that,

Speaker:

you know, ironically, Sync don't run after every shiny thing.

Speaker:

But the the thing that that guy that was originally learning was is

Speaker:

the new shiny thing, ironically. Like, so there's a lot of layers to this.

Speaker:

There's even if you take this kind of at face value of not knowing the

Speaker:

context, there's a lot of layers. But, like, beyond that, there's even more layers. Like,

Speaker:

it becomes this multidimensional problem. It's true. It's like an

Speaker:

ogre. Many layers. Many I was wondering when we're

Speaker:

gonna have a movie reference because it's been a while. There we go. Boom. Shrek's

Speaker:

a classic. Shrek is a classic. The

Speaker:

sequel is not so much, but that's a common case.

Speaker:

Sure. Very, very few Matrix movies looking at here. It was

Speaker:

you know, I I saw the, activity on this and it's best because

Speaker:

since I posted this, I can see how many people looked at that. The the

Speaker:

link I sent you for the other one is kinda the one that started it.

Speaker:

I don't know if you have that link in chat. I think If you wanted

Speaker:

to click on that. I have That was that was a

Speaker:

few days, maybe a week before this. That was the first one I

Speaker:

commented. It was similar sentiment and I shared, you know, again, I

Speaker:

joined the conversation and then, I can't believe is

Speaker:

SSIS dead? Yeah. That's the one. So that

Speaker:

particular one has gotten, like, probably 12

Speaker:

to 15 times as many views. It's happened to You know, you mentioned And

Speaker:

it was the first time I had piped up about anything like this. I'm I

Speaker:

commented on the original one. LinkedIn will pop this up if you write a

Speaker:

lengthy comment like like that one. And

Speaker:

I said, that window exist. Looks like Windows XP

Speaker:

era. Well, that's SSIS. That is

Speaker:

a funny. Like, it's just funny. Yeah. But, And I

Speaker:

commented when this LinkedIn popped up and said, hey. Do you wanna repost this

Speaker:

since you wrote this comment? And I said, sure. And it just pastes the

Speaker:

comment up at the top of the repost. But I can see, like, the number

Speaker:

of people that and that one drew a lot more,

Speaker:

a lot more comments. And like I said, 12 to 15 times

Speaker:

the views. And, and and I communicated

Speaker:

with the original author just to touch about that at all. It it

Speaker:

went nowhere near as I'm

Speaker:

trying to think of the right word. No nowhere. It it

Speaker:

didn't get nearly as heated. I'll say it that way. And it

Speaker:

may just be that, you know, that I'm saying heated, but there was

Speaker:

there was one individual in particular who just very passionate about the tools that they

Speaker:

that they used for. I I I wanted to ask,

Speaker:

that individual, you know, how

Speaker:

many SSIS packages they developed and put into production.

Speaker:

Because I I I I would again, I think I know the answer

Speaker:

to that. This in the this individual conversations like that.

Speaker:

Say that again? Which individual? Not in that one.

Speaker:

Oh. Not it's in the other range. The one we've been looking at. This angle

Speaker:

here, though. Right? Nothing against Kiwi

Speaker:

ETL pools. What do you Yeah. Appreciate so this is this

Speaker:

is an argument of GUI versus Yeah. Straight

Speaker:

code. I think that's also an interesting concept. Low code, no

Speaker:

code versus you know? I like them both, but,

Speaker:

you know I do too. It's for me, it's which solution

Speaker:

matches best and as a consultant who goes to work with other companies a

Speaker:

lot Right. A big factor for me is What they have. You know, what

Speaker:

happens when I'm gone? Can you support it? What what are you most

Speaker:

familiar with? Even if I'm working in a tech a technology

Speaker:

technician, a technology like SSIS, there's really a

Speaker:

couple of ways you can, float it. It. That's good, John.

Speaker:

2 was good. I'm referring to a 3, and I think there was a 4th

Speaker:

one. Oh, really? I didn't even know there was a 4th one. So there you

Speaker:

go. But in SSIS, you can take a more And

Speaker:

the matrix 2 was good as well. Matrix 2 was good as well. I was

Speaker:

That's true. Yeah. You you could take, like, a more

Speaker:

DBA approach, a more T SQL driven approach, or you

Speaker:

can take a more dot net driven approach because you have both

Speaker:

execute SQL tasks and SQL sources and destinations and transformations,

Speaker:

and you have a script task and a script component.

Speaker:

And, you know, which way do you go? Which is the right way to go?

Speaker:

Well, it depends. If you've got a bunch

Speaker:

of dot net developers being tasked with maintaining

Speaker:

the, ETL after I leave, no way. 5.

Speaker:

Good god. Shrek 5 or matrix 5.

Speaker:

Either one sounds like a terrible idea. You know,

Speaker:

whoever's gonna be left behind support net, you gotta make sure that Right. You've done

Speaker:

a a fair enough job of representing their preferences,

Speaker:

on that. And, you know, I I know

Speaker:

I know consultants who come in and say, you know, we gotta do it

Speaker:

this way and you're bad and wrong and you're, you know, you're gonna go out

Speaker:

of business in 18 months if you don't listen to me. Right. That

Speaker:

if I also think too, like, when I first read it I'm sorry. I don't

Speaker:

say hire somebody else. Right. I mean, I when I first read

Speaker:

it, I first read it as should you learn SQL?

Speaker:

Okay. Right? And that's a big debate in the data science community is should should

Speaker:

data scientists learn SQL because, you know, Python can do everything. Just your no. No

Speaker:

knock on my Python can do everything. Should you use Python for everyone everything, I

Speaker:

think, is another question. Right? And I think the answer is no. Yeah. I think

Speaker:

if you're a data scientist if you're a

Speaker:

data scientist or even an AI kinda engineer, whatever the word is

Speaker:

this week, you should still learn SQL

Speaker:

because this one, it's way less complicated than anything else you're doing. And

Speaker:

2, it's kind of the the the the lingua

Speaker:

franca of anything data or data interaction. Right? And

Speaker:

again, Frank, the context comes into play here. Right. There's the there's the

Speaker:

mechanical tool. It's describing a software, you

Speaker:

know, a language is mechanical is one one way of looking at it. But Right.

Speaker:

There's also the problem you're trying to solve. And if if you're gonna do

Speaker:

data exploration or managing or whatever you wanna call it,

Speaker:

then I don't really care which mechanism you use. But don't

Speaker:

tell me one of these mechanisms is better than the other. It

Speaker:

Without a qualifier. For this particular task.

Speaker:

Yeah. So it could be that, you know, one

Speaker:

of them is is good at this one particular thing, and I I would argue

Speaker:

this. I did in my newsletter. Oh, strike 5. Okay. In my

Speaker:

newsletter, I argued, every single tool has something

Speaker:

that they're stronger and there's there's some

Speaker:

feature or some aspect of it that's better than all of

Speaker:

the others. And they also have some weakness

Speaker:

that's worse than all of the others. It's true. And and so

Speaker:

you gotta, you know, you gotta strike that balance. It's gonna depend

Speaker:

on your use case, the parameters, the things that are important to you.

Speaker:

Sometimes it's cash on hand.

Speaker:

Sometimes it's the servers, hardware that you're forced to work

Speaker:

with because you're owned by somebody and they they're not upgrading. They're not

Speaker:

giving you that staffing concerns. The people, their

Speaker:

experience, their languages, that their Well, how easy is it to find someone that

Speaker:

necessarily knows SQL versus knows Python?

Speaker:

Or, the bill rate for someone that knows SQL is probably gonna be different than

Speaker:

the person who knows Pandas. I think so.

Speaker:

I I think there are differences there, but, again, it's gonna depend on the company.

Speaker:

That's true. You're worth our job market. Right. You're worth

Speaker:

more to, you know, this company as a SQL developer than, you

Speaker:

know, maybe than a Pandas developer is to that that company. That's a

Speaker:

possibility. Go go where you get paid the

Speaker:

most, but realize especially if you're new, and I think that

Speaker:

person that in Kendra's original, post that

Speaker:

she was, she was jamming on, that

Speaker:

that was a different scenario. You're right. There's a number of squirrelly things about that

Speaker:

original post, but if you're

Speaker:

a young developer and just getting started, you're in college. I've got a

Speaker:

daughter studying computer science in college right now. And my

Speaker:

recommendation to her is first, learn everything that you

Speaker:

can. Do as much as absolutely as you can.

Speaker:

Pick up that knowledge. But,

Speaker:

be aware that there's more to it than just the, you

Speaker:

know, the the the brain exercise you get and the

Speaker:

thrill you get from seeing the code execute. Keep that thrill. Keep that spark

Speaker:

alive. Right. Right. Right. Right. But, you know, real

Speaker:

realize there's often more to it. And some of those factors you have no control

Speaker:

over, and the person you're working for has no control over.

Speaker:

And, Ed, you know, there's just a number of things totally external to the experience

Speaker:

of writing code that often

Speaker:

impact the experience of writing code. That's very true.

Speaker:

That's very true. K. Actually, going on for,

Speaker:

like, 90 minutes. So Wow.

Speaker:

Goodness. It doesn't seem like it. I've had this much coffee, and I still don't

Speaker:

have to go to the bathroom. That's amazing. Christmas miracle.

Speaker:

Of you. It's a festive miracle or a Christmas miracle. I don't

Speaker:

know. Awesome. But, so

Speaker:

this is this has been great. I think we kinda got to the bottom

Speaker:

of this is that basically, it's a nuanced conversation.

Speaker:

There's no simple answer. I have many

Speaker:

questions though. Why if you're learning LLMs and now, why are you

Speaker:

calling yourself a data engineer? But Yeah. That's a different thing.

Speaker:

Could just be semantics at that point. Could be.

Speaker:

But thanks, John. Thanks, Merdad.

Speaker:

Thanks, Hector and SQL Dev

Speaker:

DBA. I'm sure that's not the

Speaker:

name on on the driver's license, but

Speaker:

you never know. You never know. I wanna get a license plate holder

Speaker:

that has, like a, like, drop table.

Speaker:

Like, so that way when they

Speaker:

take a picture, they do the OCR. Boom. I've seen

Speaker:

the bumper stickers with longer Right. Right. Right. Right. Right. Secret

Speaker:

hacks on them, you know, secret injection attacks. No. Hey, man. Thanks

Speaker:

for tuning in. And, he legally changed his name. That's

Speaker:

cool. Apparently, there's a number of people

Speaker:

who have, like, a last name, and their

Speaker:

last name is Null. Oh, wow. And I

Speaker:

was looking up on YouTube. Like, there's, like, a number of, like, problems that people

Speaker:

have. And my first thought was, wait. Wouldn't that only be the case if

Speaker:

they encased it in quotes or single quotes? Like, wouldn't it?

Speaker:

Apparently, no. Some of these systems Depends. I mean, you

Speaker:

wanna talk old systems. I'm sure DMVs have some pretty

Speaker:

ancient technologies that are still running.

Speaker:

I mean, I can only imagine. Robert

Speaker:

Tables. Yes. Little Bobby Tables. Little Bobby Tables. That's the

Speaker:

one. I tried to name my kids something like that. But

Speaker:

You got overruled if I remember correctly. I did in so many ways. I'm

Speaker:

glad I didn't go with the, initials of, x

Speaker:

a m l. Yeah. That

Speaker:

that You could have. Xavier Anthony

Speaker:

Marcus was, on the table. See and

Speaker:

Lavinia? That would just Yep. That would just flow. That would be

Speaker:

your role. Though, since XAML kinda died,

Speaker:

it's probably a good thing. That's true.

Speaker:

So XML is even not as in vogue as it once

Speaker:

was. Well, you know our mutual friend, mister Kevin

Speaker:

Hazard, aka the Duke of Hazard. The

Speaker:

Duke. He says he says JSON is just hipster

Speaker:

XML. He's right, though. And there's another format called

Speaker:

JSONL. What? Yeah. JSONLines.

Speaker:

Never even heard of that. It I it's only the

Speaker:

I didn't hear about it until the product I work on, Rel AI, actually

Speaker:

uses it to store our data. And, basically, it's a different

Speaker:

between JSON and JSONL? Basically, it's a long line

Speaker:

where each line is a record of or a chunk of

Speaker:

JSON. That's how I interpret it. I'm sure I'll get

Speaker:

corrected, but please correct me on that one. Interesting. Yeah. I'm like, Jason

Speaker:

now? Like, what the heck?

Speaker:

Jason is an acceptable one. That's true. Jason

Speaker:

would've been a good name. But I didn't want him to get

Speaker:

teased on Friday 13th for the rest of his life.

Speaker:

So so,

Speaker:

thanks everyone for tuning in, man. This was awesome. We should do it more often.

Speaker:

Thumbs up on, or, you know, be sure to like, share, subscribe. I gotta do

Speaker:

this to all the things. Like, share, and subscribe. I can show off,

Speaker:

this little graphic.

Speaker:

You you know, Frank, we could do this, we could do this

Speaker:

a lot more often if there anytime I stir up some trouble, we do a

Speaker:

a live stream Oh, god. We do it every day. I Yeah. That's what I'm

Speaker:

saying. You know? No. I think it's interesting. I think it's good because, like, you

Speaker:

know, it the controversy

Speaker:

I think people are starting I just look last time I looked at the thread

Speaker:

thoroughly, people were talking past each other. Yeah. Which I

Speaker:

guess defines all of Reddit, mostly Internet.

Speaker:

You know, it is what it is. Yeah. But

Speaker:

cool, man. I gotta actually, now that I mentioned it, I do have to go

Speaker:

to the restroom. See. You did it to yourself. I did do it to myself.

Speaker:

And thank you, Miranda, for turning in. And

Speaker:

I will play the outro graphic. Excellent.

Links

Chapters

Video

More from YouTube