Skip to content
Exploring Machine Learning, AI, and Data Science

*Live* Tis the Season for SSIS

In this livestream, Frank and Andy discuss the timeless nature of backend enterprise tech, that, much like a Christmas special from decades ago, is still very much celebrated.

Moments

00:00 Exploring SSIS future in a festive episode.

08:28 Data engineering evolved from business intelligence systems.

10:57 Social networks project before Facebook’s popularity.

19:19 SSIS training informed data engineering concepts teaching.

24:56 Bill Gates moved project to immature Microsoft tooling.

29:10 Data engineering possible in 2024 using T-SQL.

35:23 Huge cloud companies surpass previous brick-and-mortar giants.

40:10 Old technologies endure; misconceptions about their age.

46:03 Evaluate change benefits: technical ease, business growth.

52:30 Cloud departure interests rise, SSIS assistance sought.

55:47 Big government agency utilizing diverse cloud platforms.

01:00:59 Security is crucial; clients’ preferences vary.

01:08:56 Certification issues hinder software updates and compliance.

01:10:02 People stick with older systems for reasons.

01:15:15 Proper GPU driver drastically improved loading time.

01:22:16 Repost increased engagement and communication with author.

01:25:45 Data scientists should learn SQL for simplicity.

01:31:06 Obsolete systems cause issues without quotes.

Transcript
Speaker:

In this special holiday themed episode, we're diving into a topic

Speaker:

that's as classic as Christmas Carols, but just as divisive as fruitcake.

Speaker:

And that topic is the future of SQL Server Integration

Speaker:

Services, SSIS. But wait, there's a

Speaker:

twist. This episode was recorded live, so if you notice

Speaker:

a different vibe, some festive banter, and maybe even a change in

Speaker:

our usual musical interludes, that's why. Think of it as

Speaker:

the holiday party version of our usual data driven discussions.

Speaker:

Together, we'll explore why SSIS, despite its vintage

Speaker:

status, remains a cornerstone of data engineering and why

Speaker:

dismissing it might just be a data driven mistake. So grab your

Speaker:

cocoa, settle in by the fire or your nearest CPU,

Speaker:

and let's get festive with some data talk.

Speaker:

Well, hello, and welcome to franksworld.comstream.

Speaker:

And, with me today is Andy, and I'm

Speaker:

looking for the lower third that has us both. There we

Speaker:

go. Frank and Andy Frank Lavinia and Andy Leonard, host of Data

Speaker:

Driven, which I might turn this into a podcast. I might take the

Speaker:

audio and and turn it into a podcast. What do you think about that? That'd

Speaker:

be good kind of festive stream and also kind

Speaker:

of up to date on things. And it gives me some more time that to

Speaker:

put together another episode that I had a really great

Speaker:

conversation with a guy who does red teaming for LLMs.

Speaker:

Nice. So which I think is a growth industry

Speaker:

and certainly a wise career move.

Speaker:

Speaking of career moves. Good thing.

Speaker:

Oh, we have a first comment. SQL dev d b a.

Speaker:

Hey, SQL dev. Awesome. So

Speaker:

this is this may be the first time we've done this. This feature's been around

Speaker:

for a while. No. We did it once or twice before. Did we do it

Speaker:

on recent? Like, months. Yeah. That we've done. So we're sharing

Speaker:

our so Frank's audience, people that are connected to Frank,

Speaker:

they're seeing this. People connected to me are seeing this. It's like it'll

Speaker:

because it told me Frank started this, and, then he sent me

Speaker:

the link. And as I joined in, it it said, hey. You can

Speaker:

share this with with your on your channels as well. So I

Speaker:

was like, oh, yeah. Click that. Oh, you know what it is? We did it

Speaker:

the other way. You were the main, and then I shared it on my channels.

Speaker:

That's what happened. That's what happened. Yeah. Yeah.

Speaker:

Well, it's cool, though. If you've never met me before hello?

Speaker:

That's Frank. Frank digs data on the socials and,

Speaker:

franksworld.com, datadriven.tv, which hopefully you know about that,

Speaker:

and impactquantum.com. So that's me.

Speaker:

And, yeah. So back to the segue.

Speaker:

Yeah. I was talking about how security and AI is a

Speaker:

good career move. And we were talking about, speaking of

Speaker:

career moves, 'tis the season for SSIS is the title of the

Speaker:

stream. And this kind of goes,

Speaker:

I'm sorry. Come on, man. It's fun. Right? It is.

Speaker:

It's awesome. So so and I had kind of done,

Speaker:

2 livestreams on this already, but one of them for, like, 10 minutes, I didn't

Speaker:

catch the fact that I had no audio. And then yesterday, I did one for

Speaker:

2 minutes, so I didn't catch the fact that I didn't do the audio. So

Speaker:

I figured I'd bring the troublemaker himself onto here. Although, strictly

Speaker:

speaking, you're not the original troublemaker on this. Well,

Speaker:

I participated in it. I'll I'll own my my part of the

Speaker:

trouble. You'll own your part of the trouble. So so I definitely will. Yeah.

Speaker:

What's the background here? Well and and I'll

Speaker:

I'll do a plug for, for andylehner.blog.

Speaker:

And if you go there, you can sign up for my newsletter over on the

Speaker:

right side. It's kinda hard to read because the widget is a little

Speaker:

narrower than it needs to be. But if you if you do that or if

Speaker:

you just look up engineer of data, I think it's

Speaker:

engineer of data dot substack.com.

Speaker:

But I I put a newsletter out today kinda talking about it. Yeah.

Speaker:

There's the site. Thanks, Frank. No problem. And, you see the subscribe

Speaker:

to my newsletter down there on the right, and there's a box on

Speaker:

the left where you type your email address and then on the right, you click

Speaker:

it's free. And it'll it should take you, right over to

Speaker:

Subsec, which by the way, I started using this year. And so far,

Speaker:

I'm pretty impressed. It's it's been a a

Speaker:

really interesting, experience for me. So the

Speaker:

trouble here here's, here's where the

Speaker:

trouble happened. I I have been, reading. I

Speaker:

caught a couple of articles just every now here and then, mostly

Speaker:

on LinkedIn, where people

Speaker:

would express an opinion about, you know,

Speaker:

SSIS stinks. I don't like it. It's old. It's was

Speaker:

so much trouble. And, you know, and they would just

Speaker:

kind of kind of poo poo share their their negative thoughts about

Speaker:

Azure sorry. About SSIS. And

Speaker:

I've, of course, I've worked in SSIS since

Speaker:

really before it came out, I got to work on that Rocks book project

Speaker:

with Brian Knight and I remember that book. Yeah. Yeah. 10

Speaker:

of us. Yes. Back when Rocks would put your picture on the cover of the

Speaker:

book. And have a copy around here somewhere. Yeah. That

Speaker:

yeah. Thank you, Frank. You know, it just but it's

Speaker:

so I got yes. I got kind of a boost out of my career,

Speaker:

and I did an awful lot in SSIS for a long time. And

Speaker:

every now and then, I still do. I used to

Speaker:

deliver training, as part of solid,

Speaker:

solid quality learning is what it was called when I joined it. Solid

Speaker:

queue. After that, I worked with them for a few years and I delivered

Speaker:

training developed by Eric Veerman and

Speaker:

also did consulting gigs. And I learned a lot,

Speaker:

about both data engineering and SSIS while I was

Speaker:

doing both those things. When I left solid q, I

Speaker:

think I put about a year or 2 between me and,

Speaker:

you know, and the business. Actually, it was about two and a half years because

Speaker:

I went to work for Unisys then as a ETL architect. I remember

Speaker:

that. You're up in Reston quite a bit because that's where it was. Oh, yeah.

Speaker:

Yeah. Frank. Now an apartment complex now, that building. Oh, is

Speaker:

it? Okay. I think so. Yeah. Okay. So Frank and I

Speaker:

have been friends since the before times, even before SSIS came

Speaker:

out. And, Well, no. I think you had just written the book at the

Speaker:

time. I I'm trying to remember. So Just moved to Richmond just when I

Speaker:met. It was November of:Speaker:Yeah. Yeah. November:Speaker:

another mutual friend that I won't name, but we're all still friends now.

Speaker:

And the book actually was published in

Speaker:January, I think, of:Speaker:

Yes. That's right. So it wasn't it wasn't quite

Speaker:

ready for for prime time. But oh, sorry. The the

Speaker:

book wasn't out. It was going through the process, and it takes a couple of

Speaker:

months from the from the time all of the drafts are finished

Speaker:

until they they make a book out of it. It was my very first,

Speaker:

book project. And, yeah, I I'm pretty sure I was I was so

Speaker:

excited. I was telling everybody, I worked on a book. Oh, yeah. Yeah. Because it

Speaker:

was for the Richmond Code Camp, which was in May, April of

Speaker:Yep.:Speaker:Yeah. It was:Speaker:

Developers on a plane, and I had the guy I photoshopped the

Speaker:

guy carrying your book. That's right. I do remember that.

Speaker:

Yeah. I have to find that picture somewhere. I've been I've been using

Speaker:

SSIS for a a long time. I would say I

Speaker:

learned more about data engineering, the

Speaker:

field and did more projects probably

Speaker:

in, in data warehousing where I used SSIS

Speaker:

for for the data engineering, data integration. I think it's important to to

Speaker:

to, 1, explain for those who may not know what SSIS

Speaker:

is, and 2, explain that data engineering was not always seen as a

Speaker:

discrete,

Speaker:

profession or or Yeah. It's a data engineering's a

Speaker:

relatively new word to describe what we do. It was called

Speaker:

the part of business intelligence.

Speaker:

Back even before all that, I think the first term I

Speaker:

heard was data acquisition,

Speaker:

and it was in it was sometimes that was that phrase

Speaker:

was used standalone. The most often,

Speaker:

at the time when I and this is what got me into databases

Speaker:

was doing system control and data acquisition or SCADA

Speaker:

systems. These were manufacturing systems where you collected data from

Speaker:

instruments on the floor. You gotta remember, IoT

Speaker:

was, you know, still somebody's dream back, you know, in

Speaker:

the 19 nineties. IoT. It was just OT back then. It just was

Speaker:

OT. You're right. It's funny. Yeah. But

Speaker:

but we still did acquire, data from,

Speaker:

plant floors and instruments that were mounted all over, but they weren't

Speaker:

Internet enabled at that time. They were, most of them were hardwired. A

Speaker:

few were using wireless. And so that's kinda what led me

Speaker:

into this whole this whole field. And the idea of the

Speaker:

field, is of data engineering, data

Speaker:

integration as we called it back then, is we do that data acquisition part.

Speaker:

We go find wherever the data lives, we go find it there.

Speaker:

And sometimes the data is a very static

Speaker:

list. It it could be even a text

Speaker:

document, created in notepad that

Speaker:

is tab separated or, you know, delimited

Speaker:

by character position or something like that. And a lot of old

Speaker:

old lookups, lookup data was that way. And I'm not making

Speaker:

that up. It was maintained in a a text EDI.

Speaker:

EDI. Yeah. So electronic data interchange. Yeah.

Speaker:

So, yeah, EDI is I have an interesting stories about EDI, but but one of

Speaker:

the things that really kept me away from the data space for a long time

Speaker:

was I didn't wanna be DBA. And this work, I think, had traditionally

Speaker:

been kind of merged with DBAs. Oh, absolutely.

Speaker:

But at some point, I don't know exactly when it really

Speaker:

evolved into its own discipline. And I remember.

Speaker:

Go ahead. Because I remember I tried to get you a job at a particular

Speaker:

company. I remember that. And what do they do? And what was

Speaker:

it? Why do we need a DBA? You don't need a DBA.

Speaker:

Right. And I think that I'm not DBA. That

Speaker:

was the funny part. Well, that was the fun. Well, we clearly did because at

Speaker:

the time there was a project going on,

Speaker:

and I think the term data architect is what you just said. You were I'm

Speaker:

not a DBM data architect. And then that fell

Speaker:

on deaf ears. And, ironically, like,

Speaker:

not like a couple months later, there was a project that we worked on that,

Speaker:

so many stories, and I'm just trying to protect the innocent and the guilty,

Speaker:

and myself, from from from libels. But, basically,

Speaker:

there was a project going on that when it was basically kind of

Speaker:

behavioral analysis of social networks. Right? This is before

Speaker:

Facebook. I think Myspace was around that sort of thing. But it was basically the

Speaker:

idea of organizational networking as a discipline. And it turned out that the

Speaker:

the software that we bought off the shelf would actually query the

Speaker:

database, bring everything back in from the database,

Speaker:

and then run through the filtering on the C Sharp

Speaker:

components on the web server. Gotcha. So

Speaker:

long story short, there was 0 optimization, hardly an

Speaker:

index. I mean, it was just a mess. A data architect

Speaker:

will use the terms of the day, would have slot spotted this right away. We

Speaker:

didn't. And it was just a massive disaster. And it's kind of one of those

Speaker:

things where there were a number of projects that that company was taking

Speaker:

on. Basically, one of their one of their core

Speaker:

business models was was brilliant actually was software maintenance. So you have an

Speaker:

existing application offshore or outsource it outsource it to

Speaker:

us, and we'll take care of it for you. And,

Speaker:

you know, it was really like an an an education

Speaker:

in kind of Jenga programming. Right? Where you had they wanted updates

Speaker:

to this stuff, but they didn't wanna pay to redo it. So you kinda, like,

Speaker:

had to replace rip and replace stuff. And there's one particular

Speaker:

instance where there was a SQL query that took like 14

Speaker:

minutes to bring back an answer. And I'm like, it's only like

Speaker:

like 30,000 records. Like, what what's the deal here?

Speaker:

And turns out there was no indexes, no nothing.

Speaker:

Well, you know, those indexes take up space. Right? Exactly.

Speaker:

Exactly. I mean, this is like why you should save space.

Speaker:

Joke. That's a joke. For one reason or the other, like, there was there was

Speaker:

no index. And I was like, well, let's add indexes. And like, no, no, no.

Speaker:

We can't change the schema. Okay. So what I end up what I end

Speaker:

up doing was creating temporary tables with indexes

Speaker:

and then copying all the data, and I still got it down to 2 minutes.

Speaker:

Nice. Which 1 minute and 59 seconds was copying the data,

Speaker:

and then one second was actually changing. Yeah. So, like, it was it was kind

Speaker:

of like what I call Jenga programming or Jenga architecture. You had to like they

Speaker:

wanted updates, couldn't touch too much, couldn't change anything,

Speaker:

couldn't improve anything because it was just it was a

Speaker:

time in my career that I think back of and I've kind of learned

Speaker:

many lessons, both hard lessons and soft skill

Speaker:

lessons. But Sure. But we digress. But, I'm just

Speaker:

gonna I'm just gonna take that answering your question in my usual

Speaker:

long winded way. SQL Server Integration

Speaker:

Services, came along, and it was probably

Speaker:

the, again, it was the thing that

Speaker:

spanned the longest part of my career. Before that, I worked with something called

Speaker:

data mirror. That was the first, I'd I'd

Speaker:

say the the first system like that. First bit of software that

Speaker:

way. Before that, I was writing my own. So I

Speaker:

was reading from these plant networks and writing to all

Speaker:

sorts of stuff. And I got into SQL Server because

Speaker:

I crashed access back in the nineties. So I ran,

Speaker:

I collected a 1,000 points of data every second for a long

Speaker:

weekend. And I wanna say the access file grew to about 4 gigs.

Speaker:

When I went to open it and start doing some analysis on it, it turned

Speaker:

out it wouldn't open. So 4 gigs is nothing now. Right? You

Speaker:

can do that on a smartwatch. But back then, a server

Speaker:

struggled, to open the file system. If you go back far

Speaker:

enough, would have freaked out or anything over certain size unless it was, like, NTFS

Speaker:

or something like that. Right? Yeah. And this this

Speaker:

wasn't. This was, one of the other OSs. But

Speaker:

so, you know, I went I went to, altavista.digital.com

Speaker:

and typed in Microsoft database, and I saw this

Speaker:

listing for something called SQL Server, and that's how it all started.

Speaker:

Well, then I I got got in as, working

Speaker:

on a data warehouse, and part of my job moved

Speaker:

into the database part of it. I actually was hired to do the reporting piece

Speaker:

of it, and lots of cool lessons learned there as

Speaker:

well. But on the database side, they use Data Mirror. I

Speaker:

think that company is still around. I'm not sure. But this is like 25 years

Speaker:

ago. And it was it was so cool,

Speaker:

and I was fascinated that somebody had built software to

Speaker:

orchestrate this collection of data. I was like, wow.

Speaker:

That is a good idea. You know, it always makes me feel better, Frank, when

Speaker:

smart people come up with an idea that I've also come up with independently.

Speaker:

It makes me feel like, okay. Maybe I'm onto something. Go through all of

Speaker:

that, data transformation services or DTS, and then finally, SSIS

Speaker:

and this big block. And

Speaker:

what I've noticed and I kinda noticed this trend started

Speaker:

maybe 4 or 5 years ago. I people complained about SSIS before

Speaker:

that. Don't get me wrong. And a lot of it is

Speaker:

because are you sitting down? It's

Speaker:

hard. We're not making it up.

Speaker:

Comparatively, though, like, I I remember when I was at

Speaker:

barnesandnoble.com and which just goes

Speaker:

back a ways. So if you bought a magazine at Barnes and

Speaker:Noble between:Speaker:

2012, 13, you

Speaker:

you interacted with the system I wrote, nice in the late

Speaker:

nineties or at least part of it anyway. So, you

Speaker:

know, that's how I learned EDI, right? Because we get these feeds

Speaker:

from publishers, literally a mainframe would dial up another

Speaker:

mainframe, download the file over a modem.

Speaker:

And, and this is how it worked. And what we did was we pulled down

Speaker:

the raw EDI files and I parsed it

Speaker:

and I had to do that and drop it into an informix database. So it

Speaker:

was a cool writing for GL scripts to to to take

Speaker:

that data in text format and then dump it

Speaker:

into an actual. You were doing data engineering. I was

Speaker:

doing data engineering, which is kind of funny. But like, you know, data engineering as

Speaker:

a discipline is not easy. Right? So SSIS being hard.

Speaker:

I mean, you know, brain surgery brain surgery is hard too. Right?

Speaker:

You you make a good point about it. And it, you know, it took me

Speaker:

a while, especially teaching it. And I would do

Speaker:

4 or 5 day course, originally with solid q and then

Speaker:

eventually on my own. I I wrote my own course. I

Speaker:

found myself adding to Eric's content when I would deliver the

Speaker:

material here. And don't get me wrong. Eric

Speaker:

is still a genius. He was then and he still

Speaker:

is. I just I I had a way of approaching

Speaker:

some, demos and examples that I felt kinda added

Speaker:

to the clarity of the information we were sharing. I

Speaker:

kind of expanded that out and wrote all my own material, my own I

Speaker:

use my own data, that I collect as part of my, weather station

Speaker:

here. And to this day, there are students that

Speaker:

are going through, recordings of that class.

Speaker:I made were back in December:Speaker:

and I recorded 3 courses on SSIS. The 4

Speaker:

day from 0 to SSIS course was, you know, will take you

Speaker:

from if you can spell SSIS to being a

Speaker:

functional, advanced beginner, low

Speaker:

end intermediate developer. And it was built for

Speaker:

that. It's got labs 13 12, 13 labs

Speaker:

that you do in 2 days, of that course. And then it talks

Speaker:

it kinda changes gears and goes to the care and feeding of SSIS

Speaker:

and ancillary topics. So

Speaker:

I learned a ton about the concepts

Speaker:

of data engineering on while as

Speaker:

while doing SSIS training and consulting and

Speaker:

development. So when I teach it, Frank,

Speaker:

I share these concepts that

Speaker:

I learned. Because you gotta keep in mind, this all came out around the same

Speaker:

time as a data warehouse toolkit book, by,

Speaker:

Kimball and his crew. And the in

Speaker:

fact, I don't know what the relationship was between Microsoft

Speaker:

and Kimbell, but I do know from the horse's mouth

Speaker:

that the, data flow task in SSIS

Speaker:

was modeled to load, Kimball data

Speaker:

warehouses. There's just a lot of functionality baked right in

Speaker:

that, you know, targets those star schemas, and, you know,

Speaker:

it's it's built to do that. There's so, you know, there

Speaker:

was that aspect of it. So at the same time, I'm reading

Speaker:

and learning and, you know, and then going out and teaching

Speaker:

and, you know, and and consulting. There's

Speaker:

this nice amalgam going on. I'm getting information from books.

Speaker:

I'm applying that information on consulting gigs. I'm

Speaker:

figuring out new ways to solve, you know, problems I hadn't

Speaker:

seen before, And then I'm training. So I'm just

Speaker:

rolling all that together. When I do the training, I'm sharing with people, hey. Here's

Speaker:

some first principles, if you will Right. Of data engineering.

Speaker:

And we call it data integration and BI back then.

Speaker:

And star schemas and why you use them and how they work and, you

Speaker:

know, kind of the trade offs that you get. Data explodes a little

Speaker:

bit. Talking about concepts like staging, data,

Speaker:

the benefits of it, why you like, how

Speaker:

you would wanna build your staging tables.

Speaker:

If you're reading from a flat file, everything in

Speaker:

that file is text. Now the text may be

Speaker:

numbers. It may be dates, but it's really just text.

Speaker:

So you built the stage tables with and bar charts.

Speaker:

So at, you know, stuff like that because you wanna get in and get out

Speaker:

just quickly. Memory than the way to do it would be in memory and then,

Speaker:

like, do validation as you do the insert and things like that. There's a there's

Speaker:

a 100 different ways to slice that. Yeah. There really are.

Speaker:

But, you know, when you did, that was that was just pieces and parts of

Speaker:

saying, okay. You know, Tim, I'm teaching you how to use this mechanism,

Speaker:

if you will. Right. SSIS. But I'm also sharing with

Speaker:

you how you would use it and then why you would

Speaker:

use it that way. And, you know, so there's more to

Speaker:

it than just the data engineering. And the point I wanted to make thanks,

Speaker:

Hector. Merry Christmas to you too, Hector. The data

Speaker:

engineering all by itself, just that world,

Speaker:

that's hard all by itself. Yeah. Absolutely. And then the tool

Speaker:

itself was extremely

Speaker:

flexible. And, you know, from the years that you and I have been sharing

Speaker:

about stuff, anytime you say it's it's flexible, you're

Speaker:

also saying, the the it's a sonic way of

Speaker:

saying it's complex. Right. And if

Speaker:

it wasn't flexible, people would say that it's too simple. And, like, it's just

Speaker:

one of those things where now that I'm in a job where I am in

Speaker:

a on the product group, what Microsoft would call PG, a product group or or

Speaker:

team. Yeah. We call it a BU. I I understand. Like, there's

Speaker:

only so many hours in a day that you have engineers and

Speaker:

there's time to market. You have to kind of make these trade offs.

Speaker:

And, you know. That's it. I mean, that's that that I mean,

Speaker:

I had this real eye opening moment with with I think suspect was the guy

Speaker:

who introduced us, who was an evangelist at Microsoft back in the day.

Speaker:

And, you know, I wanted some new shiny feature in Visual

Speaker:Studio:Speaker:

And he kind of pointed out like, look, even Microsoft has limited

Speaker:

resources in terms of people, time and testing and material and

Speaker:

things like that. And I was like, you know, I mean, my god, if Microsoft

Speaker:

has that problem, then I guess everyone has that problem. You know? It turns out

Speaker:

they're just a bunch of software developers just like the rest of us. Turns out

Speaker:

they're all humans. Although maybe now it's mostly AI. Who knows? But,

Speaker:

it's getting there. So so so, you know, I think we both kind

Speaker:

of set the stage for the controversy here. Right. SSI has

Speaker:

been around for at least 20 years,

Speaker:

maybe 25 and SQL Server itself. Let's let's remind folks the

Speaker:

history of SQL Server. It was originally who was it a

Speaker:

partnership with? Sybase? Yes. I believe it was a

Speaker:

Sybase product, completely. And I don't know if it was like And it was like

Speaker:

version 6. Got into the mix, and there was a collaboration

Speaker:

or something, and then they ended up with it,

Speaker:

owning it. That's my best guess on it. I actually

Speaker:

I I know I haven't spoken to her in a while, but I was I'm

Speaker:

friends with and and have co worked with, with

Speaker:

Caitlin Delaney. And she was

Speaker:

with Sybase. Oh, okay. Yep.

Speaker:

So, you know and did we have her as a guest on the show? I

Speaker:

know we wanted to. We we totally need to because that would be Yeah. Interesting

Speaker:

story because I first heard of SQL Server when I was at Barnes and Noble

Speaker:

because at the time we were ready to launch in 19 this is why I

Speaker:

left Barnes and Noble. We're ready to launch by Christmas of 96 with a

Speaker:

Yeah. Linux or Unix based based system based on Spark, Oracle,

Speaker:

and a few other things. No. I'm sorry. 4 g l. It was Ultimate

Speaker:

Formics. And, you know, we had the hardware. We had

Speaker:

everything set up. And then as the story goes, Bill

Speaker:

Gates and, one of the Riggio brothers who was the CEOs

Speaker:

of kind of co CEOs of Barnes and Noble at the time.

Speaker:

Bill Gates had kind of I don't know what he'd done, Jedi Mind Trick.

Speaker:

In August, September of, like, 96,

Speaker:

basically said, no, we're ripping everything we've built so far and we're moving

Speaker:

it over to Microsoft tooling, which at the time was not really mature. I

Speaker:

mean, it was this is like inter dev. I think we had a beta version

Speaker:

of visual inter dev. Yeah. Yeah. Which

Speaker:

was not the best product at the time. Right? It was, you know You know,

Speaker:

I used it At the time. At the time.

Speaker:

Yeah. I I used it, and if you came

Speaker:

from, like, cold fusion or some other development platform.

Speaker:

Yes. Was also awful. Yes. But yeah. So

Speaker:

So I started on inter dev. In fact, that was the first tool

Speaker:

that I I remember downloading for, Visual

Speaker:

Studio. I don't think I downloaded it. I think I went somewhere and bought a

Speaker:

CD or something. Yeah. Yeah. I think I found it in her dev 97 CD,

Speaker:

which was the the second or third version. But, I mean, I we we had

Speaker:

everything written in per on CGI Pearl scripts. Like, we had everything,

Speaker:

and it was just a very different era. But my

Speaker:

take was and this was my I was at the meeting with the CEO and

Speaker:

everyone else. Like, if we don't launch by this Christmas,

Speaker:

people are not going to use us as a habit. Amazon will

Speaker:

take the mindshare and this and that. And then then the

Speaker:

CEO said, sit down, s t f u. Basically, you don't know

Speaker:

how to sell books. You may know technology, but you don't know how

Speaker:

to sell books. Now we can look back at Jeff

Speaker:

Bezos' super yacht and his, you know, moon

Speaker:

missions and all that. These guys have super yachts and moon missions.

Speaker:

Right? They do not, actually. Oh. And my well, I

Speaker:

mean, I'm pretty sure they live in an oceanfront thing in Long Island. But,

Speaker:

he didn't know anything about selling books online either. So I can kinda I

Speaker:

can sit back here, you know, some, you know, good God almost 30 years

Speaker:

later and kind of be smug about it. Right? Right. But

Speaker:

it's just it's just funny. Right? Like, so so what's interesting is and I think

Speaker:

this really cuts to the bone of what this controversy is. And I

Speaker:

have the thing queued up. I can kind of show the screen where you posted

Speaker:

it, where

Speaker:

the fundamentals haven't really changed. Not at all.

Speaker:

Right. Yeah. Binary is still binary.

Speaker:

The debates about schema optimization and things like that are still

Speaker:

very much the same today as they were

Speaker:

20 years now. The numbers are bigger. The stakes are arguably bigger.

Speaker:

But for the most part, the fundamentals haven't changed. And and

Speaker:

I would say this is really where it kind of boiled down to. And this

Speaker:

is this is where the controversy starts. So buckle up, kids.

Speaker:

Let's see. I will share the screen. There's actually

Speaker:

2. I think you talked about one of them. The choices? I'm only

Speaker:

aware of 1. This is the this is the one post,

Speaker:

and I'll drop you the link, to to one of

Speaker:

the others. Right? Yeah. I'll put it in a chat.

Speaker:

I'll I'll send that to you here. Just a second. Along can can understand. So

Speaker:

this is what I saw. And it was basically Kendra

Speaker:

Little, who was a I would say legendary. Scary

Speaker:

smart. She's legendary in in in in the sequel

Speaker:

kind of family, right? Hashtag sequel family. Is that still a thing? I

Speaker:

think so. She's legendary. She used to work at Redgate. I think she worked at

Speaker:

Microsoft, too, at a time.

Speaker:

I think so. But I'm not positive. Well, we can look at LinkedIn. If only

Speaker:

we had that information. But anyway,

Speaker:

so you basically so if you read this and she says

Speaker:

so it says strong disagree. Don't run after every shiny

Speaker:

thing. Again, that is good advice. But, Lord, I would assume

Speaker:

that is her saying. But Lord, don't learn SQL Server and

Speaker:

SSIS if you want to be a data engineer. That's 2 decades too

Speaker:

out of date. Sincerely, a SQL Server expert. I think that's

Speaker:

a bit harsh. She's right about this part. Don't try to change chase out

Speaker:

there if you show anything. So apparently, I can't, and I

Speaker:

can't select a thing. So I read that,

Speaker:

and and I know there's more controversies that are in there as I as

Speaker:

I look at the thing. And you said I humbly submit

Speaker:even in the year of our Lord,:Speaker:

using T SQL, this foul year of our

Speaker:Lord,:Speaker:

T SQL, SIS, ADF, Fabric Data Factory

Speaker:

and other technologies supported by Microsoft, which I thought

Speaker:

clearly Microsoft's not going anywhere. Right? Yeah.

Speaker:

And so I basically said

Speaker:

fundamentals never grow out of style. Then I think I wrote again somewhere

Speaker:

like when I looked at the context of it because that's

Speaker:

not what you're supposed to do apparently in social media. You're supposed to react right

Speaker:

away. I did that, by the way, Frank.

Speaker:

I'm guilty. I did not go look at the context. So this is the

Speaker:

original context. I well and, you know,

Speaker:

you pointed that out and I'll I'll be honest, I I'm still

Speaker:

running on second hand information. I have not yet clicked it and gone back

Speaker:

to, to our guest post. Now I can see

Speaker:

it. Now you can see. So so this is what struck me is, well. This

Speaker:

is what struck me as odd. And I know we had talked about it and

Speaker:

I had talked about it. You talked about it. We talked to each other about

Speaker:

it. You know, we talked to our dogs about it. I don't know. Like, but

Speaker:

like, it was kind of like so so when I read the thing, it gets

Speaker:

even stranger. Right? So Yeah. He was talking to

Speaker:

someone, and I guess strictly speaking, even this is secondhand knowledge. Right?

Speaker:

But, so that's the data

Speaker:

scientist in there. Like, well, strictly speaking, this data is also all right. So

Speaker:

so look looking to someone to get a job as a data engineer. Okay?

Speaker:

Right. Unfortunately, he was learning about LLMs and other ML stuff.

Speaker:

I'm like, that's not data engineering. That's a

Speaker:

AI engineering or data science type work. That's more like I think he's he's trying

Speaker:

to set him straight from that. He's like, you're learning the wrong things. That's how

Speaker:

I read those two sentences. I mean, I would say you're learning the right things

Speaker:

if you wanna be an AI practitioner.

Speaker:

Yeah. But I wouldn't call I wouldn't, you know, read up on Langchain,

Speaker:

you know, Ollama and anything LLM and all that stuff

Speaker:

and then call myself a data engineer. I mean, that's Yeah.

Speaker:

That's like a cardiologist cutting up you know, doing your taxes. You

Speaker:

know what I mean? Like Sure. Or or cutting open your brain. Like,

Speaker:

I mean, I suppose there's some similarities, but it's not

Speaker:

the same. Well, I I do like bullet number 1.

Speaker:

Yeah. You know, let's see that. Yeah. This is something I think

Speaker:

that you point out quite a bit. So when you give your talks, either on,

Speaker:

SSIS or ADF, you ask

Speaker:

people, like, how many people here have workloads running in the

Speaker:

in the cloud or right? And then only a quarter of the hands

Speaker:

go up. Well, it's it grew to about

Speaker:

40% the last time I did it, but it's been over a year

Speaker:

since I since I spoke live and asked that question, ran

Speaker:

that little survey. There's a slide usually hidden in,

Speaker:

all of my presentations that has survey up near the very

Speaker:

top. Right. You know, it just and that's that's what the survey is about.

Speaker:

And often, especially

Speaker:

say the last, I said it's been over a year. So let's say from a

Speaker:

year ago and then back maybe 4 years of asking that

Speaker:

question. Almost every time I did that and people

Speaker:

didn't see everyone else's hand go up with theirs,

Speaker:

the those people would come up to me at the end. And usually, their

Speaker:

first comment was, I didn't know

Speaker:

that it was most of the people here were not doing

Speaker:

production jobs in the cloud at this point point with data. I thought we

Speaker:

were way behind and we're the only ones. And my response would be

Speaker:

2 fold. The first would be, that's because Microsoft

Speaker:

marketing is doing an astounding job. That is not a swipe

Speaker:

at Microsoft Marketing. If anything, they deserve a raise

Speaker:

because they were so effective at communicating

Speaker:

how cool this is Right. And how these larger

Speaker:

companies are doing it. You all of the big shows, keynotes,

Speaker:

There's some list of big companies, and they're almost all of them or

Speaker:

companies that you'd wanna work for because it's prestigious.

Speaker:

That's so I don't know if you want it on my personal market. Seem like

Speaker:

everybody's doing it. And I I know I know for a fact it's not always

Speaker:

true because when I worked in the sales for Microsoft, we

Speaker:

would encounter them and there was a pejorative term used internally called server

Speaker:

huggers. Okay. Right. Because like, oh, they're

Speaker:

server huggers. They'll never go to Azure. Right.

Speaker:

So so now, you know, I used to see that it's server hugger as a

Speaker:

pejorative. Now in light of kind of maturity and

Speaker:

working, with more customers and being

Speaker:

more aligned in the open source kind of realm and dealing with

Speaker:

international customers who have very real regulatory concerns.

Speaker:

You're right. Call them smart. Right. It's not, you know,

Speaker:

I didn't so much drink the Kool Aid is I became one with

Speaker:

the Kool Aid. You couldn't tell where I ended and where it began, where I

Speaker:

kind of had this deep programing experience

Speaker:

of. Yeah. That's not always the answer. Right. And I

Speaker:

think that dealing with LLMs and AI and things like

Speaker:

that, I think really makes that more obvious.

Speaker:

Right? Yeah. I totally agree with that. And, you know,

Speaker:

to be fair, and I wanna start with, you know, with being as

Speaker:

positive as I can about this. If I was It's not a negative on any

Speaker:

from scratch. Wasn't. No. I I'm just saying. But if I'm starting

Speaker:

today, day 1, and I wanna go, be a

Speaker:

starter company and and work with data, I it

Speaker:

would be foolish. Foolish to start

Speaker:

today and not go to the cloud. Absolutely.

Speaker:

So and and the reasons are numerous. Yeah. Here's the

Speaker:

thing. The companies there are a handful of

Speaker:

companies, really large companies, mind you,

Speaker:

that have started sent in the cloud age. Let's just call it

Speaker:

that, or the Internet age. There's a small number of them

Speaker:

that have gone on to be huge, but they are really huge.

Speaker:

They're overpowering, oversized. They're larger than the

Speaker:

companies that are previous to the Internet age companies

Speaker:

that have made their way into the Internet. And that's that's

Speaker:

not an accident. However, those

Speaker:

companies, the brick and mortar companies, are the companies calling consultants like

Speaker:

me and asking me to help them either

Speaker:

transition from a purely on premises

Speaker:

environment, managing their data into a cloud environment

Speaker:

or the and back before that, in 20 years ago, when I was first

Speaker:

getting called to do this kind of work, they were just trying to figure out

Speaker:

how to collect their data and then analyze it. And

Speaker:

so, you know, SSIS was a great way to do that. T

Speaker:

SQL was everywhere. Azure Data Factory didn't exist. Yes.

Speaker:

Much less Fabric Data Factory. And so we were just trying to solve

Speaker:

this business problem. And I was trying to couch couch my responses,

Speaker:

especially there was a thread that that got combative, I

Speaker:

would say. And, you know, as we went went down

Speaker:

through that, and I kept trying to say, and

Speaker:

I did. I said over and over again that, you know,

Speaker:

my job is to go help solve these business problems.

Speaker:

And what I meant by that opening line,

Speaker:

that T SQL, SSIS, Azure Data Factory, Fabric Data

Speaker:Factory, even in:Speaker:

data engineering. I I meant that, and I'm not back backing

Speaker:

off that for one minute. I I misunderstood the context of the question,

Speaker:

and I didn't really understand until I listened to your stream

Speaker:

last night where you had gone back and done what I should have done and

Speaker:

read the original post. And you said, yeah. It's kind of a mixed mesh

Speaker:

post. The guy's talking about data engineer, but he's also talking about LLMs

Speaker:

and machine learning. And in the middle of that, he

Speaker:

throws out, you know, this comment about SSIS,

Speaker:

how 90 99.5% of the companies are still using. I

Speaker:

think that estimate is high. I I think it was more of

Speaker:

a, let's make this point that there's still a lot of companies out there

Speaker:

using, T SQL and SSIS to

Speaker:

accomplish this. And this is something that I can't find the comment that I put

Speaker:

in there. I'm looking for it now, but. Yeah. Some of the

Speaker:

comments I can't get to anymore. I don't know why. Maybe they were

Speaker:

reported or maybe they're. Who knows? Right. I mean, social media

Speaker:

does weird things to people psychology. But the

Speaker:

point that I think that I wanna say

Speaker:

that Kendra overlooks. I think everyone overlooks it.

Speaker:

Data and back end systems have a

Speaker:

longer shelf life. And I say

Speaker:

this as someone who was, what, 10, 15 years ago,

Speaker:

strongly ensconced in client development. Right? Whether it was your

Speaker:

Windows, Windows Phone, or other types of Windows

Speaker:

based devices. Right. Or web development. Right.

Speaker:

Those technologies turn over pretty quickly.

Speaker:

Right. You know, you're likely to get

Speaker:

multiple updates per year on a device phone, like an app on

Speaker:

a device, but you're likely to never see,

Speaker:

a radical change or redesign. You'll you'll see a

Speaker:

radical change or web redesign of a website or portions of a website

Speaker:

couple times a year maybe. Right? But you're never gonna see

Speaker:

a radical redesign of a data back end system,

Speaker:

but once or twice a decade. And It's true.

Speaker:

Yeah. And mostly what drives that is scale,

Speaker:

not features. Right. Not features. It's just date or just tend yeah.

Speaker:

Exactly. Right? So if you a 100 x and who could who could accounted for

Speaker:

that, you know, going to the project started. It's a problem. Still a

Speaker:

wonderful problem to have, but but a problem nonetheless. Well, and there's

Speaker:

also the fact that, you know, it's 2,000 whatever now, and

Speaker:

there's still mainframes running. Right? There are still not not not to to

Speaker:

knock on IBM too hard because they are the company of Red Hat. But,

Speaker:

d b 2 is still around, still getting updates. Still backbone of

Speaker:

many Fortune 100 companies that also share the stage with Satya

Speaker:

at these big Microsoft events too. Right? Like Which was mind blowing

Speaker:

for people from the old days of Microsoft. Right? Well,

Speaker:

that's a whole other thing. But, like, you know but, I

Speaker:

mean, it it really boils down to, like, these technologies have a longer shelf

Speaker:

life. So if something is 20 I think we get

Speaker:

hung up. 1 of the threads sub threads in here gets hung up on, you

Speaker:

know, 30, 20 year old technology. We're thinking that, well, you know,

Speaker:

there's a meme of the the little monkey puppet, like, you know,

Speaker:

giving a side eye and then goes like a cringe face, like, and a side

Speaker:

eye where it's like, oh, Windows is, you know, I don't know, 40 year old

Speaker:

technology. And I'm thinking, like, some, you know, Unix people or Linux slash

Speaker:

Linux people are, like, 40 years old is old. You

Speaker:

know? I mean, this stuff goes back much further. So it's but

Speaker:

it's still like and that's not a knock. It's just

Speaker:

No. It's just now that we're in this

Speaker:

industry now for as long as we've been in it and the

Speaker:

industry's been around longer this long, there's just

Speaker:

stuff that is gonna just start aging out, but it doesn't age out as

Speaker:

quickly as we think it does. It's not like it's not like the iPhone. Sure.

Speaker:

Right? Where you the iPhone I don't know what number they up

Speaker:

to. 16, 17. Right? Oh, well, suddenly my iPhone 15 looks bad, and that

Speaker:

happens every year or 2. You this you don't see that in

Speaker:

database systems. Right? The only impetus to really move, say, from, like, SQL Server

Speaker:to:Speaker:

And that's a whole big project. Yeah. The maintenance cycle. So it goes out

Speaker:

of maintenance. And then you worry that if something crazy happens,

Speaker:

you can't get support for it. And that's

Speaker:

kinda like, you know, it's it it's sort of it I'll say this.

Speaker:

It's analogous to your phone starting to run slow for some unknown

Speaker:

reason. That's funny. Something something on SQL.

Speaker:

Yeah. Something something. Sybase something. Well, and you think about all the

Speaker:

I mean, I mean, and contrary to this, contrary to that statement of these things

Speaker:

have long life shelf lives. Yeah. Is the fact that I mentioned

Speaker:

Informix earlier. Raise your hand if you heard of Informix. Right?

Speaker:

So I've heard of Informix. You've heard of Informix? I mean, we don't count.

Speaker:

But but, like no. But, like, I remember my first

Speaker:

experience with Informix was because some alum of Fordham had

Speaker:

because it was a big shot at Informix. And, and I think we had somebody

Speaker:

who was also a big shot at Silicon Graphics. So we had SGI machines

Speaker:

running at Formix. Right. So I remember my first UNIX I used was an

Speaker:

IRIX system. Right. Which most people today

Speaker:

wouldn't even know what what that means. Right. And, you know, but Informix is

Speaker:

out of business. Sybase is gone.

Speaker:

I can't even think of other names. I know there's more.

Speaker:

Right. But really, the only things that it those have probably been

Speaker:

migrated to SQL Server or Oracle. Well,

Speaker:

or some form of Postgres or something like that. And I

Speaker:

I hear you. You know, there's there's an argument to be made

Speaker:

for, you know, the the cost of maintaining

Speaker:

old software. Right. There there definitely is.

Speaker:

I'll say this about SSIS. I if you learned

Speaker:

SSIS in probably in 2,005 era, between

Speaker:

2,005 and 2,008, that engine

Speaker:

I I don't know how many lines of code were

Speaker:ged before it was upgraded to:Speaker:

or r two, but it changed. There were some performance tweaks in there. It was

Speaker:

obviously, faster. And then again, that happened in

Speaker:the:Speaker:

I love that SQL tab.

Speaker:

You know, it's dead. Long live Crystal. So was Crystal ever database or was it

Speaker:

just Crystal Reports? Reports is all I knew. I didn't know about it as a

Speaker:

database. That's all I I I use Crystal Reports and

Speaker:

my favorite thing was it filling up the drive because

Speaker:

it kept caching things. But I

Speaker:

remember the whole idea of just because you place it somewhere, it doesn't

Speaker:

mean it's actually gonna end up there. Like, the whole thing is

Speaker:

but sorry to cut off. That's okay. But SSIS in general,

Speaker:learned it even in, you know,:Speaker:

in November, I think, of 2,005. Even if

Speaker:

you learned it then, it at at a fundamental level, it

Speaker:

hasn't changed that much. And whereas you'll see other software

Speaker:

you Visual Studio is, you know, a software development platform

Speaker:

that allows you to do c sharp and v v and, you know, all of

Speaker:

the stuff. And it allows it still supports for us. I know. I haven't tried

Speaker:

v v in 9 years now. So

Speaker:

it's been a while. But if you look at

Speaker:

how much most software changes from a developer

Speaker:

perspective, and SSIS is software development. So as your

Speaker:

data factory, and any data engineering, that software development,

Speaker:

SSIS is probably in the 95%

Speaker:

of what it was. If if you knew the fundamentals

Speaker:in:Speaker:

2024. And Right. Part of the decision

Speaker:

to go make the upgrade, we talked about, you know, maintenance wonders and stuff, and

Speaker:

I I get it. And it's not the same as your phone slowing down. I

Speaker:

said that, but that's a bad analogy. But Well, it is also

Speaker:

it's also, I think, also very relevant to Windows 10. Right? If you're

Speaker:

on Windows 10, your updates are gonna stop in October. That's

Speaker:

true. I don't wanna get on that soapbox and rant. Sure. No. But I

Speaker:

I mean, there's I get reasons for that as well. I don't

Speaker:

like that it's gonna change because I like Windows 10. But,

Speaker:

but yeah. Well, there's there's I'm gonna join you in not

Speaker:

going down that road. But I'll say this. Hey,

Speaker:

Maddie. How are you? The, the

Speaker:

just the fundamentals of data engineering haven't changed. And the

Speaker:

tool itself, you know, if you knew it back then.

Speaker:

And it's, you know, you know it now. And so if you

Speaker:

learned it now, you could go back then and still work

Speaker:

in the previous versions of it with, very little headache. And

Speaker:

that speaks a lot speaks volumes to the,

Speaker:

the team that designed and built that. And

Speaker:

so in addition to the technical reasons for doing this, the business

Speaker:

reasons, kind of revolve around one of my favorite

Speaker:

phrases. I mentioned this in today's newsletters. It's a compelling

Speaker:

reason. Do you have a compelling reason to make this change?

Speaker:

And business people think about this all day every day because

Speaker:

the amount of money that they make, the profit is based directly

Speaker:

on the amount of money that they, you know, they spend and are they getting

Speaker:

this value for it. So if they can improve the performance of

Speaker:

something, say 10 times, and a result of

Speaker:

that is they get, 5 times as many customers,

Speaker:

then that's not a bad investment. That'll just work. But

Speaker:

if you're coming to me and I'm a business that existed

Speaker:

before the Internet, if you're coming to me and saying,

Speaker:

I want you to change to this completely different model,

Speaker:

where, you know, and and the way it's presented

Speaker:

often is you can save money. And that's

Speaker:

true because if I start a new business today, I'm I

Speaker:

couldn't even compete. I'm not gonna be able to stand up the

Speaker:

servers, you know, take that time and buy that hardware and float

Speaker:

that that inventory that I need to manage all that. Whereas,

Speaker:

I can I can pay rent essentially every month

Speaker:

on that service? Right? Right. I I always like to say I

Speaker:

always like to say if I wanted to start a bookstore today,

Speaker:right, versus:Speaker:right, or:Speaker:

I mean, Barnes and Noble spent a ton of money. I don't have the exact

Speaker:

number, but I can there's probably tens of 1,000,000 of dollars, probably closer to a

Speaker:

$100,000,000 to just before they had their first customer.

Speaker:

Right? Wow. And that but that was the heyday of the

Speaker:

dotcom. Right? Because they were you know? But then

Speaker:

but if you wanted to start a bookstore today, whether or not it's a good

Speaker:

idea, let's let's just suspend our disbelief for a

Speaker:

second. You can probably do it on a on an average credit

Speaker:

card limit. Because

Speaker:

because your IT is enabled. Right. And and you pay like I

Speaker:

said, you pay fractions of what you would have to do in the brick and

Speaker:

mortar. And most of the initial spend isn't gonna be your servers or hardware.

Speaker:

It's gonna be in development and marketing. Right? Getting the word out

Speaker:

because it's such a noisy market. It did the the market has radically changed.

Speaker:

And I also think imagine go ahead. I'm sorry.

Speaker:

What? Imagine? I was gonna say imagine that you've built

Speaker:

Right. This infrastructure on premises already. You've got all of this done.

Speaker:

It's a sunk cost. We can debate about how to feel about sunk cost.

Speaker:

Right. But it's there. You spent the money and it's there. And

Speaker:

you're not gonna get that 5 x income boost when you move to

Speaker:

the cloud. In fact, in some cases, not

Speaker:

all by by any stretch, but in enough cases, you

Speaker:

move to the cloud and it costs you money. Because when

Speaker:

you're getting the presentation about starting using the metrics of this

Speaker:

new company being started today Right. You're, you know, you're told the

Speaker:

truth. You're not being lied to at all in in any of this.

Speaker:

But often, systems that were designed software

Speaker:

and front end back end systems that were designed, you know, from

Speaker:ineties through the mid early:Speaker:

Those systems were architected in a whole different mindset

Speaker:

of what's the prevalent mindset for today. And as a result of

Speaker:

that Yeah. Yeah. As a result of that, one of the things missing from

Speaker:

the spreadsheet calculation that you're gonna get the ROI

Speaker:

from moving off your on premises servers to the cloud is

Speaker:

that couple of $1,000,000 and about 18

Speaker:

months, of the hit that you're gonna have to

Speaker:

spend rearchitecting Yep. All of your systems so that they

Speaker:

now fit today's paradigm. And frankly,

Speaker:

if you are interested in doing that, you you could go do that

Speaker:

at any you could have done that at any time in the last 10 years

Speaker:

and made that shift. But people didn't do

Speaker:

it because the business people didn't do it because the ROI was not dead. There

Speaker:

was not enough return on that investment. If they wanted to, they

Speaker:

would have spent that money then, but it wasn't gonna improve the bottom line.

Speaker:

In fact, it was gonna hurt the bottom line. And so you see

Speaker:

companies now make this move into the cloud and

Speaker:

then, yeah. Yes. That is a

Speaker:

that that's an astute question to ask. So for those who may be

Speaker:

listening and not viewing this, it says is SQL SQL dev

Speaker:

d b a says, I use Brent's and I'm assuming Brent Ozarks. Brent

Speaker:

Ozars. Problem are you trying to solve by changing this for justifying

Speaker:

upgrades? Brilliant. That's that is brilliant. And he's

Speaker:

right. And the you know, but it's compelling to hear and

Speaker:

read the case studies of of companies that, you

Speaker:

know, were able to do use to to access

Speaker:

$10,000,000 worth of hardware, like you said, on a credit card. And think,

Speaker:

wow, what would that do for us? And the answer is sometimes, yeah, it'll

Speaker:

revolutionize your business. You'll 10 x coming out of this. But other

Speaker:

times, it's like, no. You'll point 8 x.

Speaker:

You know, this isn't as compelling. So it's interesting because, like, I think

Speaker:

there's a number of and I found the article. I'll pull it up. But but

Speaker:

one of the examples, it was either Dropbox or Box. I forget which company it

Speaker:

was. But but they had basically started off, I think, in

Speaker:

AWS. Mhmm. And then they got to a certain size. They actually

Speaker:

figured out it's cheaper for them to design their own servers that are optimized for

Speaker:

mass storage Mhmm. Than doing it. So they started building their own hardware and their

Speaker:

own stuff. But I could tell you, if they were a startup and they went

Speaker:

to a VC saying, we want to start with this on prem, they would have

Speaker:

been laughed out of the building. Yep. Today. Yes. Today they would

Speaker:

have. They mean, you know, and it's

Speaker:

it just shows that the the shifting economics of cloud versus on

Speaker:

prem and and other types of things that I don't think people really have figured

Speaker:

out yet. So this was a really interesting I'm gonna share this tab if

Speaker:

I can show it on the screen. Sources. But that that

Speaker:

use case is you can't, you know, having the compelling

Speaker:

reason to migrate to the cloud, and you can do that upfront.

Speaker:

It's harder. But exactly what you're showing there, you're

Speaker:

sharing that that idea of leaving the cloud, that's

Speaker:

growing. And it it's growing across the board. And I

Speaker:

one of the, metrics for that that's

Speaker:

directly related to what we're talking about here today with

Speaker:

with data engineering, is that there

Speaker:there's been an increase in:Speaker:

number of, people that reach out to me to talk

Speaker:

about, SSIS help with their systems.

Speaker:

And, I mean, I do consulting in, you know, ADF and fabric, and

Speaker:

most of my consulting has been in ADF. When SSIS was involved,

Speaker:

it was in lifting and shifting SSIS into an Azure SSIS,

Speaker:

integration runtime. But all of a sudden, after

Speaker:

2, 3, 4 years of that, that shifted this

Speaker:

year. And people started reaching out to me with SSIS on

Speaker:

premises consulting things, and I kept up with it. So I was

Speaker:

able to do it. And but there's other

Speaker:

evidence that I will not share. I probably I may be able to, but I'm

Speaker:

just not going to. But it's even better evidence than my

Speaker:

anecdotes about people more people reaching out to me. Right. That

Speaker:

the amount of SSIS being executed in

Speaker:

the world has increased, and it's a double digit percentage

Speaker:

increase just in the past few months.

Speaker:

And I I think I now this is where I start speculating, and I

Speaker:

don't know the answer to that. But we have a our mutual

Speaker:

friend that we, another mutual friend you and I connected with

Speaker:in November of:Speaker:

future. Recently recently worked for a

Speaker:

year and a half, 2 years for this large agency that's

Speaker:

not part of the government, but does money supply stuff.

Speaker:

Oh, okay. I know. I know. Yeah. After getting his MBA from Sloan, you

Speaker:

know, which no. Sloan. No. Sloan is

Speaker:

important. The, you know MIT. School with MIT.

Speaker:

Right. He's a graduate with that. Super smart.

Speaker:

He shares with me when I'm telling him this story, I give him that stat,

Speaker:

and he says, here's what's going on. Economically,

Speaker:

money is more expensive today than it was. And

Speaker:

so he said he said that as he's telling me this as a

Speaker:

cautionary tale because he says it's gonna change. It's good. Money's gonna get

Speaker:

cheap again, and people are gonna flock back to the cloud. That's his

Speaker:

theory that it's all being driven by money, and I don't think he's wrong,

Speaker:

especially I think that's one level. I think that's one lever. I think there's

Speaker:

more than one lever. That is certainly a big one. But I you

Speaker:

know, as someone who I you know, my previous role at Red

Speaker:

Hat and my current role at Red Hat, I have to think globally. Right? And

Speaker:

we don't again, not a commercial for Red Hat even though the

Speaker:

fedora is there. You know, one of the things we do

Speaker:

is we basically provide a data platform end to end that

Speaker:

can run-in any cloud on

Speaker:

prem or, you know, one of the hyperscale. Or hybrid. Yeah. Yeah. Or

Speaker:

hybrid. Right? Where and there was one customer that I spoke with

Speaker:

before I won opportunity to leave. They were, big government agency.

Speaker:

And this big government agency, you know, they have

Speaker:

their own data centers, even though there was a push to

Speaker:

get rid of them all. But they also have because of way contract

Speaker:

government contracts work in the US, they had, foot

Speaker:

you know, money to spend in AWS, money to spend on Azure, and I think

Speaker:

even money to spend on Google Cloud. So the one

Speaker:

advantage that we had that the other ones couldn't is that the he called them

Speaker:

the soft costs of training people how to do he'd do the same

Speaker:

thing to do linear regression in SageMaker and

Speaker:

push them out of production in SageMaker and Azure and in Google.

Speaker:

Right? Yeah. And this was one tool. You learn it

Speaker:

once. You administer it once. The same glass.

Speaker:

It was the same thing. I think those environments are very real. Now those are

Speaker:

probably limited to large customers or kind of the government

Speaker:

agencies that have these kind of contracts and things like that.

Speaker:

Yeah. But also Mhmm. You have a number of

Speaker:

countries that it's just not a good look to move

Speaker:

your data out of country. Right now, in the

Speaker:

US and Canada, we don't have this issue because there's plenty of all the hyperscales

Speaker:

have footprints in Canada and the US. But if you're in

Speaker:

Latin America, which is this where this example comes from, right, there's only

Speaker:

at least as Red Hat defines Latin America, includes Mexico, and basically all the

Speaker:

way down to Antarctica. Mhmm. And only just,

Speaker:

like, 30 some odd countries. Right? Someone's gonna write me hate mail saying that this

Speaker:

is the exact number, but let's just keep the math simple. It's 30. We

Speaker:

love those mail. We do. We love we love the mail. We learn things every

Speaker:

time. Right. I talk about them personally when I get corrected at

Speaker:

at the dinner table because I wanna share that with No. I mean, it's it's

Speaker:

good. I'm not saying don't do it. I'm just trying to keep the math simple

Speaker:

because it's Friday before, you know, basically, we're sure holidays. Sorry, Frank.

Speaker:

I I derailed you. No. That's fine. Only 3 countries

Speaker:

from Mexico down to Antarctica have hyperscaler presences.

Speaker:

Now, the 4th one in the but out of 30.

Speaker:

Right. So it's actually 10% or less realistically. I think it's like

Speaker:

37 countries. I asked Wikipedia and stuff like that.

Speaker:

So less than 10%. Right? Right. If you're in a

Speaker:

country that doesn't have a footprint. If you're in that 90%.

Speaker:

You have to ship it out as a country. You have to be okay with

Speaker:

that or do roll your own solution on a thing. So there was a

Speaker:

government we we won a big contract because they

Speaker:

wanted to do advanced AI and they wanted

Speaker:

to keep it in country. Right? Doesn't necessarily have to be on prem. Could just

Speaker:

be, like, you know, an Equinox data center down the street or something like that.

Speaker:

Right. But within their thing. And it was a government agency, so it wasn't

Speaker:

computer science or even data science. It was political science that really kinda was the

Speaker:

driver there. Right? Because if I'm in country x and I have to move

Speaker:

my and I'm a government agency in country x, I have to move my data

Speaker:

to a sovereign country y. Not a good look.

Speaker:

Yeah. Right? And,

Speaker:

you know, would it really matter? I don't think so. Like, in a but

Speaker:

from a legal point of view, it kinda does. Like, where the data resides in

Speaker:

the residency. And I think if you go to the Azure website

Speaker:

now, they'll actually tell you where the data resides. And they actually interestingly

Speaker:

enough, they get down to granular, at least on the US side to the

Speaker:

state. Right? So, like, it'll say, you know, Virginia and

Speaker:

stuff like that. We have a comment from, or not. I'm gonna hide my

Speaker:

screen so I can look up the Azure map and kind of

Speaker:

demonstrate that. And then I have to figure out where the sources are. There we

Speaker:

go. So you wanna read the comment while I do that while I'm distracted? Sure.

Speaker:

So so many comments, but there are clients who are

Speaker:

considering the technology used in the software package, and they

Speaker:

may escape when they see the old school,

Speaker:

stuffs. I'm so maybe.

Speaker:

And and that may be, you know, I hate to be that guy that says,

Speaker:

oh, that use case is invalid. I don't think so. I'm not aware of it.

Speaker:

That doesn't mean it's invalid. I'm not aware of a lot of things.

Speaker:

But but maybe. And, you know, definitely

Speaker:

have mixed emotions, about that. If you're buying

Speaker:

because the the if you're a client and you're

Speaker:

buying from, some company and you

Speaker:

decide to go to a different company because of the

Speaker:

technology stack that's being used behind there, I

Speaker:

don't know. I think that says more about you as a client than it does

Speaker:

about the company. If they're delivering the service and it's, you know,

Speaker:

the the the rules of data engineering are you get accurate

Speaker:

data as fast as possible, and those priorities

Speaker:

are in that order. Yeah. I don't think the old school stuff second.

Speaker:

The only risk of the old school stuff is it's still maintained or there's still

Speaker:

security packages. That would I mean, if I were in that

Speaker:

position, I would be like, oh, I mean I mean, if you're still using, say,

Speaker:

Sybase 6. Right? You know, like Yeah.

Speaker:

You know, got a little there's definitely a line there, and it's drawn

Speaker:

based on it's drawn different places first. And some of the

Speaker:

reasons, that it is drawn in different places is

Speaker:

security is huge these days. I mean, that's gotta be your number one

Speaker:

concern. And and, you know, it it

Speaker:

goes from there. But if you're delivering the service securely,

Speaker:

I'll just pick that one, then I would say

Speaker:

that, you know, that if if you

Speaker:

lose a client because you're not using the new shiny,

Speaker:

I don't know what you can do about that. I'm trying to think I'm not

Speaker:

gonna say the client's wrong for feeling that way. They probably have valid reasons

Speaker:

for feeling that way. But if, you know, if they wanna if they

Speaker:

wanna make that decision based on that. And I'm looking at Frank's graphic here

Speaker:

of the is that the data centers? This is the one I was telling you

Speaker:

about. Right? So this is this is just Azure, but I would say it's a

Speaker:

pretty good proxy for the other hyperscalers. Right? Mhmm. I would

Speaker:

say Azure at one point had more. I I haven't kept up.

Speaker:

Well, that was one of our talking points when I worked at Microsoft. Right? We

Speaker:

have more than to be honest. Right? But I would say if it's not exactly

Speaker:

more, it's close enough. Right? So there's Mexico. You see the United

Speaker:

States is pretty well covered. So is Canada. Right? So if you were a Canadian

Speaker:

company, you had to keep it in Canada. You had an option. Right?

Speaker:

Yeah. If you're an American company, you have pretty good choices.

Speaker:

If you're Mexico, yep. But if you're in any of these countries in Latin America,

Speaker:

down through South Wales. We only had okay. So now there's Chile.

Speaker:

Okay. Gotcha. Right? So I'm sorry. Now there's

Speaker:

4.

Speaker:

So my math is going to get more complicated right away. Right. So, there's

Speaker:

Brazil. Actually, no, there's still 3.

Speaker:

So there is no footprint for Azure in Argentina.

Speaker:

These little blue things you see, that is Colombia. Those are networking

Speaker:

pops. So basically, from a

Speaker:

networking point of view, if computer science were the only thing that would matter,

Speaker:

then that be that would be acceptable. But data residency is

Speaker:

the issue. So if I go here, the US East 1.

Speaker:

Right? It'll tell you that its

Speaker:

location is Virginia and it's stored at rest in the United States.

Speaker:

Like like here.

Speaker:

Is there any more details? There isn't. US.

Speaker:

Yeah. They used to. Yeah. They don't talk much about that,

Speaker:

about where they are. But one of the

Speaker:

US East Georgia. 1 of the Yep. The US

Speaker:

East 2 is down in Danville, isn't it?

Speaker:

They are or Mecklenburg. I forget which. Well, essentially, they chose a

Speaker:

picture of Richmond. Right.

Speaker:

I mean, these are, you know, with

Speaker:

we kinda touched briefly on on politics in a geopolitical

Speaker:

slash, sovereignty strategic way.

Speaker:

Right. These are huge. And I I know I'm not the one, thinking

Speaker:

about it. But Look at this. Yeah. There's one in Israel. Those satellites, Frank,

Speaker:

are getting it, by the way. Those those

Speaker:

satellites on the graphic, those things are moving at way faster than

Speaker:

normal satellites. Oh, yeah. Yeah. I mean, satellites are going to

Speaker:

change things, but, like, in terms of where the data sits at rest is really

Speaker:

where because ultimately, I think it really boils down to when are the

Speaker:

local police going to barge down a door with a with a court hopefully

Speaker:

with a court order and basically copy everything. Right. That's really what

Speaker:

matters. Right. That turns out that's really what ended up mattering. But

Speaker:

if you look at the Middle East, right, like 1, 2,

Speaker:

3 countries have it.

Speaker:

Right. And that entire region, you

Speaker:

know, obviously with geopolitical tensions being what they have been for a number of

Speaker:

years. Yeah. Moving your data center to any one of these countries may be an

Speaker:

issue for you for your organization or your

Speaker:

regulatory. Right? Europe is kind of the same thing, right, where, you

Speaker:

know, there's Switzerland, there's Italy. And I

Speaker:

know that there's different kind of things in terms of Germany.

Speaker:

It was actually I don't know if it still is now, but it

Speaker:

might have been a, I used to live in

Speaker:

Frankfurt, actually. Yeah. There was actually what they call a

Speaker:

sovereign cloud because there was concern that if it was a US company owning

Speaker:

a data center, that US courts would have jurisdiction there, which is a

Speaker:

brilliant move by a a past administration. I say

Speaker:

brilliant sarcastically in case you're didn't get pick up on that.

Speaker:

Where they thought that they could basically issue a a court order to

Speaker:

demand something from here in Ireland. And

Speaker:

Microsoft fought that because they realized, like, wait a minute. That would mess up our

Speaker:

entire that would cause a lot of problems. Yeah.

Speaker:

And, ultimately, they dropped the case before it was finally

Speaker:

decided. But in order that they could, thing, at one point,

Speaker:

anyway, this is actually owned by a German

Speaker:

company, managed by a German company, and it's leased to Microsoft to to

Speaker:

to have that concern. I think China also operates the same way.

Speaker:

Frank, I was commenting on the, satellites on the graphic

Speaker:

there. Oh, that they were moving around. Yeah. Yeah. They are moving very, very

Speaker:

fast. And it keeps they're moving with us.

Speaker:

But, I mean, keep in mind, though, like, keep in mind, though, that

Speaker:

we're just talking about data residency. There's other things that if you're building a real

Speaker:

solutions, other things to consider. Yeah. Right? Like And there's

Speaker:

a whole lot to that. And Yeah. Yeah. Oh, absolutely. And,

Speaker:

you know, part of the part of it is,

Speaker:

part of what what happens when you start kinda going back to

Speaker:

the data engineering, platforms and stuff that you use.

Speaker:

There are sound business reasons for not making a change,

Speaker:

and there are some unsound business reasons that will

Speaker:

confine you to not making a change. And I I think about this. I'll put

Speaker:

it in context of, of SQL Server.

Speaker:

Companies will come up and they this has happened, and I still have clients

Speaker:

running applications on old servers

Speaker:

because the company that so they

Speaker:

they serve, their clients include

Speaker:

enterprises that care an awful lot about

Speaker:

checking boxes and auditability and all of that stuff. Regulatory

Speaker:

type things, which is not bad. It's just the

Speaker:

way that it is. That's their their business demands something

Speaker:

like this. These companies were formed. They were stood up, and they've got SQL

Speaker:

Server 2,005 running or 2,008 or stuff that's been

Speaker:

out of the maintenance cycle at Microsoft for a long,

Speaker:

long time. And unknown it's also not a well known

Speaker:

fact that if you don't wanna upgrade to version x or

Speaker:

y, you can pay extra money, and Microsoft will maintain

Speaker:

and provide you patches. Right? There there's rumors that

Speaker:

there's at least as of a few years ago, there were still Windows 95 systems

Speaker:

that were, you know and that sounds absurd.

Speaker:

Yeah. But Well, you walked down the, entry to,

Speaker:

Delta flight, and there was one that's what was it? One is 97, I

Speaker:

think, sitting there. 98. Yeah. 98, was it? Yeah. One is 98.

Speaker:

Sorry. Yeah. One is 98, boxes sitting there for the longest time. They're still

Speaker:

I believe they're 1 to 7 now. Still. I I

Speaker:

saw XP. XP. You're right. It is XP. Yeah.

Speaker:

So, you know, you just I kinda noticed this, like, wow. I hadn't seen

Speaker:

that in a while. But it's it's not about

Speaker:

will the new technology run that

Speaker:

SQL Server 2,005 database. The answer is clearly yes.

Speaker:

Well, you can always virtualize something. Right? Like, that's something that, like I

Speaker:

mean, there's that compatibility levels help. Right. There's a number of things that

Speaker:

do it. But here's the kicker. If the application is

Speaker:

not certified to run on that

Speaker:

and you're for you can change it, and you be maybe you have changed

Speaker:

it and tested it and go, yeah. It works. We'll just move it to, you

Speaker:know, SQL Server:Speaker:

know it works. But the people you're serving,

Speaker:

people who care way more about checking all the boxes and the regulations

Speaker:

being a 100% and auditable, they won't

Speaker:

allow you to. And it gets even more complex when that company that

Speaker:

originally sold you that software 20 years ago is no longer in

Speaker:

business. Right. So you have no path forward.

Speaker:

I mean, the only Or they get bought by another company that you

Speaker:

don't really like. Exactly. That's happened too.

Speaker:

Exactly. Then A lot of mainframe companies were brought up by I don't wanna

Speaker:

name names, but, like, were brought up, and they it really was, like,

Speaker:

ironically, because they what they do, they they knew they had them. And

Speaker:

ironically, a lot of mainframe migrations happened because of

Speaker:

that. Like, it was And so you've got, you know, you've got that

Speaker:

angle where people are sticking with older systems for whatever reason.

Speaker:

And it's, you know, it goes like I'm saying, my point is that this goes

Speaker:

beyond just the data engineering realm. There are

Speaker:

there are compelling reasons to use,

Speaker:

older software. It may not be anybody's, you

Speaker:

know, satisfactory answer, but it is, you know, those

Speaker:

reasons exist. And if you're the, you know, if you're a

Speaker:

developer who likes using the new shiny and learning the new

Speaker:

stuff, I'm one of those. That's why I'm teaching courses on fabric data

Speaker:

factory right now and watching as it kind of some

Speaker:

days it works and some days it doesn't. We've had that happen on a

Speaker:

number of deliveries this year, with that.

Speaker:

So if you read what I wrote about this and

Speaker:

you come away with Andy's against the new stuff, well, you're just

Speaker:

as wrong as wrong can be. That's not the case at all.

Speaker:

You know, it's an

Speaker:

interesting I mean, so back to the lecture at hand, what kind of kicked this

Speaker:

all off and inspired the stream was

Speaker:

this post where I think the short

Speaker:

answer is everybody's a little right. Everybody's a little

Speaker:

wrong. And as a consultant, you can appreciate these two

Speaker:

words. It depends.

Speaker:

Right? Because, like, you may want to upgrade to the new shiny. I know every

Speaker:

developer wants to do that. And I think the comment for some reason I can't

Speaker:

say is like basically hiring managers will put in a job description. All

Speaker:

there's also the other matter of job descriptions and, you know, job requirements

Speaker:

are. They're always a 100% accurate. Disconnecting

Speaker:

from consensus reality. Yes. I like to say.

Speaker:

But they may want someone with, like, say, ADF

Speaker:

and and and this, but then actually have them working on systems and SSIS

Speaker:

because the hiring manager knows that he or she may not have that open req

Speaker:

for a while and has a in the back of

Speaker:

the mind the idea of moving to that someday.

Speaker:

Sure. But realistically, for the next 2 years, you're gonna work in this site.

Speaker:

Yeah. I mean And there are still large

Speaker:

consulting companies out there that develop brand new

Speaker:

applications in SSIS, brand brand new data

Speaker:

engineering data warehouse. One worse than that or one better depending on your

Speaker:

point of view. A few years ago, I think it was on dotnetrocks. They

Speaker:

were talking about telemetry from Visual Studio. And this back

Speaker:

when I cared about Windows client development.

Speaker:Basically, WPAF came out in:Speaker:

2007. Right? XAML.

Speaker:

No. Not XAML. Metro or modern

Speaker:applications, UWP came out in:Speaker:

2012. Right? So there's been multiple

Speaker:

frameworks to write when and it's been a number since. But, again, don't really care

Speaker:

about those client development anymore.

Speaker:

Windows Forms is still the number one of

Speaker:

all those, like, ways to develop Windows applications that run on Windows

Speaker:

Mac. Windows Form is still accounts for 80% of development.

Speaker:

Wow. Something someone's going to like, please email me in

Speaker:

hate with hate mail, not hate mail, but like tell me the exact number. But

Speaker:

it was still. He's off. Well, and they kept saying in Visual

Speaker:Studio:Speaker:

line for Windows Forms. We're not updating. We're not adding anything.

Speaker:

And the future is from now on. Right?

Speaker:

Until the future became something else. And then when

Speaker:

I last installed, I think it was Visual

Speaker:Studio:Speaker:

There was improved. They added stuff to Windows Forms, which is kind of funny because

Speaker:

they said they never would. Yeah. But it just because they.

Speaker:

Demand. Right? Customer demand. Ultimately, customer demand

Speaker:

is what pays the bills. So you've got to be very mindful of that.

Speaker:And, you know, if you if it's:Speaker:

a Windows Forms app, I have questions.

Speaker:

You know, I'm not saying I disagree, but I have many questions.

Speaker:

Arguably, you could say the same thing for UWP or

Speaker:

WPF. Right? You know,

Speaker:

But, again, it really depends. Like, in the last time I had worked with

Speaker:

WPF professionally was it was for when I

Speaker:

was at a, between my stints

Speaker:

at Microsoft, and there was a, you know, there was

Speaker:

a customer who was a mortgage company, and they basically had their mortgage

Speaker:

intake form written in WPF. Right?

Speaker:

And they were having performance issues with it. And I I looked at the code,

Speaker:

and I was like, this is a good lesson, I think, is

Speaker:

that, you know, they loaded up, like, some 6, 700 controls

Speaker:

all at once. Right? Wow. Because there were a lot of

Speaker:

fields and but they were all collapsed and things like that. And I was like,

Speaker:

well, I'm looking at this, and I'm, like, testing it. And I'm like, oh,

Speaker:

dear god. This is gonna be a nightmare because it's 600 controls. You could do

Speaker:

lazy loading and things like that, but then Sure. There could be unintended

Speaker:

consequences there. And then then I happen to notice when I load the

Speaker:

app, the CPU spikes, but the GPU was

Speaker:

hardly touched, which the whole promise of WPF was

Speaker:

that it would offload as much of the rendering

Speaker:

Yeah. Over to the GPU as possible because it was basically built on

Speaker:

XNA, which was a gaming framework. But that, again,

Speaker:

different different sidetrack, and different lifetime ago.

Speaker:

So I'm like, what the heck is going on? So then I'm like, I looked

Speaker:

at the some of the machines. I'm like, they had the generic

Speaker:

GPU driver. So I'm like, just

Speaker:

for grins, let me see if I

Speaker:

can get the proper driver for this device.

Speaker:

All of a sudden the 30, 40 seconds it took to load that initial

Speaker:

screen went down to 5 seconds.

Speaker:

Wow. And that's a big jump. That was a

Speaker:

big jump. And that was like, and I said to the guy,

Speaker:

it's like, look, just installing this driver, you get

Speaker:

a, you know, massive increase in it. Right? Do you

Speaker:

really wanna architect it or are you trying to just you want this to be

Speaker:

faster? Like, what's considered acceptable? And he said, well, under 10 seconds would be

Speaker:

acceptable. And I was like, I could do you better. How about 5 and a

Speaker:

half? Right. And I showed him

Speaker:

and he goes, well, not everybody has a GPU in their device. And I was

Speaker:

like, well, like, you know, this GPU costs

Speaker:

about, I think, 189. For some reason, 189 is stuck

Speaker:

under $200. Yeah. And I'm like,

Speaker:

so you'd have to install it. You have to think about the labor of installing,

Speaker:

like, this cheap GPU. Right? And this, he goes, that's

Speaker:

fine. He goes, look. The cost of redeveloping and

Speaker:

rearchitecting and retesting this versus $200 per

Speaker:

box, plus whatever it takes for somebody to go in with a screwdriver

Speaker:

and update the drivers. Right. It was so like we

Speaker:

ended up not having to touch the code at all. Right? Yeah. It was just

Speaker:

a matter of a driver update, which nice. Because it was like a fixed

Speaker:

price kind of support contract. I was the hero because I

Speaker:

solved the problem with about 3 out 3 to

Speaker:

5 hours of work. Nice. And the customer was happy because

Speaker:

they didn't have to re architect anything. Everybody wins.

Speaker:

Everybody wins. I love it. Happens. But but it it's just it

Speaker:

just goes to show you, like, sometimes the most cost

Speaker:

effective approach isn't is to not

Speaker:

touch anything. You know, and it's although it was a

Speaker:

relatively small amount of money, and and,

Speaker:

manageable amount of time for the client,

Speaker:

sometimes in if you look at the, you know, kind of the the

Speaker:

big, performance tuning

Speaker:

picture. And I I ran into some of this, in the past where

Speaker:

Well, it's not always a happy ending. As as an engineer,

Speaker:

I want to, you know, to fix the the thing that I'm engineering. And if

Speaker:

it's software, then I wanna make the software perform better. And

Speaker:

if it's, you know so I went through one of those experiences and

Speaker:

then, number of circumstances where

Speaker:

but the end result was we we

Speaker:

threw money at it and

Speaker:

bought, better disks.

Speaker:

And I remember telling me about this. I remember you telling me this. Oh, it's

Speaker:

a specific long story, but, yeah, it's back from about 12 years ago. Yeah.

Speaker:

You were, like, just a long SSDs, and it got fast

Speaker:

enough. And so when I did the math on that was the

Speaker:

enterprise level project, there were dozens of people

Speaker:

tuning on a tiger team, and we did make it

Speaker:

go faster. And as a test, what I did was I

Speaker:

rolled the code back to where it was when we

Speaker:

started. I will say now though, tiger the term tiger

Speaker:

team after my Nobody knows what I mean. Sorry. I know what you mean. I

Speaker:

know what you mean. Of individuals, dozens of individuals focused on solving

Speaker:

a particular problem. It's it's like, Although in some cases

Speaker:

slurring. Focused on not solving a particular problem, but that's a story

Speaker:

for another day. Different different story. You're right. But, yeah, we

Speaker:

we did and and I rolled everything back and ran the old

Speaker:

code that we had optimized. You know, it ran super

Speaker:

fast on the SSDs, and it was running okay. It was acceptable,

Speaker:

barely, on, the spindles. But when we rolled it back, it

Speaker:

was, it was the same difference. We actually got a touch more

Speaker:

performance just off the SSDs. And when it you know, you do the math on

Speaker:

that. At that time, SSDs were rather new, and the amount

Speaker:

that I wanted was, not trivial.

Speaker:

I asked for it on a lark, and I was surprised when it showed

Speaker:

up. So she did work for Microsoft. Okay. I'm not

Speaker:

surprised. Kendra is, scary as much. She did consulting. We

Speaker:

Brent's name came up a a few times. She worked with Brent for a while.

Speaker:

She worked at Redgate. Not everyone knows what Redgate is, but they're kind of a

Speaker:

big deal in this in this situation. Yeah. Yeah. They're a big deal. So, like,

Speaker:

clearly, like, I I just find I'd love to get her, like, initial

Speaker:

opinion after factoring in all kind of all of this is that,

Speaker:

you know, ironically, Sync don't run after every shiny thing.

Speaker:

But the the thing that that guy that was originally learning was is

Speaker:

the new shiny thing, ironically. Like, so there's a lot of layers to this.

Speaker:

There's even if you take this kind of at face value of not knowing the

Speaker:

context, there's a lot of layers. But, like, beyond that, there's even more layers. Like,

Speaker:

it becomes this multidimensional problem. It's true. It's like an

Speaker:

ogre. Many layers. Many I was wondering when we're

Speaker:

gonna have a movie reference because it's been a while. There we go. Boom. Shrek's

Speaker:

a classic. Shrek is a classic. The

Speaker:

sequel is not so much, but that's a common case.

Speaker:

Sure. Very, very few Matrix movies looking at here. It was

Speaker:

you know, I I saw the, activity on this and it's best because

Speaker:

since I posted this, I can see how many people looked at that. The the

Speaker:

link I sent you for the other one is kinda the one that started it.

Speaker:

I don't know if you have that link in chat. I think If you wanted

Speaker:

to click on that. I have That was that was a

Speaker:

few days, maybe a week before this. That was the first one I

Speaker:

commented. It was similar sentiment and I shared, you know, again, I

Speaker:

joined the conversation and then, I can't believe is

Speaker:

SSIS dead? Yeah. That's the one. So that

Speaker:

particular one has gotten, like, probably 12

Speaker:

to 15 times as many views. It's happened to You know, you mentioned And

Speaker:

it was the first time I had piped up about anything like this. I'm I

Speaker:

commented on the original one. LinkedIn will pop this up if you write a

Speaker:

lengthy comment like like that one. And

Speaker:

I said, that window exist. Looks like Windows XP

Speaker:

era. Well, that's SSIS. That is

Speaker:

a funny. Like, it's just funny. Yeah. But, And I

Speaker:

commented when this LinkedIn popped up and said, hey. Do you wanna repost this

Speaker:

since you wrote this comment? And I said, sure. And it just pastes the

Speaker:

comment up at the top of the repost. But I can see, like, the number

Speaker:

of people that and that one drew a lot more,

Speaker:

a lot more comments. And like I said, 12 to 15 times

Speaker:

the views. And, and and I communicated

Speaker:

with the original author just to touch about that at all. It it

Speaker:

went nowhere near as I'm

Speaker:

trying to think of the right word. No nowhere. It it

Speaker:

didn't get nearly as heated. I'll say it that way. And it

Speaker:

may just be that, you know, that I'm saying heated, but there was

Speaker:

there was one individual in particular who just very passionate about the tools that they

Speaker:

that they used for. I I I wanted to ask,

Speaker:

that individual, you know, how

Speaker:

many SSIS packages they developed and put into production.

Speaker:

Because I I I I would again, I think I know the answer

Speaker:

to that. This in the this individual conversations like that.

Speaker:

Say that again? Which individual? Not in that one.

Speaker:

Oh. Not it's in the other range. The one we've been looking at. This angle

Speaker:

here, though. Right? Nothing against Kiwi

Speaker:

ETL pools. What do you Yeah. Appreciate so this is this

Speaker:

is an argument of GUI versus Yeah. Straight

Speaker:

code. I think that's also an interesting concept. Low code, no

Speaker:

code versus you know? I like them both, but,

Speaker:

you know I do too. It's for me, it's which solution

Speaker:

matches best and as a consultant who goes to work with other companies a

Speaker:

lot Right. A big factor for me is What they have. You know, what

Speaker:

happens when I'm gone? Can you support it? What what are you most

Speaker:

familiar with? Even if I'm working in a tech a technology

Speaker:

technician, a technology like SSIS, there's really a

Speaker:

couple of ways you can, float it. It. That's good, John.

Speaker:

2 was good. I'm referring to a 3, and I think there was a 4th

Speaker:

one. Oh, really? I didn't even know there was a 4th one. So there you

Speaker:

go. But in SSIS, you can take a more And

Speaker:

the matrix 2 was good as well. Matrix 2 was good as well. I was

Speaker:

That's true. Yeah. You you could take, like, a more

Speaker:

DBA approach, a more T SQL driven approach, or you

Speaker:

can take a more dot net driven approach because you have both

Speaker:

execute SQL tasks and SQL sources and destinations and transformations,

Speaker:

and you have a script task and a script component.

Speaker:

And, you know, which way do you go? Which is the right way to go?

Speaker:

Well, it depends. If you've got a bunch

Speaker:

of dot net developers being tasked with maintaining

Speaker:

the, ETL after I leave, no way. 5.

Speaker:

Good god. Shrek 5 or matrix 5.

Speaker:

Either one sounds like a terrible idea. You know,

Speaker:

whoever's gonna be left behind support net, you gotta make sure that Right. You've done

Speaker:

a a fair enough job of representing their preferences,

Speaker:

on that. And, you know, I I know

Speaker:

I know consultants who come in and say, you know, we gotta do it

Speaker:

this way and you're bad and wrong and you're, you know, you're gonna go out

Speaker:

of business in 18 months if you don't listen to me. Right. That

Speaker:

if I also think too, like, when I first read it I'm sorry. I don't

Speaker:

say hire somebody else. Right. I mean, I when I first read

Speaker:

it, I first read it as should you learn SQL?

Speaker:

Okay. Right? And that's a big debate in the data science community is should should

Speaker:

data scientists learn SQL because, you know, Python can do everything. Just your no. No

Speaker:

knock on my Python can do everything. Should you use Python for everyone everything, I

Speaker:

think, is another question. Right? And I think the answer is no. Yeah. I think

Speaker:

if you're a data scientist if you're a

Speaker:

data scientist or even an AI kinda engineer, whatever the word is

Speaker:

this week, you should still learn SQL

Speaker:

because this one, it's way less complicated than anything else you're doing. And

Speaker:

2, it's kind of the the the the lingua

Speaker:

franca of anything data or data interaction. Right? And

Speaker:

again, Frank, the context comes into play here. Right. There's the there's the

Speaker:

mechanical tool. It's describing a software, you

Speaker:

know, a language is mechanical is one one way of looking at it. But Right.

Speaker:

There's also the problem you're trying to solve. And if if you're gonna do

Speaker:

data exploration or managing or whatever you wanna call it,

Speaker:

then I don't really care which mechanism you use. But don't

Speaker:

tell me one of these mechanisms is better than the other. It

Speaker:

Without a qualifier. For this particular task.

Speaker:

Yeah. So it could be that, you know, one

Speaker:

of them is is good at this one particular thing, and I I would argue

Speaker:

this. I did in my newsletter. Oh, strike 5. Okay. In my

Speaker:

newsletter, I argued, every single tool has something

Speaker:

that they're stronger and there's there's some

Speaker:

feature or some aspect of it that's better than all of

Speaker:

the others. And they also have some weakness

Speaker:

that's worse than all of the others. It's true. And and so

Speaker:

you gotta, you know, you gotta strike that balance. It's gonna depend

Speaker:

on your use case, the parameters, the things that are important to you.

Speaker:

Sometimes it's cash on hand.

Speaker:

Sometimes it's the servers, hardware that you're forced to work

Speaker:

with because you're owned by somebody and they they're not upgrading. They're not

Speaker:

giving you that staffing concerns. The people, their

Speaker:

experience, their languages, that their Well, how easy is it to find someone that

Speaker:

necessarily knows SQL versus knows Python?

Speaker:

Or, the bill rate for someone that knows SQL is probably gonna be different than

Speaker:

the person who knows Pandas. I think so.

Speaker:

I I think there are differences there, but, again, it's gonna depend on the company.

Speaker:

That's true. You're worth our job market. Right. You're worth

Speaker:

more to, you know, this company as a SQL developer than, you

Speaker:

know, maybe than a Pandas developer is to that that company. That's a

Speaker:

possibility. Go go where you get paid the

Speaker:

most, but realize especially if you're new, and I think that

Speaker:

person that in Kendra's original, post that

Speaker:

she was, she was jamming on, that

Speaker:

that was a different scenario. You're right. There's a number of squirrelly things about that

Speaker:

original post, but if you're

Speaker:

a young developer and just getting started, you're in college. I've got a

Speaker:

daughter studying computer science in college right now. And my

Speaker:

recommendation to her is first, learn everything that you

Speaker:

can. Do as much as absolutely as you can.

Speaker:

Pick up that knowledge. But,

Speaker:

be aware that there's more to it than just the, you

Speaker:

know, the the the brain exercise you get and the

Speaker:

thrill you get from seeing the code execute. Keep that thrill. Keep that spark

Speaker:

alive. Right. Right. Right. Right. But, you know, real

Speaker:

realize there's often more to it. And some of those factors you have no control

Speaker:

over, and the person you're working for has no control over.

Speaker:

And, Ed, you know, there's just a number of things totally external to the experience

Speaker:

of writing code that often

Speaker:

impact the experience of writing code. That's very true.

Speaker:

That's very true. K. Actually, going on for,

Speaker:

like, 90 minutes. So Wow.

Speaker:

Goodness. It doesn't seem like it. I've had this much coffee, and I still don't

Speaker:

have to go to the bathroom. That's amazing. Christmas miracle.

Speaker:

Of you. It's a festive miracle or a Christmas miracle. I don't

Speaker:

know. Awesome. But, so

Speaker:

this is this has been great. I think we kinda got to the bottom

Speaker:

of this is that basically, it's a nuanced conversation.

Speaker:

There's no simple answer. I have many

Speaker:

questions though. Why if you're learning LLMs and now, why are you

Speaker:

calling yourself a data engineer? But Yeah. That's a different thing.

Speaker:

Could just be semantics at that point. Could be.

Speaker:

But thanks, John. Thanks, Merdad.

Speaker:

Thanks, Hector and SQL Dev

Speaker:

DBA. I'm sure that's not the

Speaker:

name on on the driver's license, but

Speaker:

you never know. You never know. I wanna get a license plate holder

Speaker:

that has, like a, like, drop table.

Speaker:

Like, so that way when they

Speaker:

take a picture, they do the OCR. Boom. I've seen

Speaker:

the bumper stickers with longer Right. Right. Right. Right. Right. Secret

Speaker:

hacks on them, you know, secret injection attacks. No. Hey, man. Thanks

Speaker:

for tuning in. And, he legally changed his name. That's

Speaker:

cool. Apparently, there's a number of people

Speaker:

who have, like, a last name, and their

Speaker:

last name is Null. Oh, wow. And I

Speaker:

was looking up on YouTube. Like, there's, like, a number of, like, problems that people

Speaker:

have. And my first thought was, wait. Wouldn't that only be the case if

Speaker:

they encased it in quotes or single quotes? Like, wouldn't it?

Speaker:

Apparently, no. Some of these systems Depends. I mean, you

Speaker:

wanna talk old systems. I'm sure DMVs have some pretty

Speaker:

ancient technologies that are still running.

Speaker:

I mean, I can only imagine. Robert

Speaker:

Tables. Yes. Little Bobby Tables. Little Bobby Tables. That's the

Speaker:

one. I tried to name my kids something like that. But

Speaker:

You got overruled if I remember correctly. I did in so many ways. I'm

Speaker:

glad I didn't go with the, initials of, x

Speaker:

a m l. Yeah. That

Speaker:

that You could have. Xavier Anthony

Speaker:

Marcus was, on the table. See and

Speaker:

Lavinia? That would just Yep. That would just flow. That would be

Speaker:

your role. Though, since XAML kinda died,

Speaker:

it's probably a good thing. That's true.

Speaker:

So XML is even not as in vogue as it once

Speaker:

was. Well, you know our mutual friend, mister Kevin

Speaker:

Hazard, aka the Duke of Hazard. The

Speaker:

Duke. He says he says JSON is just hipster

Speaker:

XML. He's right, though. And there's another format called

Speaker:

JSONL. What? Yeah. JSONLines.

Speaker:

Never even heard of that. It I it's only the

Speaker:

I didn't hear about it until the product I work on, Rel AI, actually

Speaker:

uses it to store our data. And, basically, it's a different

Speaker:

between JSON and JSONL? Basically, it's a long line

Speaker:

where each line is a record of or a chunk of

Speaker:

JSON. That's how I interpret it. I'm sure I'll get

Speaker:

corrected, but please correct me on that one. Interesting. Yeah. I'm like, Jason

Speaker:

now? Like, what the heck?

Speaker:

Jason is an acceptable one. That's true. Jason

Speaker:

would've been a good name. But I didn't want him to get

Speaker:

teased on Friday 13th for the rest of his life.

Speaker:

So so,

Speaker:

thanks everyone for tuning in, man. This was awesome. We should do it more often.

Speaker:

Thumbs up on, or, you know, be sure to like, share, subscribe. I gotta do

Speaker:

this to all the things. Like, share, and subscribe. I can show off,

Speaker:

this little graphic.

Speaker:

You you know, Frank, we could do this, we could do this

Speaker:

a lot more often if there anytime I stir up some trouble, we do a

Speaker:

a live stream Oh, god. We do it every day. I Yeah. That's what I'm

Speaker:

saying. You know? No. I think it's interesting. I think it's good because, like, you

Speaker:

know, it the controversy

Speaker:

I think people are starting I just look last time I looked at the thread

Speaker:

thoroughly, people were talking past each other. Yeah. Which I

Speaker:

guess defines all of Reddit, mostly Internet.

Speaker:

You know, it is what it is. Yeah. But

Speaker:

cool, man. I gotta actually, now that I mentioned it, I do have to go

Speaker:

to the restroom. See. You did it to yourself. I did do it to myself.

Speaker:

And thank you, Miranda, for turning in. And

Speaker:

I will play the outro graphic. Excellent.

About the author, Frank

Frank La Vigne is a software engineer and UX geek who saw the light about Data Science at an internal Microsoft Data Science Summit in 2016. Now, he wants to share his passion for the Data Arts with the world.

He blogs regularly at FranksWorld.com and has a YouTube channel called Frank's World TV. (www.FranksWorld.TV). Frank has extensive experience in web and application development. He is also an expert in mobile and tablet engineering. You can find him on Twitter at @tableteer.