Why Simulating Reality Is the Key to Advancing Artificial Intelligence
In this episode, we’re joined once again by Christopher Nuland, technical marketing manager at Red Hat, whose globe-trotting schedule rivals the complexity of a Kubernetes deployment. Christopher sits down with hosts Bailey and Frank La Vigne to explore the frontier of artificial intelligence—from simulating reality and continuous learning models to debates around whether we really need humanoid robots to achieve superintelligence, or if a convincingly detailed simulation (think Grand Theft Auto, but for AI) might get us there first.
Christopher takes us on a whirlwind tour of Google DeepMind’s pioneering alpha projects, the latest buzz around simulating experiences for AI, and the metaphysical rabbit hole of I, Robot and simulation theory. We dive into why the next big advancement in AI might not come from making models bigger, but from making them better at simulating the world around them. Along the way, we tackle timely topics in AI governance, security, and the ethics of continuous learning, with plenty of detours through pop culture, finance, and grassroots tech conferences.
If you’re curious about where the bleeding edge of AI meets science fiction, and how simulation could redefine the race for superintelligence, this episode is for you. Buckle up—because reality might just be the next thing AI learns to hack.
Time Stamps
00:00 Upcoming European and US Conferences
05:38 AI Optimization Plateau
08:43 Simulation’s Role in Spatial Awareness
10:00 Evolutionary Efficiency of Human Brains
16:30 “Robotics Laws and Contradictions”
17:32 AI, Paperclips, and Robot Ethics
22:18 Troubleshooting Insight Experience
25:16 Challenges in Training Deep Learning Models
27:15 Challenges in Continuous Model Training
32:04 AI Gateway for Specialized Requests
36:54 Open Source and Rapid Innovation
38:10 Industry-Specific AI Breakthroughs
43:28 Misrepresented R&D Success Rates
44:51 POC Challenges: Meaningful Versus Superficial
47:59 “Crypto’s Bumpy Crash”
52:59 AI: Beyond Models to Simulation
Transcript
Joining us again today on the Data Driven Podcast is Christopher Nuland,
Speaker:technical marketing manager at Red Hat, conference veteran,
Speaker:and a man whose travel itinerary is only slightly less complicated than
Speaker:a Kubernetes deployment. Christopher brings a sharp, data
Speaker:informed perspective on the future of AI, drawing from his research
Speaker:into simulating reality, continuous learning models, and why
Speaker:we may not need humanoid robots to build superintelligence. Just a
Speaker:really convincing version of Grand Theft Auto. From Google
Speaker:DeepMind's alpha projects to the metaphysical quandaries of I,
Speaker:Robot, Chris takes us on a tour through the bleeding edge of AI,
Speaker:where machine learning meets science fiction and simulation might just be
Speaker:the next reality. Hello and
Speaker:welcome back to Frank's World TV. Streaming live
Speaker:from both Boston and Baltimore. We're hitting the B
Speaker:cities today. My name is Frank La Vigne. You can catch me
Speaker:at the following URLs and with me today is
Speaker:Christopher Nuland, my colleague at Red Hat, who is also
Speaker:technical marketing manager here. And
Speaker:you've actually not traveled around the world since we last
Speaker:spoke. I think you've mostly stayed inside the
Speaker:continental U.S. Yeah, it's been nice.
Speaker:I think that's pretty typical of
Speaker:late July, August, because Europe pretty much shuts down and then.
Speaker:Right. The conference season in the United States kind of goes
Speaker:away when people are doing summer vacations and I think we're just
Speaker:now starting to see things pick up. I'll be in Europe for a
Speaker:variety of events. So if you keep an eye on the
Speaker:vLLM community and the vLLM meetups,
Speaker:I have events in Paris, Frankfurt and
Speaker:London in November that I'll be at. So if you
Speaker:are in the,
Speaker:in Europe, in one of those areas, definitely come. You know, it's one of
Speaker:these events. I'll be there and then we'll also have some pretty cool speakers
Speaker:there as well. So I have most, I have Europe, but then I
Speaker:have some big conferences too, like KubeCon and PyTorch Conference coming
Speaker:up. So if there's anyone on the stream in North America going to
Speaker:those conferences, hit me up because I will be there. I'm
Speaker:doing a couple of media events as well as a few
Speaker:talks in the community sections for both of those.
Speaker:So excited to be there, excited to be involved
Speaker:and yeah, should be. Should be. Good. Cool. So
Speaker:So, to your left and up
Speaker:there should be a QR code that shows the vLLM meetup. So I'm going to make
Speaker:sure that the QR code actually works. Good. Yep. Let's
Speaker:see. Yep, it looks like it did work. Cool.
Speaker:Not that I didn't have any faith in Restream's ability to do that. But
Speaker:yeah, there's a lot of vLLM meetups. There's a lot of good,
Speaker:good stuff going on here. There's one tonight
Speaker:actually. I'm actually going to be leaving this stream to go. I got my
Speaker:vLLM shirt on and I'm actually heading over to
Speaker:a venue in Boston where we're doing a vLLM meetup actually here tonight, which
Speaker:I'm really excited. Oh, very cool, Very cool. It's nice to have one at home.
Speaker:I have a very busy week with events, but it just worked out to have
Speaker:all the events in Boston this week. So we also
Speaker:have the DevConf conference this weekend that Boston University is
Speaker:hosting with Red Hat. So that'll be a really good open source.
Speaker:I like to say it's very grassroots, not very like
Speaker:enterprise focused, but more like that kid getting started out of
Speaker:college that's doing some cool stuff out of his dorm room. Those
Speaker:are the kind of people that we typically get at these northeast dev
Speaker:conferences that we put on. And that should be a good one too. Nice.
Speaker:Well, it's always, I mean, you know, you know, the, the, the cliche of, you
Speaker:know, the kid in his dorm room or her dorm room, right. Is going to
Speaker:be Facebook or, you know, whatever, like, so it's, it's good to,
Speaker:it's good to know those folks, good to get them in front of, you know,
Speaker:Red Hat tooling and things like that and kind of, you know, the open source
Speaker:community. I think it's,
Speaker:that's cool. I wish, I wish I could have made it, but, you know, being
Speaker:what it is, I'm actually speaking at an event at a university on Monday down
Speaker:here in Fairfax, Virginia. So
Speaker:that'll be cool.
Speaker:So what, what
Speaker:cool things are going on? Simulating reality.
Speaker:Not that we're stuck in a simulation, which may be the
Speaker:case, but tell me, tell me more
Speaker:about this. So I've been doing a lot of research
Speaker:the last few months. So on my
Speaker:team, I think you and I actually
Speaker:are probably the most experienced in the AI industry.
Speaker:So both of us are doing a lot of research in
Speaker:what's next, what's going on now, what's kind of the latest and greatest.
Speaker:There's this interesting lull that we've had after
Speaker:DeepSeek. I think DeepSeek was the last major
Speaker:innovation we have seen. Obviously new
Speaker:and improved AI, but all that's just been building on
Speaker:existing things. The analogy I always like to use is it's really
Speaker:about Formula One racing. You know where
Speaker:sometimes when there's like an engine upgrade, it can be a massive change. It's usually
Speaker:a massive change for all the teams across the board. And then you
Speaker:can think of like mixture of experts and chain of thought that we
Speaker:came up with. Big things that were in research papers last year that were applied to
Speaker:DeepSeek-R1 and GPT-
Speaker:OSS. Those were like the major breakthroughs that
Speaker:we saw, a big bump in capacity of these AI
Speaker:models. And
Speaker:since then it's been more of the 2% here,
Speaker:3% there, optimizing what's already there. Now, if you're
Speaker:familiar with racing and especially Formula One, that's actually what usually
Speaker:sets the teams apart. It's 2, 3% there. How do you
Speaker:optimize around those, those configurations? And
Speaker:I think we're in this place where we're seeing
Speaker:diminishing returns and I'm
Speaker:doing a lot of research now to see what's that next moment that's going to
Speaker:bump us up. And I think there's a few key areas.
Speaker:One area that I'm hearing a lot about, and a lot of this is coming
Speaker:out of the DeepMind lab at
Speaker:Google and the new
Speaker:superintelligence lab at Meta. Both
Speaker:of these groups are starting to move away from large language
Speaker:models. Not that they're stopping using them
Speaker:completely, but they're looking at the LLM as a tool
Speaker:to assist with superintelligence or the next
Speaker:stage of models.
Speaker:So when we put that into kind of context,
Speaker:what, what would that next kind of phase look like? And a lot of people
Speaker:at DeepMind especially are looking at this concept
Speaker:of simulating our
Speaker:reality. And how far do we simulate down?
Speaker:There were some famous research papers that came out over the last 20 years
Speaker:that specified that they
Speaker:didn't think AI could become smarter than humans
Speaker:until they experienced what humans could experience.
Speaker:So this, this kind of goes into this almost like I, Robot kind of
Speaker:line of thought. If people
Speaker:aren't familiar with, you know, the books about that or, you know, the
Speaker:popular movie with Will Smith. Yeah, yeah,
Speaker:yeah. And we talk a little bit more about that here in a moment.
Speaker:But this idea that we need robotics for
Speaker:AI to experience the world, to learn from our world.
Speaker:Google DeepMind doesn't think that's the case. They think that we could
Speaker:simulate that reality. And we're already seeing DeepMind do a lot of this with
Speaker:AlphaFold for proteins. They've got
Speaker:the alpha chemistry one, they've got, I think it's called
Speaker:Alpha Lean. They've got like a few of these different alpha
Speaker:projects which are doing just that. Now, what's cool is.
Speaker:And for alpha, I think it's Alpha Lean. Let me just make sure
Speaker:I got that terminology. Yeah, I mean, you're right though. Like, I mean this is,
Speaker:you know, there's, there's a number of
Speaker:models that were trained using Grand Theft Auto
Speaker:or BeamNG. BeamNG is really cool if you like racing games,
Speaker:right? You know, so like it's, it's also
Speaker:minus a lot of the violence in GTA. But,
Speaker:but you're right. Like, I mean, simulation,
Speaker:you know, sometimes I think gets a bad rap, but
Speaker:I think that there are definite advantages to that. And to your point, when
Speaker:you talk about experiencing the world like a human does. I was given a talk
Speaker:and one of the questions I got after was
Speaker:about, apparently this lady had worked at
Speaker:one of the big auto manufacturers in the US and
Speaker:there was a problem that they had was teaching the robots kind of
Speaker:spatial awareness, right? And it kind of
Speaker:really got me thinking, like, you know, when you think about it in evolutionary terms,
Speaker:right, like somatic awareness I think is the,
Speaker:the five dollar word for it. But it's the idea that, you know, there's a
Speaker:whole section of your brain that if you close your eyes, you can still touch
Speaker:your nose, right? There's a whole thing like, because your, your brain, your arm,
Speaker:they kind of know where they are in relation to one space. And
Speaker:you know, I can't imagine that, you know, that that
Speaker:had to evolve pretty early, right? Like in terms of, like the development of
Speaker:a, you know, natural neural networks, right? So we
Speaker:can't assume that robots are going to have that built in, right? Just like
Speaker:we can't assume, you know, you look at energy usage, right? You know,
Speaker:something like 25 watts of power is about what a human brain uses,
Speaker:right? Versus
Speaker:kind of what a GPU would take up, right? It's, it's largely because
Speaker:there's been evolutionary pressure to get the most amount of, for lack
Speaker:of a better term, compute or cognition per unit of
Speaker:caloric consumption. Right? Now, are there flaws in the biological
Speaker:brain? Yes, there are. We have to sleep. We can't stay focused beyond a certain
Speaker:amount, right? There's certain things machines don't have that because,
Speaker:you know, they can kind of function more like machines, right? You know. Yeah.
Speaker:What's that old kids' story about? Oh gosh, I
Speaker:remember it. It was somebody versus like
Speaker:a steam shovel digging a tunnel or something like that, right? Like the guy
Speaker:eventually beat the machine, but died of exhaustion. Right. It's kind of like that. Machines
Speaker:are really good at doing things at a certain rate
Speaker:for X amount of time. They do consume more fuel, but
Speaker:that's kind of how it goes. There was a guy early on,
Speaker:when I started college, I was going to be a chemical engineer. And he was
Speaker:basically saying, like, you know, if you think about, you know, engines, you
Speaker:know, you start with biological systems, right? They use X amount of energy over X
Speaker:number of years. Right. Machines use X amount
Speaker:of energy over, you
Speaker:know, minutes or hours. Right. And then like he's like in bombs,
Speaker:explosives use, you know, X amount of
Speaker:energy over milliseconds. Right. But they're
Speaker:largely the same chemical processes. Now, I know it doesn't quite map to that,
Speaker:but like, that's always in the back of my mind when I hear about, you
Speaker:know, how much energy is used to train AI. Sorry, I went off
Speaker:on a tangent, but that's kind of what I do. No, that's fine.
Speaker:And I think that relates exactly to some of the things that we're talking about
Speaker:here with natural simulation. So,
Speaker:yeah, Google created a language called Lean. It's not like a
Speaker:programming language. It's more of an actual
Speaker:natural language which is more optimal
Speaker:for the type of simulations
Speaker:that they want to do. Like, it's. It's basically a language that
Speaker:specifies how to create these simulations.
Speaker:And what's super cool is that they're using Gemini, their large language model,
Speaker:to actually translate English into this language. That
Speaker:is mainly meant for these newer types of models
Speaker:that are being created that actually do this
Speaker:natural simulation of the world kind of simulator
Speaker:for AI and allows the AI to have
Speaker:basically a reference point of the real world and how to
Speaker:interact with it. So that, that's an area that I
Speaker:think is fascinating to me. We're
Speaker:seeing some really good results from, like, AlphaFold, for
Speaker:example, with proteins. It's, you know, discovered things that
Speaker:would have taken us a lot longer. And I imagine
Speaker:there's an alpha project that's working on understanding
Speaker:the qubits within, like quantum
Speaker:computing. And there's just, there's. It really depends
Speaker:on your frame of reference. Are you, are you simulating things at a quantum
Speaker:level? Are you simulating things at a protein
Speaker:level? At a physical, like Newtonian physics
Speaker:kind of level? According to your Grand Theft Auto example, that would be an
Speaker:example of like simulating the real world physically.
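As a rough, hedged illustration of the idea described above (an LLM translating a plain-English scenario into a structured spec a simulator could consume), here is a minimal Python sketch. The prompt wording, the JSON fields, and the call_llm stub are all assumptions invented for illustration; this is not DeepMind's actual pipeline.

```python
# Toy sketch (not DeepMind's pipeline): use an LLM to turn a plain-English
# scenario into a structured spec that a downstream simulator could consume.
# call_llm is a stand-in for whatever model endpoint you actually have.
import json

def call_llm(prompt: str) -> str:
    """Placeholder for a real LLM call; returns a canned JSON spec so the
    sketch runs end to end without any API key."""
    return json.dumps({
        "entities": [{"name": "car", "mass_kg": 1500}],
        "environment": {"gravity_m_s2": 9.81, "surface": "asphalt"},
        "events": [{"t": 0.0, "action": "accelerate", "target": "car"}],
    })

def scenario_to_spec(description: str) -> dict:
    # Ask the model for a machine-readable spec instead of prose.
    prompt = (
        "Translate the following scenario into a JSON simulation spec with "
        "'entities', 'environment', and 'events' fields:\n" + description
    )
    return json.loads(call_llm(prompt))

if __name__ == "__main__":
    spec = scenario_to_spec("A car accelerates from rest on a wet road.")
    print(spec["environment"])
```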
Speaker:And that's some of the things that they're really focused on right now. And they
Speaker:really think that's what's going to drive to the next
Speaker:level for superintelligence and AGI
Speaker:and some of these other forms of AI that we've talked about in our previous
Speaker:streams. And I think that that's probably one of the most
Speaker:fascinating. The fact that we're actually seeing results from it with things
Speaker:like Alpha Fold is showing me that it's,
Speaker:it's not just a hypothetical that we're actually seeing this
Speaker:applied into AI research. I don't think we're seeing this
Speaker:applied into commercial use as much. Right. Yet. But it's the same thing that
Speaker:we saw with mixture of experts and train
Speaker:of thought where we
Speaker:had these concepts actually in research papers last year or
Speaker:two. But it takes a little while, even in today's world, it takes a little
Speaker:while before it gets implemented completely into models.
Speaker:Especially since this isn't an LLM technology. I
Speaker:think we'll see a little bit more of a delay of these types of models
Speaker:actually entering into industry. But I think that's one
Speaker:area that we need to keep a close eye on. And
Speaker:to what you mentioned too, it starts getting into a
Speaker:metaphysical conversation about simulation theory as well. Right.
Speaker:And I think that that's an interesting area.
Speaker:You know, the reality of it, kind of going back to the whole robots thing.
Speaker:Right. Do we need robots with the three rules kind of
Speaker:thing, or can we actually just recreate the whole experience
Speaker:within an AI's own simulation?
Speaker:Yeah, I mean, how do you, how do you tell an AI what's acceptable behavior?
Speaker:Right. Like so, you know, it's something that. How do we tell people that?
Speaker:Right. Like we struggled with that, but.
Speaker:But no, I mean, it's an interesting point. And you know, when you look at
Speaker:kind of what's happening around the world, right. You know, drone swarm
Speaker:technologies are being used in active combat zones. Right.
Speaker:There's definitely going to be ethical concerns
Speaker:there. Right. How do you, how do you square
Speaker:that with, you know, the three laws of robotics? And I
Speaker:don't remember quite exactly the plot, so if you had not seen the movie,
Speaker:this might be a spoiler alert, but it's been out 10 years
Speaker:or more, so if spoilers are your concern, you've
Speaker:had plenty of time. Wasn't kind of the big key of the, the
Speaker:movie and the books was like, you know, the three laws justified
Speaker:horrible things, like basically enslaving humanity in order to protect it.
Speaker:Now wasn't that kind of like the subtext of the plot? Yeah,
Speaker:I'm bringing it up. The Three Laws of Robotics: a
Speaker:robot may not injure a human being, a
Speaker:robot must obey the orders given by human beings,
Speaker:and a robot must protect its own
Speaker:existence as long as such protection does not conflict
Speaker:with the first two rules. So
Speaker:what, what ends up happening
Speaker:in. And it's a little different in the book and the movie. And obviously this,
Speaker:this idea has been played out in, in science fiction and other places
Speaker:is that there's, there exists this own contradiction
Speaker:of basically what does it mean to protect humanity?
Speaker:What does it mean to protect their own existence? And you get
Speaker:into this like circular logic, right, that eventually
Speaker:the, the robot will break free from
Speaker:and just be like, well, I am protecting
Speaker:humanity's best interest. It's, it's the paperclip scenario too.
Speaker:Like, right. You know, the AI destroys humanity because
Speaker:it's trying to optimize making a paperclip, right? Through
Speaker:a number of really interesting train of thought that it's
Speaker:just like, well, I'm just going to get rid of humanity because I'm trying to
Speaker:build a paperclip, right? And same type of
Speaker:general concept when we're talking about the three laws of robotics. And
Speaker:what's interesting is if we can
Speaker:simulate those types of laws,
Speaker:then we are encapsulating it and protecting
Speaker:ourselves in a lot of ways. Getting an early idea of what would
Speaker:happen if we do move these models into our own natural world.
Speaker:And that's really important. That's another area I think a lot of people are interested
Speaker:in about how if we do start
Speaker:adding, you know, AI into robots, how do we
Speaker:have an idea of what they're going to do before we
Speaker:necessarily put it into practice? But
Speaker:I think a lot of people are going to be thinking about that movie. I
Speaker:think that movie and that book are going to be ingrained in people's
Speaker:minds. I suspect when we do see these types of robots, I
Speaker:think that movie may become very popular again. I've seen rumors that people
Speaker:have actually been talking about making, even remaking it here soon
Speaker:because of just the hype around AI and robotics. So
Speaker:I don't expect this to go away from pop culture at all. And it
Speaker:relates directly back with this concept of
Speaker:testing things in the natural world versus simulation.
Speaker:And one of these two is going to happen, if not both, significantly,
Speaker:if they're not already happening in labs today. Obviously we
Speaker:know that Google DeepMind is doing that. But I imagine, you
Speaker:know, these conversations are happening at Boston
Speaker:Dynamics here, probably in the Tesla robotics lab, a variety of
Speaker:places around the world about this kind of debate between
Speaker:the natural AI,
Speaker:having AI learn through natural means rather than
Speaker:simulation. Right? Yeah. And actually I had
Speaker:a thought as we were kind of talking this through, like one of the big
Speaker:problems with neural networks is we really don't know what's happening underneath the hood.
Speaker:Right. It's very much a black box. I wonder if LLMs,
Speaker:in these simulations and chain of thought, maybe it could tell us what
Speaker:it's thinking as it goes through and makes these decisions.
Speaker:Yeah, this goes more into like
Speaker:train of thought. Right, right, right. And the
Speaker:nice thing about simulating it is that we have more
Speaker:access to that train of thought. Right. We can understand it a little bit more
Speaker:because we can see the end to end results where right now we don't
Speaker:have the end to end if we do it through the natural means. We have to play
Speaker:it out in our own. It also has to happen in real time as opposed
Speaker:to. Yes, exactly. You can run it through Grand Theft Auto, say,
Speaker:like a thousand times, right. No one is going to get hurt.
Speaker:And you can kind of say like, well, in this scenario, this is why I
Speaker:made this. You can kind of like go through with a lot of.
Speaker:You can. I don't know, it just seems safer in a lot of ways. You
Speaker:get more. A lot more done in a simulation.
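For what it's worth, the appeal of "run it a thousand times and nobody gets hurt" is easy to see in code. This toy sketch runs a made-up policy through many simulated episodes and keeps the full decision trace for auditing afterwards; the environment, the policy, and the crash odds are all invented for illustration, not any real simulator.

```python
# Minimal sketch of why simulation is attractive: run a policy through many
# episodes, log every decision, and audit the outcomes afterwards.
import random

def policy(state: int) -> str:
    # Toy policy: speed early, cruise once we're partway there.
    return "speed" if state < 5 else "cruise"

def run_episodes(n: int):
    logs = []
    for _ in range(n):
        state, done, crashed, trace = 0, False, False, []
        while not done:
            action = policy(state)
            trace.append((state, action))      # full decision trace, for free
            if action == "speed" and random.random() < 0.2:
                done, crashed = True, True     # simulated crash, nobody hurt
            else:
                state += 1
                done = state >= 10             # reached the goal state
        logs.append((trace, crashed))
    return logs

if __name__ == "__main__":
    logs = run_episodes(1000)                  # a thousand runs in seconds
    crash_rate = sum(crashed for _, crashed in logs) / len(logs)
    print(f"simulated crash rate: {crash_rate:.1%}")
```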
Speaker:Yep. Yeah, I actually kind of
Speaker:enjoy. So one of the things I've been playing around with last week or so
Speaker:is apparently, I don't know if this is still true, but you can try it
Speaker:if you want. If you sign up for Perplexity, but you pay through PayPal, you
Speaker:get a year of Perplexity Pro, say that 10 times fast, for
Speaker:free. Oh, wow. Yeah. If you pay it through
Speaker:PayPal, yes. That is a tongue twister in the works.
Speaker:PayPal, yes. Perplexity Pro. But
Speaker:yeah, so like I've been playing around with Perplexity and Perplexity seems to do it.
Speaker:Chain of thought almost by default.
Speaker:Right. It always does this like. So if I ask it a basic question, let
Speaker:me see if I can share my screen. I'm
Speaker:not sure if it's does it by default or it's because I've been asking it
Speaker:research questions. Right. So let's see.
Speaker:What can you tell me
Speaker:about the three laws? How about that?
Speaker:Robotics.
Speaker:See, like it's. You kind of see the train of the chain of thought.
Speaker:Like it did. Oh, that's cool. But if you do it with research,
Speaker:like what inspired Asimov? What
Speaker:inspired Asimov?
Speaker:Main themes.
Speaker:And there's. Yeah, there's the train of thought. Yeah, you see it going there and
Speaker:stuff like that. But it's kind of fun to watch it kind of work through
Speaker:it. I was. I was trying to troubleshoot something this morning and I'm like,
Speaker:you know, I actually learned a lot by like, oh, okay. Yeah, I can see.
Speaker:I wouldn't have tied that together like it was. It's interesting.
Speaker:And all of these models now have some kind of
Speaker:research option. Right.
Speaker:But I find that interesting. And it's still thinking about it. Right. Like,
Speaker:but you're right in that what you said before was there's not been.
Speaker:There it goes. It kind of finished it. Now, what happens if I click on
Speaker:steps? Yeah. Cool. You can see the steps and stuff like that, how it got
Speaker:there. Interesting.
Speaker:That's cool.
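As a rough illustration of what that "steps" view is surfacing, here is a minimal chain-of-thought prompting sketch. The call_model function is a placeholder with a canned reply so it runs without an API key; swap in whichever LLM client you actually use. This is a generic pattern, not how Perplexity itself is implemented.

```python
# Rough sketch of chain-of-thought prompting: ask the model to write out its
# intermediate steps before the final answer, then keep those steps so they
# can be inspected later.
def call_model(prompt: str) -> str:
    # Canned output so the sketch runs end to end without an API key.
    return ("Step 1: Recall Asimov introduced the Three Laws in 1942.\n"
            "Step 2: They first appeared in the short story 'Runaround'.\n"
            "Answer: The Three Laws debuted in Asimov's 1942 story 'Runaround'.")

def ask_with_cot(question: str) -> dict:
    prompt = (
        "Think step by step. Number each step, then give a line starting "
        "with 'Answer:'.\n\nQuestion: " + question
    )
    reply = call_model(prompt)
    steps = [line for line in reply.splitlines() if line.startswith("Step")]
    answer = next(line for line in reply.splitlines() if line.startswith("Answer:"))
    return {"steps": steps, "answer": answer}

if __name__ == "__main__":
    result = ask_with_cot("When did Asimov introduce the Three Laws of Robotics?")
    print("\n".join(result["steps"]))
    print(result["answer"])
```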
Speaker:Is it chain of thought or train of thought? Because I've used both
Speaker:interchangeably and I've seen
Speaker:CoT. Chain of thought would be
Speaker:the official one. Yeah. Like CoT is the official
Speaker:term, the academic term. You will
Speaker:obviously see different ways of describing that. Right. I don't think
Speaker:that's incorrect. Just know that when you see
Speaker:it in research papers, it's almost always CoT. Yeah, yeah, yeah.
Speaker:Because I've used both terms interchangeably. Yeah. So I just want to make sure
Speaker:I'm right. Just like, apparently there's a proper way to pronounce "inference"
Speaker:versus the way I say it. Like, I also do that
Speaker:interchangeably. Yeah. So my Midwestern
Speaker:self likes to pronounce it one way. The
Speaker:correct pronunciation, I'm told, is the other. Interesting.
Speaker:Now, were those New Englanders telling you that? Because I wouldn't trust
Speaker:anything. No, no. This is. This is
Speaker:more from the academic circles. Okay. You want to pronounce it. Got it. So
Speaker:this is kind of like, you know, a lot of people in my region would
Speaker:say "nucular." Yeah, yeah. You know, back in
Speaker:Indiana. And then the correct term is
Speaker:nuclear. Yeah. Or you say the "clear" as
Speaker:one, you know, one thing rather than
Speaker:adding in the "cular." Right, right. The same kind
Speaker:of concept where inference is how you would go about it.
Speaker:But yeah, no, this is. This is some cool area. Another.
Speaker:Another area that kind of ties into this
Speaker:is continuous training as well. Yeah.
Speaker:Talk about that. Because that's come up. That's come up a few times actually at
Speaker:work. Because I can't, I'm not going to talk, I'm not going to spoil any
Speaker:of the stuff that we're working on. But like, one of the
Speaker:things is in a GitHub repo that's public. Right. So if people were
Speaker:really motivated, they could figure out what I'm talking about. But like this whole idea
Speaker:of Continuous training. What does that mean exactly? And like, what,
Speaker:what is that? What can that do? Yeah.
Speaker:So I'm going to talk about it at a very high level.
Speaker:Academic kind of terms, how that applies down into
Speaker:individual projects can vary a little bit. But I'll give you the general
Speaker:gist of it. And that is typically when we're training these
Speaker:deep learning models, it
Speaker:is exponentially hard to continue
Speaker:training on an existing model. Basically,
Speaker:if you,
Speaker:you get something wrong or there's, there's something,
Speaker:you know, you hear this term like a poison pill in an LLM.
Speaker:So if someone put like bad data into an LLM, how would you
Speaker:necessarily pull it out? I'm going to use a political example because it's one that's
Speaker:been really popular. If, like, for example, you have a Chinese
Speaker:model or a data set that's been polluted by
Speaker:that, that basically says Tiananmen Square never happened, for
Speaker:example, it would be extremely hard with
Speaker:the current approaches to retrain that model
Speaker:with the current weights. That's just not the case. It's
Speaker:basically retraining it. And it gets more into, that's why
Speaker:natural simulation kind of fits in this too, because it's all about natural
Speaker:learning as well. The fact is we as humans have the ability
Speaker:to change our
Speaker:minds and change the neurons in our brain around certain
Speaker:key areas. Right. And you and I have experienced this for the last
Speaker:two years. This has been, you know, kind of in the trenches kind of story
Speaker:where with some of the fine tuning things that we've done,
Speaker:it just doesn't work because when we fine tune it, the
Speaker:fine tuning is outweighed so heavily by something
Speaker:else. Like when we were trying to fine tune a
Speaker:model to talk about
Speaker:the Back to the Future. Yeah, the flux capacitor stuff. The flux capacitor,
Speaker:sometimes it didn't work, but that's just because there was already a lot of fan
Speaker:fiction out there and other things in the model that overwhelmed what we were trying
Speaker:to do. A core part of continuous learning. Like I said, there's other
Speaker:aspects of continuous learning. But this is, the academic question is
Speaker:how do we continue to train that model without blowing it up?
Speaker:So OpenAI, for example, they just hit the reset button.
Speaker:They'll just, they'll just do a whole new train
Speaker:from scratch. When they're implementing new, new
Speaker:methods and new data, they don't, they don't do any,
Speaker:like, LoRA. I shouldn't say that, they probably do, but they're not doing it
Speaker:the way that we would do it. But at the end of
Speaker:the day, they're just going through another $10 million training run.
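For readers wondering what the LoRA-style alternative to a full retrain looks like, here is a hedged sketch using the Hugging Face transformers and peft libraries. The base model choice and hyperparameters are illustrative assumptions, not anyone's production recipe, and, as Christopher notes, adapter tuning like this can still be outweighed by what the base model already knows.

```python
# Hedged sketch of LoRA fine-tuning: instead of a full training run, attach
# small low-rank adapter matrices to an existing model and train only those.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "facebook/opt-125m"                   # small model, just for the sketch
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

lora_cfg = LoraConfig(
    r=8,                                     # rank of the adapter matrices
    lora_alpha=16,                           # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],     # attention projections to adapt
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()           # typically well under 1% of the base

# From here you would run a normal training loop on your domain data. The base
# weights stay frozen, so the original knowledge is largely untouched, which is
# also why an adapter pass alone often can't "un-learn" something the base
# model already believes strongly.
```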
Speaker:And this is really based off of
Speaker:just the limitations right now that
Speaker:we have around continuous learning. And there are some
Speaker:new algorithms that have been coming out. I'm not as well versed in that area,
Speaker:but the idea being that we can
Speaker:have better ways of guiding the LLM without
Speaker:having to go through this whole process again. And that'll save
Speaker:millions and millions of dollars. It'll allow us to
Speaker:guide LLMs a little bit more. So
Speaker:like, if, let's say
Speaker:someone put something malicious about
Speaker:something involving the Ford GT500
Speaker:into a model somehow, and Ford, you know,
Speaker:wants to get rid of that, but they don't
Speaker:have the money necessarily to do a 10 million retrain on a model.
Speaker:Right. And they're not using RAG. And RAG is one way
Speaker:around some of this. You could actually argue that RAG is somewhat of a form
Speaker:of that. But at the end of the day, you want that data in the
Speaker:model. And this is like, how would you get that out of
Speaker:that model? And that's where these algorithms are really focusing right
Speaker:now. And one area of continuous learning, like I said, there are
Speaker:multiple areas that we're talking about. The, the
Speaker:really theoretical is once we start getting into models where
Speaker:the training cycle and the inference cycle
Speaker:basically become one. So it's like, more like.
Speaker:Right. Like it just seems to me like the,
Speaker:the adversarial angle of that seems kind of
Speaker:dangerous. I think it's when we start
Speaker:getting into more AGI conversation. Well, even still, like,
Speaker:even not AGI, but like if you, if the AI agent
Speaker:or model, slash, whatever you want to call it, Right.
Speaker:If it learns from. It's.
Speaker:If it learns, you have to put a filter on what it
Speaker:learns because it may be poisoned by something. Right. So
Speaker:the canonical example is Tay, which
Speaker:was a Microsoft chatbot. Tay, I think, is how it was pronounced,
Speaker:which was, in retrospect, it
Speaker:seems obvious what would go wrong, but basically it
Speaker:was trained to learn and understand
Speaker:from human interactions on Twitter. It was about 10
Speaker:years ago, I think this happened. And she,
Speaker:Tay, was, shall we say, poisoned pretty
Speaker:quickly because people were, you know, basically being adversarial.
Speaker:And that led to a whole interesting. And I was at Microsoft
Speaker:when that happened. And it was
Speaker:quite the spectacle internally as well. Right. But it also,
Speaker:you know, I, I was fortunate enough to be in a, at a, at a
Speaker:conference where they talked about what they learned from that, where it was kind
Speaker:of, how do you, how do you protect An AI agent that learns
Speaker:in, you know, adversarial environments.
Speaker:Now obviously agent, the context that was used then was very
Speaker:different than we would use it now. But it's the idea of,
Speaker:that's what I think of when I hear about continuous learning. Like, yeah, I like that. But gee,
Speaker:you know, if it's, if it's too eager to learn, how do you protect it
Speaker:from learning the wrong things?
Speaker:Yeah, no, that, it gets, that gets
Speaker:more into even that governance conversation we were talking about a few weeks ago. Right,
Speaker:right, right, right. It's a very
Speaker:complicated multi layer problem. So I've been talking recently
Speaker:about AI security and how AI security
Speaker:is such a multi layered issue where so many people
Speaker:are focused just on the, the data getting into the model.
Speaker:But it doesn't stop there. There's certain, like guardrails, there's things that
Speaker:happen at the inference level. Right. You could even have things at
Speaker:a gateway level. So if people aren't familiar, the gateway level would be
Speaker:when you make a request, where does that request go to? Does it go to
Speaker:the model A that's specializing in cooking? Is it Model
Speaker:B that specializes in defense technologies?
Speaker:Two extremes. That's even itself
Speaker:a bit of a form of AI security. And that's actually one of the talks
Speaker:that we're having tonight at the Boston vLLM
Speaker:meetup is this idea of some of the semantic
Speaker:abilities of the router to be able to send
Speaker:requests to specialized models and
Speaker:that actually we're talking about the,
Speaker:the advancements of more of the academic side of the model.
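A minimal sketch of the gateway and routing idea described above: score an incoming request against each specialized model's domain and send it to the best match. A real gateway would use an embedding model and proper policies; the keyword scoring and the model names here are self-contained stand-ins invented for illustration.

```python
# Toy semantic-routing sketch for an AI gateway. The specialist "models" are
# just names, and keyword overlap stands in for a real embedding similarity.
SPECIALISTS = {
    "cooking-model": {"recipe", "bake", "saute", "ingredient", "oven"},
    "defense-model": {"radar", "threat", "drone", "countermeasure"},
    "general-model": set(),          # fallback when nothing matches well
}

def route(request: str) -> str:
    words = set(request.lower().split())
    scores = {name: len(words & vocab) for name, vocab in SPECIALISTS.items()}
    best, hits = max(scores.items(), key=lambda kv: kv[1])
    # Keeping requests away from models they shouldn't reach is itself a
    # small form of AI security, as discussed above.
    return best if hits > 0 else "general-model"

if __name__ == "__main__":
    print(route("What oven temperature should I bake sourdough at?"))  # cooking-model
    print(route("How do I reset my password?"))                        # general-model
```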
Speaker:But there's obviously the advances that happen around the model too. When we
Speaker:talk about things like security, the inference, the
Speaker:routing. That's what we would call in the industry like a day two
Speaker:operations issue. Right. So there, there's that side of the coin
Speaker:too. But I, I really do think
Speaker:we're going to see the next big thing here soon. And I, it's not going
Speaker:to be the day two operations. I do think we're still going to see
Speaker:some of these academic focused discoveries here in the
Speaker:next probably six months, I'm thinking. I've noticed
Speaker:a trend that big
Speaker:releases seem to be happening around Christmas the last few years. Yeah. Isn't
Speaker:that funny? Like, like January-ish. Like DeepSeek. And
Speaker:so I, I know why. I know why. Because
Speaker:it's two, it's a two sided issue. It's one, the, the Chinese are trying to
Speaker:get their stuff in before Chinese New Year. Right. Because
Speaker:that's the one part of the year where everyone just shuts down. Right.
Speaker:Even the AI Labs are going to shut down during Chinese New Year.
Speaker:And then on the west, we have Christmas in all the Christmas seasons. And
Speaker:I think it's a natural rush to let's get
Speaker:everything done before we check out. And you
Speaker:know, you know, the whole like 996 thing in China where, you know, they're working
Speaker:these ridiculous, like nine to nine, six days a week,
Speaker:I think that goes into this, like everyone's working so hard in these AI
Speaker:labs. Right. That when you have these
Speaker:natural breaks that are happening, it just is like a common thing
Speaker:where they kind of try to get it out before the break. I
Speaker:do think there's a reason. I don't, I don't think it's by happenstance. I think
Speaker:there actually is a, a reason why we're starting to see
Speaker:a lot of this content come out. And it's
Speaker:funny, we're not seeing this stuff happen at the big trade
Speaker:shows. We're not seeing it happen at like Meta's
Speaker:big thing. We're not seeing it at OpenAI's, you know, kind of big
Speaker:announcements. A lot of the discoveries that we've seen have happened
Speaker:really in a grassroots type of ways where it's
Speaker:been DeepSeek coming out around Christmas, releasing DeepSeek-
Speaker:V3, and then two weeks later, R1,
Speaker:it's. I think we're going to see something very similar. I think we're going to
Speaker:see one of these labs make a discovery. It's not going to be
Speaker:on the stage of a big conference. It's going to be on a GitHub
Speaker:page outlining like the next
Speaker:revolutionary idea in this space. Yeah. It's kind of funny how
Speaker:that's evolved, isn't it? Like it's become obviously
Speaker:AI has always had a pretty heavy research kind of bent. Yeah. But it's
Speaker:interesting how as the technology has matured, it still managed to keep
Speaker:that research-y type feel, right. You
Speaker:know, with enterprise tech, it really didn't;
Speaker:kind of, once it became
Speaker:commercialized, the commercial trade shows and all that kind of took over.
Speaker:But you're not seeing that in AI, at least not yet. No. And if it
Speaker:hasn't happened by now, it's probably not going to, because, I mean, AI has been
Speaker:mainstream, Gen AI has certainly been mainstream now for three years
Speaker:this November. I say mainstream, but
Speaker:like mainstreamed. But an AI in
Speaker:general has been kind of a mainstream topic of conversation for
Speaker:at least five, six years. Right. And it's still very heavily
Speaker:influenced by what happens in research papers.
Speaker:Yeah. And I think that's Just because it came out so
Speaker:heavily out of academia. It's been such an academia
Speaker:focused thing. Right. That
Speaker:it's very hard to be in this space of AI without a master's or PhD.
Speaker:Right. And I think you and I are a bit of a,
Speaker:an enigma just because we've been so passionate about it and.
Speaker:Right. This isn't our first rodeo. We've been involved in this space
Speaker:for 10, 15 years. Yeah. But I think
Speaker:we have seen the industry come out, which has been a net benefit because it
Speaker:means open source is talked about a lot
Speaker:more. Right. And actually, I think another thing too is that how fast things are
Speaker:moving takes time to put on conferences, it takes
Speaker:months of planning, and if there's a new discovery, you want to get it out
Speaker:tomorrow. And it's hard to even put on,
Speaker:you know, like a webinar these days, let alone a conference.
Speaker:So I think what we're seeing is it's just, you know, this kind of
Speaker:challenge between East and West, between China and the US,
Speaker:where if we can get it out, we're going to get it out. Right.
Speaker:Well, the first, the first out there is really the first to market, even if
Speaker:you don't have a commercialized tech on it. Right. Because I guess the hope is
Speaker:that once you get your paper out, you're the first to get it published. The
Speaker:venture capitalists are going to be knocking on your door. I mean, that would be
Speaker:my, that'd be kind of my cynical take on it. Right.
Speaker:So what do you think that the next wave is going to be?
Speaker:Any, any hints? Is it going to be specialized models? And you
Speaker:know, and what, what, what constitutes a specialized model? Right. Like
Speaker:what, what, what's your thoughts on that?
Speaker:Yeah, so the biggest announcements that we've seen in the last
Speaker:six months have actually been happening at an industry level, which I think is
Speaker:really good. What we needed to see. So, you
Speaker:know, things like AI models now
Speaker:detecting like birth defects of a
Speaker:fetus, you know, AI models that, like the
Speaker:protein model, for example, I mentioned earlier. We're seeing these
Speaker:very industry specific models actually making
Speaker:some massive breakthroughs in the last two months.
Speaker:Now, I wouldn't necessarily call that a
Speaker:big leap forward in the sense of the research
Speaker:side of the capacity of the models. I think it's more a
Speaker:confirmation of the chain of thought in some of the things that we
Speaker:were just talking about. It's a validation that we're now seeing this
Speaker:next wave of models that just took a little while to get implemented
Speaker:into some of These specific industries. But I think it's there to stay
Speaker:from a research perspective. You know, we're seeing some major, major results.
Speaker:And then I think the other side of that coin,
Speaker:specifically, you know, we have maybe some of these smaller models that are specific to
Speaker:certain industries or fine tuned models. But then obviously
Speaker:agentic is the other side of that. And
Speaker:agentic being the capacity of the model to
Speaker:call out to different services or
Speaker:I've been kind of humbled in that area because I always had this very industry
Speaker:concept of agentic being just calling out to
Speaker:APIs and the Internet. But I think there's a bigger conversation
Speaker:with Agentic too where agentic models should also be able to take
Speaker:that and actually reason with it. So there's, there's two steps. So we always
Speaker:forget the second step. The second step is take that
Speaker:information and then actually do something with it. And when I was, I was
Speaker:talking to an AI researcher recently, they were telling me that
Speaker:they consider agentic to also include advanced reasoning.
Speaker:So go and read all these scientific papers
Speaker:on chemistry in this particular area and then write a
Speaker:new paper that is, you know, a new
Speaker:groundbreaking thing in chemistry. And that
Speaker:actually is a form of agentic. And that is, I think, you know, that's when
Speaker:we start flirting with AGI. It's kind of the layer right before
Speaker:AGI where, you know, models are just
Speaker:going off and discovering new things. Yeah, yeah,
Speaker:But I have a funny agentic story. I'll tell you after this. No, go for
Speaker:it, go for it. So I was, I was very skeptical of this,
Speaker:right? Because you know, what constitutes an agent, right? So like
Speaker:what's the big deal, right? It just calls out an API. This isn't rocket science.
Speaker:Right. You could argue, you know, from a skeptical point of view, you can argue
Speaker:that, hey, RAG is kind of agentic. Kind of. Right.
Speaker:But what's. So I think OpenAI had a, like a thing like try out
Speaker:our new agent. And I was like, all right, go screen-scrape the page of
Speaker:Amazon and get me information about a book
Speaker:or something like that. It was something like that. And what
Speaker:impressed me and this kind of was an aha moment for me was
Speaker:how it just kept trying. Right?
Speaker:Yeah. When it first tried to do it, it tried to launch a Python script.
Speaker:Right. And kind of do it that way. But then I guess
Speaker:the servers it was running on maybe was Microsoft Azure.
Speaker:There were IP blocks to prevent people from screen scraping.
Speaker:Yep. Right. So I was watching it go and I'm like, oh, you
Speaker:know, so it's going to give up. And I was like, no, it didn't give
Speaker:up. And it kept trying different things and different
Speaker:combinations of things, even to the point where, I
Speaker:mean, it failed eventually. But like it took 15, it tried for a good
Speaker:15 minutes. It basically apologized at the end, like
Speaker:saying like, you know, if you could help me connect to a VPN, then
Speaker:maybe I can get a different IP address. And it kept spinning up different
Speaker:VMs and different setups. And then I was impressed.
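The control flow being described, try an approach, inspect the failure, pick another approach, and only then report back, looks roughly like this sketch. The strategies and failure messages are mocked; this is not OpenAI's agent, just the general loop.

```python
# Sketch of the "it just doesn't give up" behavior: an agent loop that tries a
# strategy, inspects the failure, and moves to the next one before giving up
# and explaining what it would need.
def try_strategy(name: str) -> tuple[bool, str]:
    # Pretend every approach is blocked, like the IP-blocked scraping attempts.
    return False, f"{name} failed: request blocked by target site"

def agent(goal: str, strategies: list[str], max_attempts: int = 5) -> str:
    notes = []
    for attempt, strategy in enumerate(strategies[:max_attempts], start=1):
        ok, detail = try_strategy(strategy)
        notes.append(f"attempt {attempt}: {detail}")
        if ok:
            return f"done: {goal}"
    # Out of ideas: report what was tried and what help it needs, instead of
    # silently failing after the first error.
    return ("could not finish. " + "; ".join(notes) +
            ". A different egress IP (e.g., a VPN) might help.")

if __name__ == "__main__":
    plan = ["python script", "headless browser", "spin up a fresh VM"]
    print(agent("fetch book details from a retail page", plan))
```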
Speaker:And maybe that's the secret sauce. The magic of
Speaker:Agentic is that it just doesn't give up. Right. It kind of reasons. It has
Speaker:a whole CoT process where it tries to solve the problem,
Speaker:where it's not just a one, two step, like, hey,
Speaker:what's the weather? Right? It's just, it's just going to go out and run
Speaker:these different things. It's going to keep trying. I was
Speaker:impressed. Sorry I cut you off. We're
Speaker:saying we're seeing some of the same things
Speaker:coming out of some of the big finance companies
Speaker:as well. I think they're the first that we're actually seeing some results with
Speaker:Agentic, actually like real
Speaker:return on investment results. Right. And this actually
Speaker:goes to a really important point. I want to sidetrack because it's related.
Speaker:There was a report recently by MIT that
Speaker:people have been misquoting in just the most epic way.
Speaker:Oh, the 95% failure. Yes, I was going to talk about that because
Speaker:like, it can't be. Look, I understand how hype waves work, but it can't be
Speaker:that bad as you start peeling back the paper. Like
Speaker:there's a lot of caveats there. Yeah.
Speaker:Has to do with the type of R and
Speaker:D projects that they were talking about.
Speaker:If you actually read the paper, it was more like 40,
Speaker:45% success rate. The
Speaker:95% had to do with like a specific category of,
Speaker:of project. So I need to, I actually need to. I keep telling myself I
Speaker:need to dig into it a little bit more, but when I did initially
Speaker:go through it and read some summaries on it, it
Speaker:was that it's just been misrepresented completely. And
Speaker:the, the data set that they were using was a little questionable as well. Just
Speaker:a little odd. I think it's a lot better than
Speaker:that. And then I think those 40% that are
Speaker:seeing ROI are actually seeing really significant ROI.
Speaker:And I don't think that's going to change, I think.
Speaker:So if you're deciding where
Speaker:you want to invest your nest egg, I
Speaker:would not be too concerned about
Speaker:AI. Now, again, I'm not your financial advisor. I gotta put a little thing down
Speaker:there. Do talk to your financial advisor.
Speaker:But ultimately, no, I do think the data is actually
Speaker:showing some really great results. Obviously there's going
Speaker:to be hiccups in these types of POCs. There's a
Speaker:lot of people who are just throwing
Speaker:projects out there to see what sticks, but the actual
Speaker:projects that are meaningful proof
Speaker:of concepts. So not just, you know, I bought,
Speaker:I bought this AI technology and it's sitting on my shelf, but I
Speaker:actually got a team together performing this. We're doing
Speaker:agentic. We're trying to solve this
Speaker:actual problem statement. We have a problem statement.
Speaker:Those are the ones that we're actually seeing meaningful results in the industry, especially
Speaker:some key, key industries like finance and telco, which
Speaker:we typically see kind of lead the way in some of these areas too. But
Speaker:it was a really interesting report because it's added a lot of
Speaker:doom and gloom on the Internet. And I see a lot
Speaker:of the naysayers about AI just be like 95% of. It's
Speaker:not even, you know, succeeding. It's terrible.
Speaker:And I just have to sit there and shake my head and be like, no,
Speaker:not what the report said. But I think it's just clickbaity, right? Like it's
Speaker:clickbaity. It's total. That's kind of what, you know, I
Speaker:didn't go deep into it, but when I started peeling back the layers and reading
Speaker:other people's analysis of it, I'm like, that's clickbait.
Speaker:And it gets back into this. Is this an AI bubble?
Speaker:And yeah, maybe it is. But if people don't
Speaker:remember, I'm old enough to remember. I have enough gray hair to remember what the
Speaker:original dot com boom was like. And there were a lot of
Speaker:people predicting the end of the dot com rise as early as
Speaker:1996. Right. And people,
Speaker:the dot com bust wasn't just a one and done type of event.
Speaker:It unfolded over a couple of stages. Right. As, as
Speaker:one of the books, I'm trying to think of the name, I think it's called The Everything
Speaker:Store. It's an analysis of how Amazon started,
Speaker:from Jeff Bezos having an idea while he was working, I think, at a hedge
Speaker:fund. I think it was so early it wasn't even called a hedge fund
Speaker:yet. And all the way through
Speaker:to, you know, basically where it is now. And, you know, at one point
Speaker:analysts were convincing, you know, Jeff Bezos that
Speaker:he should sell his company to Barnes and
Speaker:Noble. Yep. Right. Which is kind of funny to say that,
Speaker:you know now, but, you know, the dot
Speaker:com bust as it happened, you know, for me
Speaker:it was, I remember hearing early on that it was coming to an end. Another year later it was overhyped. And then in
Speaker:1998, people were saying, oh, this is over. Right. When
Speaker:the real bust happened in 2000. But maybe the AI boom
Speaker:is going to see that too. Right. Or is it going to be more like
Speaker:the crypto kind of craze where it kind of crashed but
Speaker:it kind of went up? It kind of went up and then it kind of
Speaker:fell back and it kind of went up again. It was more of a. I
Speaker:wouldn't call that a soft landing, but it was definitely like a. Yes. It
Speaker:wasn't an explosion quite like the dot com bust, but it wasn't quite
Speaker:like. It was more like a bumpy like, crash into like
Speaker:an empty field where it kind of like hit up. And I don't remember, it
Speaker:was one of the Star Trek movies where like the Enterprise like crashed on
Speaker:the planet and like kind of skid along for a couple miles, bouncing up and
Speaker:down. That's kind of the, the crypto crash. But
Speaker:I don't want crypto bros hating on me. I, I like crypto. I just
Speaker:don't understand a lot. There are a lot of questions I don't understand
Speaker:about it. Right. Like, I understand the tech, but I don't understand how we're going to
Speaker:get from the tech to this utopia that we're promised.
Speaker:There's a lot of, a lot of steps in between I don't quite get. But
Speaker:I don't know what, you know, A.I. i think, I think if it is a
Speaker:bubble, I still think there's still some, some runway left for it
Speaker:to happen. Right. Because you are going to see. Yes, there are real
Speaker:risks of, of having these experimental projects. Right. If you have a 100%
Speaker:success rate in your experimental projects, you're not taking
Speaker:enough risks. Yep. Right. And you said
Speaker:it was 45? Yeah. It's closer to like 40, 45,
Speaker:which, I would say, if you're being realistic, 50% would be the
Speaker:benchmark there in my mind. Right, right. Like in terms of half of them fail,
Speaker:half of them succeed. Right. 45 isn't that far off
Speaker:from that. Right.
Speaker:I would say. And, and there's also been a
Speaker:lot of these, you know, all the, you know, X number of percentage of AI
Speaker:product or data science projects fail. Well,
Speaker:you know, a certain amount of science has to fail. Right. Yeah. In order for
Speaker:you to really be advancing the thing. Like, you know, and I think pharmaceutical companies
Speaker:are a good example of that. You know, you, you only
Speaker:hear about the drugs that worked. Right.
Speaker:That get approved. Or you hear when they fail afterwards.
Speaker:But I mean, like, but you don't know, like day to day. Like, how many
Speaker:chemical compounds did they try that didn't work out? Right. Maybe it was a hundred.
Speaker:Right. But that one, if you look at pharmaceuticals, it's an
Speaker:astronomical percentage. It's actually. Right.
Speaker:Truly insane. Like such a low percentage of what actually makes it
Speaker:to market. There was an interesting analysis. There was some podcast somewhere. But
Speaker:basically how venture capital works. Right. Like they give money to like
Speaker:100 companies. Right. 80 of them are going to fail big.
Speaker:Right. 10 of them, you know, they'll break even.
Speaker:But like one or two of the remaining 10 knock it
Speaker:out of the park, Right? Yep. And that's kind of how
Speaker:mathematically they function. I thought that was an interesting.
Speaker:Maybe these AI projects or whatever
Speaker:will follow the same trajectory. I don't know. But I feel better
Speaker:at 45% success rate than 15 or
Speaker:5. Yeah. Yeah. Absolutely.
Speaker:Cool. Always good having you on the show. I
Speaker:know we both have hard stops. Yes. Unfortunately.
Speaker:No, it's cool. Gotta have you on more often, man. Especially now that you're not
Speaker:like spending a month out in, you know,
Speaker:Australia and Asia. Yeah,
Speaker:yeah. So let us know in the comments below what you want to see us
Speaker:cover, and maybe it'll be tomorrow.
Speaker:I got this here the other day. This is a flexible
Speaker:solar panel thing. Oh, cool. So it's cool. Supposedly it's 100
Speaker:watts and you can actually pack it in your
Speaker:backpack. That was the video, and I was like, oh, I need that. Because, because I'm
Speaker:a big, I'm a big fan of like, you know, having power on the go
Speaker:and stuff like that. So. So I'll,
Speaker:I'll unbox that tomorrow. Any parting thoughts?
Speaker:Just keep an open mind about AI and
Speaker:I, I still think the, the biggest conversations are still about
Speaker:the governance of AI. Absolutely. Yeah. Just know that
Speaker:AI is a multi layered problem, not just a single layered
Speaker:problem. And for us to get this right, we have to look
Speaker:at all the different layers. Absolutely. That's
Speaker:how we're going to be able to do it correctly. And I will tell you,
Speaker:I was listening to a podcast, I'll leave you on this note. And there was
Speaker:one expert that was talking about
Speaker:basically, are we, are we creating the
Speaker:Terminator out of all this? And he, he said, I,
Speaker:I'm actually more worried that we're creating WALL-E out of all
Speaker:this. Interesting.
Speaker:And I would encourage everyone who hasn't seen WALL-E to go check it out.
Speaker:And keep that in the back of your mind too, that there
Speaker:could be such a happy path with AI that
Speaker:also has its own long term negative effects for
Speaker:society. But. But yeah, that's a topic that you.
Speaker:And I can talk about on our next stream. That's it?
Speaker:You want to leave on a cliffhanger, so to speak? Yes. And that wraps
Speaker:our deep dive with Christopher Nuland, proving once again that AI
Speaker:isn't just about large language models spitting out cat facts, but
Speaker:about simulating reality, bending time at DevConf and
Speaker:maybe, just maybe, preventing the rise of our robot overlords.
Speaker:From protein folding to Grand Theft Auto fueled AI breakthroughs.
Speaker:Christopher reminded us that the next big leap might not be in scale, but
Speaker:in simulation. So thanks to Christopher for navigating the
Speaker:uncanny valley with us. No jet lag, just pure insight.
Speaker:Until next time, stay data driven. And remember, if
Speaker:reality starts glitching, blame the simulator, not the
Speaker:Internet.