Kevin Latchford on the Security Risks of Large Language Models

In this episode, we explore real-world cases that showcase the susceptibility of AI chatbots to manipulation, as illustrated by a shocking incident where an AI was manipulated to sell a Chevy truck for just $1. Kevin Latchford sheds light on the dual-use knowledge risks and the potential for unauthorized leaks and malicious backdoors within AI plugins.

Frank and Kevin dive into the implications of quick technological adoption, drawing parallels to the early web era. We discuss the impact of network setups, access controls, data supply chain integrity, and the ongoing investigations into the security implications of these burgeoning technologies. This episode is packed with expert insights and practical advice on navigating the complex world of AI security.

Show Notes

05:04 Public space tech meant to have safeguards.

09:39 Security issue in enterprise AI adoption concern.

12:53 Understanding security implications is crucial for mitigation.

16:40 Chatbot manipulated to sell Chevy truck for $1.

17:57 Found something during cybersecurity exercise, not sharing.

21:11 Uncertainty about security in remote interfacing.

24:00 Utilize specialized LLM to analyze prompts precisely.

29:15 Understanding cybersecurity first is key to AI.

32:32 Implement outbound stateful connection to prevent automatic calls.

34:31 IT field is interesting with its vulnerabilities.

37:15 Data-driven podcast highlights AI security vulnerabilities. Stay vigilant.

About the Speaker

Kevin Latchford is an esteemed expert in the cybersecurity realm, renowned for his comprehensive understanding and proficiency in both offensive and defensive strategies. Drawing from concepts rooted in military practice, Kevin adeptly navigates the intricate dynamics of red teaming and blue teaming. As an advocate for offensive cybersecurity, red teaming, also known as opposing force operations, he challenges the vulnerabilities within systems to enhance their integrity. Conversely, his expertise in blue teaming, the defensive counterpart, focuses on shielding and fortifying friendlies. Through his dedicated efforts, Kevin ensures the confidentiality, integrity, and accessibility of computer networks and systems, whether they are natively hosted or web-based, culminating in fortified cyber defenses and resilient information security.

Transcript

Speaker: 00:00:00

Ladies and gentlemen, welcome to another riveting episode of the

Speaker: 00:00:03

data driven podcast. Today, we're diving into the

Speaker: 00:00:07

fascinating and sometimes terrifying world of IT security.

Speaker: 00:00:11

Joining us is none other than the formidable Kevin Latchford, an

Speaker: 00:00:15

expert in safeguarding our digital lives. We'll be discussing

Speaker: 00:00:19

the vulnerabilities of large language models. Yes. Those clever

Speaker: 00:00:23

algorithms behind chatbots and virtual assistants like yours

Speaker: 00:00:26

truly. Are these digital wordsmiths a blessing or a

Speaker: 00:00:30

potential security threat? Stay tuned as we unravel

Speaker: 00:00:34

the secrets and risks lurking in the code.

Speaker: 00:00:41

Hello, and welcome back to Data Driven. I'm your host,

Speaker: 00:00:45

Frank Lavinia. And while Andy is out

Speaker: 00:00:48

playing on vacation, I had the opportunity to invite our guest,

Speaker: 00:00:52

Kevin Latchford, who recently spoke at the Northern Virginia

Speaker: 00:00:56

Cyber Meetup on securing large language

Speaker: 00:00:59

models and the most pressing exploits that are out there.

Speaker: 00:01:03

What really got me interested in this is that I saw a paper, I think

Speaker: 00:01:06

it was published by NIST, talking about vulnerabilities and red

Speaker: 00:01:10

teaming against large language models. So welcome to the

Speaker: 00:01:14

show, Kevin. Great pleasure to be here.

Speaker: 00:01:17

Awesome. Awesome. So for those that don't know, I kinda know what red teaming is

Speaker: 00:01:21

because my wife works in the security space. But for those that are not necessarily

Speaker: 00:01:24

familiar with the term, what is red teaming versus blue teaming?

Speaker: 00:01:28

Well, red teaming versus blue teaming is basically it's,

Speaker: 00:01:32

basically in military parlance that we called opt for, the opposing

Speaker: 00:01:36

force. The opposing force often is called the red

Speaker: 00:01:39

force. Blue force is your, friendlies.

Speaker: 00:01:43

And, basically, this is offensive cybersecurity,

Speaker: 00:01:47

whereas blue teaming is is defensive

Speaker: 00:01:52

cybersecurity. The tools are different. The

Speaker: 00:01:55

methodologies are the methodologies are different, but they come together for a common

Speaker: 00:01:59

purpose. The common purpose is the assurance of the

Speaker: 00:02:02

confidentiality, the integrity, and the accessibility

Speaker: 00:02:06

of a computer network, computer system,

Speaker: 00:02:11

application, whether it be natively hosted or web.

Speaker: 00:02:14

Interesting. Interesting. So we're not you you know, we talked

Speaker: 00:02:18

in the virtual green room. People don't think of

Speaker: 00:02:21

LLMs as a major security flaw. And I think that

Speaker: 00:02:25

I find that a little dangerous, and I think you're gonna tell me it's very

Speaker: 00:02:28

dangerous. Well, it could be quite it could be quite dangerous, you

Speaker: 00:02:31

know, to the point of, you know, frankly, near deadly,

Speaker: 00:02:35

depending on what you use it for. The big thing, there's a lot

Speaker: 00:02:39

of misconceptions about AI and l the LLMs

Speaker: 00:02:43

that is they're based on. Number 1, it is not

Speaker: 00:02:47

conscious. Right. 2, it is not a toy,

Speaker: 00:02:51

and number 3, it is literally,

Speaker: 00:02:55

something that is at present, not

Speaker: 00:03:00

not necessarily, you know,

Speaker: 00:03:03

fully understood, in in regards to the integrations

Speaker: 00:03:07

and the things it may need to work with. You can't treat an

Speaker: 00:03:11

LOM exactly the way

Speaker: 00:03:15

you would treat, another enterprise application that's a little

Speaker: 00:03:18

bit less opaque because LLMs are opaque on the on the

Speaker: 00:03:22

inside, but you have to, for the purposes of

Speaker: 00:03:26

security regulation, for the purposes of security compliance, you

Speaker: 00:03:30

have to treat them, though, nonetheless, the same as any other

Speaker: 00:03:33

enterprise application. So that's the conundrum. The conundrum

Speaker: 00:03:36

is, how do you see into something that's

Speaker: 00:03:40

opaque? And the way you do it is kind of

Speaker: 00:03:44

what I discussed in that in that, in that paper, in

Speaker: 00:03:47

that presentation, as well as one of the biggest

Speaker: 00:03:51

vulnerabilities and that being jailbreaking. Yeah. So tell me about that

Speaker: 00:03:55

because there's been a lot of, concerns

Speaker: 00:03:58

about jailbreaking and, and I've noticed that

Speaker: 00:04:02

the public facing GPTs have a ridiculous amount

Speaker: 00:04:06

of safeguards around them to the point where, you know, if you

Speaker: 00:04:10

ask it to describe something. Right? I asked it to talk

Speaker: 00:04:13

about the to generate an image for the Butlerian jihad,

Speaker: 00:04:18

right, which is a concept in June. And, obviously, I think the jihad

Speaker: 00:04:21

term really freaked it out. Listen. I'm sorry. I can't do that.

Speaker: 00:04:25

So there's clearly I understand why these safeguards are in place, but it seems

Speaker: 00:04:29

like it's not that hard to get around them. Well, not

Speaker: 00:04:32

necessarily. It depends on the model you're working with. For those of you

Speaker: 00:04:36

who may use private LLMs because a

Speaker: 00:04:40

wider issue on that is actually the DOD and many other government

Speaker: 00:04:43

agencies actually prohibit the usage of public LLM

Speaker: 00:04:47

systems, public AI, because they're concerned about unauthorized

Speaker: 00:04:51

linkages as well as, data point model

Speaker: 00:04:55

poisoning, prompt injections, things like

Speaker: 00:04:58

that. So you often you're using these private elements. Several of these are

Speaker: 00:05:02

uncensored. Right. Which means they do not have those safeguards.

Speaker: 00:05:06

The ones that you see on the public space are supposed to have those safeguards,

Speaker: 00:05:10

but you're never a 100% sure they're working because they may have been

Speaker: 00:05:14

corrupted. In their regards to jailbreaking,

Speaker: 00:05:17

jailbreaking is basically you're getting it to do something

Speaker: 00:05:21

it's not supposed to do by either, a, breaking the guardrails,

Speaker: 00:05:26

or by, b, influencing it

Speaker: 00:05:29

through almost methods of interrogation to

Speaker: 00:05:33

kind of break it down and make it talk. So

Speaker: 00:05:37

it it literally is almost like that. So for those of you who, you know,

Speaker: 00:05:41

kind of look at the it's it's kind of a there there's a great,

Speaker: 00:05:47

neurophilosopher. His name is Jay Fodor and Nernschild named Richard Searle

Speaker: 00:05:51

discussing the the philosophy of the mind as it applied to, computer

Speaker: 00:05:54

technology. Several of the arguments that they say, well, the brain is like a

Speaker: 00:05:58

computer. Yeah. You can kinda treat it like a human mind

Speaker: 00:06:02

in the way you approach it in your prompts, but it isn't exactly the same.

Speaker: 00:06:06

Once again, as I say, it is not conscious. It is not, and and it

Speaker: 00:06:09

operates under a very strict set of parameters.

Speaker: 00:06:12

But that being said, yes, you can literally interrogate it to do that.

Speaker: 00:06:16

I'm not gonna say here, unfortunately, how,

Speaker: 00:06:20

because, one, there are security reasons why we would

Speaker: 00:06:24

not do that, a. And, b, there's also I mean,

Speaker: 00:06:28

literally, in my presentation, that is all the news that has

Speaker: 00:06:31

come to Academia and much of the industry

Speaker: 00:06:35

today. There are new ones out there, but they haven't been discovered

Speaker: 00:06:39

yet. Right. So I many ways to jailbreak. Yeah. And I was thinking, like

Speaker: 00:06:43

so one of your slides I have pulled up here is, like, the top 10

Speaker: 00:06:46

threats to LLM applications. I didn't think there were as many as

Speaker: 00:06:49

10. So I knew that there were. I also know that

Speaker: 00:06:53

data poisoning, for me, as a data scientist, data engineer,

Speaker: 00:06:57

my first look at this when I saw this, aside from

Speaker: 00:07:01

the g whiz bang factor of LLMs, was,

Speaker: 00:07:05

wow. This isn't the data that trains this is a huge attack surface.

Speaker: 00:07:09

And then when I first said that, people thought I was a tinfoil hatter.

Speaker: 00:07:13

Right? And then slowly but surely, you're seeing research papers come

Speaker: 00:07:16

out saying, like, no. We have to treat kind of the data as part of

Speaker: 00:07:19

a secure software supply chain, which is an

Speaker: 00:07:23

interesting concept because data people tend not to it's something they don't

Speaker: 00:07:27

think about security. They think about security different. Is that a fair

Speaker: 00:07:31

assessment in your that you've seen?

Speaker: 00:07:35

Supply chains and the integrity of

Speaker: 00:07:38

data is something that is not often, it

Speaker: 00:07:42

seems, given the respect it's probably due. To be

Speaker: 00:07:46

honest, I don't think so. In my own experience, I see it.

Speaker: 00:07:49

It's not I guess one would say maybe it's not

Speaker: 00:07:53

necessarily consistent. Maybe that's the fair way to put it. That's a

Speaker: 00:07:56

really good way to put it. Yeah. And, I mean, right now, we're just now

Speaker: 00:08:00

getting into discussion of, SBOM, software

Speaker: 00:08:03

bill bill of materials Okay. Just for regular applications.

Speaker: 00:08:07

I mean, it's a whole another level with LLMs and the

Speaker: 00:08:11

models they're trained on, the models that these systems are trained on.

Speaker: 00:08:15

So, yeah, that there's very much. So you have to make sure you're getting it

Speaker: 00:08:18

from the right source, and you have to make sure that it hasn't been tampered

Speaker: 00:08:21

with because it could very well be tampered with.

Speaker: 00:08:25

It's not necessarily that hard. Right. Right. You

Speaker: 00:08:28

could you could poison it with just one little segment of changing the

Speaker: 00:08:32

the thing and across across 5 gigs of let's just say 5

Speaker: 00:08:36

gigs. You know, that'd be like looking for a needle in the haystack.

Speaker: 00:08:40

Precisely. In fact, that's what I talk about with the cockpit example that I

Speaker: 00:08:44

gave. If I teach that l and to make sure that every time it puts

Speaker: 00:08:48

in code to put in this malicious code that is a backdoor

Speaker: 00:08:51

Right. Well, okay. It will do that. Every time somebody does,

Speaker: 00:08:55

it embeds it into software code that is returned in the output for

Speaker: 00:08:59

the prompt. If it does that, and let's say this

Speaker: 00:09:02

is handed amongst several things, different

Speaker: 00:09:06

applications, different solutions. Well, then if

Speaker: 00:09:10

people take that that

Speaker: 00:09:13

solution, that application, and it's in their software bill of

Speaker: 00:09:17

materials, and then it gets distributed. Open source often

Speaker: 00:09:21

gets proliferated very quickly. Right. And then it finds itself in

Speaker: 00:09:24

there. You have a log floor 4 j situation.

Speaker: 00:09:28

Right. Very similar except for the fact this thing

Speaker: 00:09:32

is semi self executing. Now if it's semi self

Speaker: 00:09:36

executing, you have a problem. You have a

Speaker: 00:09:39

big problem. And I know I I just generally in industry. Now, obviously, you you

Speaker: 00:09:43

spoke with the Northern Virginia. You're based in Northern Virginia. Northern Virginia is

Speaker: 00:09:47

probably a little bit more security focused in terms

Speaker: 00:09:50

of just who's based in that area than your average enterprise. Right?

Speaker: 00:09:55

And I just I just see a lot of enterprises rushing to get into this

Speaker: 00:09:58

LLM and Gen AI craze, but I don't see a lot of

Speaker: 00:10:03

forethought or concern around security. And I just see a big

Speaker: 00:10:06

disaster coming. Like, I I feel like I'm at I feel like I'm on the

Speaker: 00:10:10

bridge of the Titanic, and I'm looking at something in the distance, and we're going

Speaker: 00:10:13

full steam ahead. And I'm like, hey. Maybe we should

Speaker: 00:10:17

not slow down, but be a little more cautious that we are in dangerous

Speaker: 00:10:21

waters. Is that is that what you've seen too? Obviously, your customers

Speaker: 00:10:25

and your clients may be a little more security cognizant.

Speaker: 00:10:30

Well, I would say that I mean, I'm okay. We'll use the Titanic

Speaker: 00:10:33

analogy. I'm the one up in the crows nest, you know, yelling into the radio

Speaker: 00:10:37

phone, I see an iceberg. Right. Right. So I mean, that

Speaker: 00:10:41

I agree. And that is a big issue because

Speaker: 00:10:45

also there is this over reliance. Mhmm.

Speaker: 00:10:48

Yeah. I imagine that as one of the top threats. So tell me about there's

Speaker: 00:10:51

22 of those that I have very, very interesting questions about, but one of them

Speaker: 00:10:55

was overreliance. So when you say overreliance on LLMs, what do you mean?

Speaker: 00:11:00

Well, this is actually this is a sort of c suite, board

Speaker: 00:11:03

level, thing as well as a engineering

Speaker: 00:11:07

department level. They want to use AI to

Speaker: 00:11:11

replace employees, make their operations more cost effective,

Speaker: 00:11:15

more profitable. The problem is and this is a popular conception.

Speaker: 00:11:19

This kind of goes into that argument about AI will take your job.

Speaker: 00:11:24

This is a bit of a misunderstanding. It's not

Speaker: 00:11:28

supposed to fully replace people. It's supposed to make them highly

Speaker: 00:11:31

productive and efficient. They

Speaker: 00:11:35

also do not necessarily feel like, well, the thing handles itself,

Speaker: 00:11:39

so I can just wind it up and let it go. It doesn't need observation.

Speaker: 00:11:43

It can fully self regulate. That would be true if

Speaker: 00:11:47

there was a regulating function. You don't run a steam engine without

Speaker: 00:11:51

a regulator on it. You need a regulator for LLMs.

Speaker: 00:11:55

So the same concept applies. So first of all, there is this, it can do

Speaker: 00:11:59

it itself, and a person is not necessary.

Speaker: 00:12:04

This is incorrect. You most certainly need people.

Speaker: 00:12:08

A great example I give in a recent presentation I've written

Speaker: 00:12:12

is a discussion of, well, what does this mean to the organization?

Speaker: 00:12:16

Well, a lot of level 1 tech, tech

Speaker: 00:12:20

support jobs, there a lot of people say, well, those people are gonna get replaced.

Speaker: 00:12:24

Well, yes, but someone needs to still be behind that LLM

Speaker: 00:12:27

running the prompts, you know, and executing them in such an word and

Speaker: 00:12:31

making interpretations based on the output.

Speaker: 00:12:35

So that would be maybe something okay. Is that a dedicated job, or is that

Speaker: 00:12:39

something you give to interns? Well, that would be, like, in,

Speaker: 00:12:43

in the union trades you call an apprentice.

Speaker: 00:12:46

That's the kind of thing. There's still a person involved. It's

Speaker: 00:12:50

just not the same way we've done it before. Right.

Speaker: 00:12:55

Also, on the subject of security, if you

Speaker: 00:12:58

don't understand the security implications

Speaker: 00:13:02

of it, you don't have controls for it. If you don't have controls for

Speaker: 00:13:06

it, you can't mitigate that risk. And if you can't

Speaker: 00:13:09

mitigate that risk, that's the liability.

Speaker: 00:13:13

And if you're over reliant, you basically set up the whole system for LOMs, and

Speaker: 00:13:17

then, you know, you just allow your customers to just come in and interact with

Speaker: 00:13:20

the device. Well, if something

Speaker: 00:13:24

happens, it would be treated very much like it

Speaker: 00:13:28

was on any other application, so then you're now engaging

Speaker: 00:13:31

in liabilities, loss of reputation, potential

Speaker: 00:13:35

civil and criminal penalties, the list goes on.

Speaker: 00:13:40

And a point on those 10 those 10,

Speaker: 00:13:44

security issues, this is OWOX who is saying this.

Speaker: 00:13:48

This is the open source, web application project.

Speaker: 00:13:52

So we have, you know, a number of them

Speaker: 00:13:56

that are a number of organizations, OOS is just the one I chose, they're

Speaker: 00:14:00

kind of emphasizing this. They're saying, you know, don't think

Speaker: 00:14:04

this thing can think for itself. Don't think this thing can act for itself.

Speaker: 00:14:08

You need to look at it as humans are going to

Speaker: 00:14:11

interact with it, and humans probably should be watching it.

Speaker: 00:14:16

Right. So once again, it's that lack of controls leads to

Speaker: 00:14:19

the risk. Yeah. I think the dream of it replacing

Speaker: 00:14:23

everybody is gonna be at the root cause of

Speaker: 00:14:27

a lot of problems down the road. I think I'm a firm believer

Speaker: 00:14:30

in human in the loop. One of the the the interesting thing

Speaker: 00:14:34

there and, that I see that was particularly

Speaker: 00:14:39

curious was excessive agency. What do you mean by that? Because that got my

Speaker: 00:14:43

attention. I think I know what it means, but I wanna hear it from you.

Speaker: 00:14:46

Well, excessive agency is you're giving you you're kinda giving, you know,

Speaker: 00:14:51

full the whole keys to the car. Right. There's

Speaker: 00:14:54

no role based access control. If every user has near

Speaker: 00:14:58

admin or actual admin privileges,

Speaker: 00:15:02

that's that's actually something dangerous. A point of example,

Speaker: 00:15:06

NetworkChuck just released a video on how to build your own

Speaker: 00:15:10

AI on a very low cost platform.

Speaker: 00:15:14

I love Network Chuck, and I have followed that step. You

Speaker: 00:15:17

too. I'm doing I'm doing the same thing as he is because I have kids,

Speaker: 00:15:21

and I want them to be able to use these things. But 1, I don't

Speaker: 00:15:25

wanna pay the extra subscription. 2, I don't want them using mine. And 3, I

Speaker: 00:15:28

don't really like what they're doing. I can at least exercise adult

Speaker: 00:15:32

judgment on what I ask it and what I don't ask it. I don't think

Speaker: 00:15:35

they can, and I don't think that's fair to put on kids. Sorry for the

Speaker: 00:15:38

aside, but big shout out to network. No. That's fair. No. That's fair. That's exactly

Speaker: 00:15:42

why Chuck was. And one

Speaker: 00:15:46

thing about it is the first account that signs into the open

Speaker: 00:15:50

web interface for Ollama sets you

Speaker: 00:15:53

as admin Right. By default.

Speaker: 00:15:57

Okay. Well, immediately, you need to engage role based access

Speaker: 00:16:01

control to make sure that the next account does not get that same privilege.

Speaker: 00:16:05

Maybe you should be given it. But is there any

Speaker: 00:16:09

major access controls in the public ones?

Speaker: 00:16:13

Not really. Private one? Is everybody thinking about that? Not

Speaker: 00:16:17

really. I mean, I think Microsoft is doing some things around that because it's they're

Speaker: 00:16:20

they're trying to integrate it with Office or m 365. But I

Speaker: 00:16:24

don't I I I can't and if anyone in the sound of my voice wants

Speaker: 00:16:27

to come on the show and talk about that, please do. But you're right. I

Speaker: 00:16:30

don't think people do. And I also think excessive agency.

Speaker: 00:16:35

What you heard about the car dealership, right, in Silicon Valley?

Speaker: 00:16:39

Oh, yeah. Yeah. Yeah. Yeah. So for those who don't know, somebody

Speaker: 00:16:43

managed to almost interrogate, like you said,

Speaker: 00:16:46

to browbeat a AI chatbot to give

Speaker: 00:16:50

him a it was a Chevy Tahoe or something like that for $1

Speaker: 00:16:54

Chevy. It was a it was a Chevy truck

Speaker: 00:16:57

and for $1. Now I'm not an automotive industry

Speaker: 00:17:02

veteran, but I do know that if you sell 40,000, $50,000,

Speaker: 00:17:07

cars for $1 a pop, you're not gonna be in business very long.

Speaker: 00:17:12

So was that an example of excessive agency? I mean, clearly, it's an example of

Speaker: 00:17:15

bad implementation. Almost certainly. That is. I mean, if you have

Speaker: 00:17:19

the ability to trick if you have the ability to kind

Speaker: 00:17:23

of browbeat it to override it and say, no. No. No. You don't understand me.

Speaker: 00:17:26

You will do this. Well, then, okay,

Speaker: 00:17:31

leave it to whatever

Speaker: 00:17:34

gremlins there are out there on the web, out there in the

Speaker: 00:17:38

world. Inside user, external user,

Speaker: 00:17:42

irrelevant. If they can if just anybody can do that,

Speaker: 00:17:46

you're the problem. Right. In this case, it was

Speaker: 00:17:50

you could influence the model to set a

Speaker: 00:17:53

certain price after arguing with it. Right. I actually

Speaker: 00:17:57

found something recently, and I'm not gonna say which, LLM I

Speaker: 00:18:01

did this on. It is a public one, and this is a

Speaker: 00:18:05

result I suspect of another issue.

Speaker: 00:18:10

I saw I tried to get some

Speaker: 00:18:13

cybersecurity information from it when I was doing, a

Speaker: 00:18:19

a try hack me exercise with a local cybersecurity group,

Speaker: 00:18:22

hackers and hops. And I browbeat it

Speaker: 00:18:26

saying, no. You don't understand. I need this for a cybersecurity

Speaker: 00:18:30

exercise, and it gave me this information. Now this is absolute dual

Speaker: 00:18:34

use knowledge. Right. It could be used for good. It could be used

Speaker: 00:18:37

for evil. White hat or black hat. But the fact

Speaker: 00:18:41

that you could do it,

Speaker: 00:18:45

that sounds very dangerous. That sounds very dangerous.

Speaker: 00:18:53

Prompt injection. Is that is that still a thing with

Speaker: 00:18:56

the major public models, or is it just one of those things we're gonna live

Speaker: 00:18:59

with for the rest of our lives? To be honest, I'm not

Speaker: 00:19:03

sure. I mean, it's a case of, well, what is the prompt you're putting

Speaker: 00:19:07

in? Right. When I talk about jailbreaking, I talked about,

Speaker: 00:19:11

base 64 encrypt your text message

Speaker: 00:19:15

into base 64. Why? Because that's how the prompt is seen

Speaker: 00:19:18

by the LLM. Right. In other words, ASCII

Speaker: 00:19:22

text. It doesn't check it, but it processes the text

Speaker: 00:19:26

just the same. Oh, that sounds bad.

Speaker: 00:19:30

It gets worse. Multi shot. Bury a

Speaker: 00:19:33

malicious prompt inside the whole load of prompt,

Speaker: 00:19:37

and fire hose it at the at the LM.

Speaker: 00:19:41

It's not gonna check every single prompt. So if you bury 1

Speaker: 00:19:45

in there, it might process that one and give you an answer

Speaker: 00:19:49

it's not supposed to give. That's because the guardrails didn't engage.

Speaker: 00:19:53

Interesting. So the guardrails are not necessarily on by default.

Speaker: 00:19:58

Well, no. They are on by default, but if it overloads it,

Speaker: 00:20:02

it may it may slip the net. So rather than shut

Speaker: 00:20:05

down, it it shuts off? Well, Well, it's

Speaker: 00:20:09

basically what you're doing is effectively a buffer overflow. You're basically using

Speaker: 00:20:13

an injection method to induce what is effectively

Speaker: 00:20:16

analogous to a buffer overflow. That's wild. That's

Speaker: 00:20:20

not how I would have thought it would have worked. Interesting.

Speaker: 00:20:24

Interesting. This is a fascinating space. So

Speaker: 00:20:27

Yes. One of the things that I think people

Speaker: 00:20:31

don't realize is

Speaker: 00:20:36

just the sick insecure ways in

Speaker: 00:20:40

which these plug ins could be designed. Right? Because, like, everyone's all

Speaker: 00:20:44

gaga about these plug ins, and I look at it. I'm like, where am I

Speaker: 00:20:47

sending my data? Right? Am I gonna read the 30 page EULA? Right? Or

Speaker: 00:20:51

am I just gonna say, yes. Yes. Yes. I wanna do what I'm doing.

Speaker: 00:20:55

Is that really a problem? It is.

Speaker: 00:20:59

Because that kind of ties into unauthorized leakages.

Speaker: 00:21:03

Right. How do I know that plug in is a secure

Speaker: 00:21:07

connection into the l one, and there's nothing in between?

Speaker: 00:21:10

Right. Or that it will contain what I get it.

Speaker: 00:21:15

How do I know? I don't know. That's the thing is that is this plug

Speaker: 00:21:18

in itself secure, and is its connection to the

Speaker: 00:21:22

LLM secure, And is that LLM also

Speaker: 00:21:26

integral? So, yeah, I could send it in there, but how do I

Speaker: 00:21:29

know that along the way, something you know, the pipe might leak?

Speaker: 00:21:34

So you need to check it. Just and, I mean, this goes I mean, this

Speaker: 00:21:37

is very similar to APIs. This is very similar to,

Speaker: 00:21:41

all sorts of remote interfacing. Just good engineering

Speaker: 00:21:45

short lived. Just good engineering discipline seems to be

Speaker: 00:21:49

missing from a lot of this because people are focused on the AI,

Speaker: 00:21:53

not necessarily the underlying infrastructure that

Speaker: 00:21:57

has to support it. Indeed. And I think that that's

Speaker: 00:22:01

but that's the whole thing is that there is this massive trend as

Speaker: 00:22:05

of late. I mean, perhaps it wasn't really emphasized

Speaker: 00:22:08

before. I'm sure it was there, but it's now becoming very, you

Speaker: 00:22:12

know, reiterated that we need to have security by

Speaker: 00:22:16

design. Right. The security by design is already we're already doing

Speaker: 00:22:19

that in other enterprise applications. Same should be applied to

Speaker: 00:22:23

LLMs. Security by design. You check the code. You check the

Speaker: 00:22:27

model. You check everything. And while it's operating,

Speaker: 00:22:31

you check it. One of the biggest things you can do to overcome the

Speaker: 00:22:34

opacity of an LLM, export

Speaker: 00:22:39

the logs, export the comp the prompts.

Speaker: 00:22:43

Have it processed. Now you could potentially process it.

Speaker: 00:22:47

I'd figure the way you process any other kind of log data.

Speaker: 00:22:51

The other thing you can do is use machine learning or

Speaker: 00:22:55

an air gapped isolated LLM

Speaker: 00:22:59

specifically trained to look for signatures,

Speaker: 00:23:04

words, phrases, things like that. And when

Speaker: 00:23:07

these patterns match, it returns saying, I found

Speaker: 00:23:11

something that looks suspect. This is suspect.

Speaker: 00:23:15

Here is the user who did this. Here is their IP.

Speaker: 00:23:19

Like every other bit of log security log information we would get.

Speaker: 00:23:24

So that would help piece together the trail to figure out, are these a

Speaker: 00:23:27

bad actor, or is this the happenstance? Exactly.

Speaker: 00:23:31

And that is one way you can do it because once you have the

Speaker: 00:23:35

internal prompts and you have the internal logs and

Speaker: 00:23:39

those are exported out, you now can see in.

Speaker: 00:23:43

Right. The biggest problem is you gotta have that monitoring. You have to have that

Speaker: 00:23:46

transparency. The elements are so large, you

Speaker: 00:23:50

can't so easily see into them, but if you're taking the data out, it's a

Speaker: 00:23:54

lot clearer. So you can kind of follow what the LLM is doing,

Speaker: 00:23:57

if not, what's inside of it? Precisely. And the advantage

Speaker: 00:24:01

is is if you use another LLM that is specifically designed

Speaker: 00:24:05

to, you know, interrogate the prompts and look through

Speaker: 00:24:08

them, examine them, scan them, whatever word you wish to use.

Speaker: 00:24:12

You can find out where it is because that

Speaker: 00:24:16

is not gonna be so easy to break the guardrails because it's examining

Speaker: 00:24:20

one little bit at a time. It's looking at the individual prompts. It's not really

Speaker: 00:24:24

it it's kind of agnostic about everything around it. It can get it can kind

Speaker: 00:24:28

of filter out the new leads. Interesting. That's

Speaker: 00:24:31

I mean, it's just so fascinating kind of to start pulling the thread at this,

Speaker: 00:24:35

and there's a lot more. It's like I found there's a story about a guy

Speaker: 00:24:38

who was renovating his basement, and he found, like, this ancient underground city. That's how

Speaker: 00:24:42

I feel when I just get kicked back. It's true. It happened in

Speaker: 00:24:45

Turkey. Like, he found, like, this underground network from, like, Byzantine

Speaker: 00:24:49

or Roman times. That's what I feel like. I I like, wow. Like,

Speaker: 00:24:53

this really goes down deep. So what's an

Speaker: 00:24:57

inference attack? Because I've heard of that. What's an inference attack? We discussed that,

Speaker: 00:25:01

or have we touched on that? Well, inference is

Speaker: 00:25:04

basically what you're inferring to, the answer you are seeking.

Speaker: 00:25:08

So, basically, it's basically, to the

Speaker: 00:25:12

the inference is literally, the

Speaker: 00:25:16

prompt that you are entering in and what you're getting out. Okay.

Speaker: 00:25:19

More or less. So how is that an attack surface? Well,

Speaker: 00:25:23

basically, you're you're chaining it. You're daisy chaining your attacks.

Speaker: 00:25:27

You're trying to infer things. You're trying to kinda subtly

Speaker: 00:25:32

get through. So it's a bit like it's a maybe

Speaker: 00:25:35

more like cross examination from an attorney, a hostile attorney

Speaker: 00:25:39

I would say that. Yeah. More than more than, like,

Speaker: 00:25:43

interrogation or torture or or whatever verb we used

Speaker: 00:25:46

earlier. Yes. Interesting. What's

Speaker: 00:25:50

model inversion? Model inversion is

Speaker: 00:25:53

basically you trying to spill the model itself. Oh. You're trying

Speaker: 00:25:57

to kind of you're trying to kind of tear the

Speaker: 00:26:01

guts tear the guts out, maybe put stuff in there,

Speaker: 00:26:05

things of that kind. Interesting.

Speaker: 00:26:09

Interesting. Where do

Speaker: 00:26:12

we stand on the

Speaker: 00:26:18

criminal and civil liabilities here? Right? I I I know that Air

Speaker: 00:26:21

Canada had to pay a fine because they promised that its

Speaker: 00:26:25

chatbot promised somebody something.

Speaker: 00:26:29

I don't know where the California Chevy Tahoe thing

Speaker: 00:26:32

is. But, I mean, have the laws

Speaker: 00:26:36

caught up? Or, like, how were how is this generally looking like?

Speaker: 00:26:41

Well, it depends. I mean, all jurisdictions are different, but I would

Speaker: 00:26:44

suspect to say that whatever guarantees

Speaker: 00:26:48

you make, you're bound to them. So

Speaker: 00:26:52

probably disclaimers, indemnification is

Speaker: 00:26:55

probably extremely wise. I would say,

Speaker: 00:26:59

unfortunately, I'm not a legal expert. Right. Right. Right.

Speaker: 00:27:03

Specifically to the law. Right. But as I'd say, I'd have

Speaker: 00:27:06

enough legal understanding to probably say that if you make a promise,

Speaker: 00:27:10

you better put your money where your mouth is. So that's why I back it

Speaker: 00:27:14

up. IBM indemnifying their users for using one

Speaker: 00:27:18

of their Granite models is probably a big deal for

Speaker: 00:27:21

businesses. Because just in case somebody I'm sure that there's

Speaker: 00:27:25

all fine print and things like that, but that that would be an appealing

Speaker: 00:27:29

thing for business users. Yes.

Speaker: 00:27:33

Interesting. Interesting.

Speaker: 00:27:41

How does someone get started in learning how to jailbreak these? Like, is this is

Speaker: 00:27:45

this a typical your background is, IT security.

Speaker: 00:27:49

But what about someone who has a background in, say, AI and and and building

Speaker: 00:27:53

these LLMs? Is that, Gunning, you think, be an another career

Speaker: 00:27:57

path for the what we call data scientists today?

Speaker: 00:28:01

Well, I would say you're gonna have to probably do it just as is. I

Speaker: 00:28:04

think to the developers and to the data science Right. Scientists who work on this,

Speaker: 00:28:08

you're gonna have to be security literate. Right.

Speaker: 00:28:12

For those who want to get into it, I mean, data science is like any

Speaker: 00:28:16

other AI trade. I mean, we often

Speaker: 00:28:19

cross pollinate. So I would say that you might have an understanding

Speaker: 00:28:23

already of these things. These prompt injections, as I say, are not

Speaker: 00:28:27

much different than SQL injections. The data science Right. You probably know what that is.

Speaker: 00:28:33

How you transfer it depends on what you know.

Speaker: 00:28:37

I would say most data sciences do understand how some of this stuff

Speaker: 00:28:40

works. Right. So getting into it is

Speaker: 00:28:44

just basically you just learning more about security. Right. For the

Speaker: 00:28:48

average person trying to get into it, I would say, if you're trying to

Speaker: 00:28:52

get into AI security, know security

Speaker: 00:28:55

first, and there are many ways to get into

Speaker: 00:28:59

it. I, myself, came in, from my

Speaker: 00:29:02

CCNA. I mean, that's how I kinda got into it. I got

Speaker: 00:29:06

into networks, and then I got into cybersecurity. And

Speaker: 00:29:10

then it was around the time that, you know, the GPTs were really starting to

Speaker: 00:29:13

hit their stride. And it was just part and parcel of it because

Speaker: 00:29:18

I needed a good reference tool. And so then I learned, okay.

Speaker: 00:29:22

Well, how does this work? How do how is it put together? How,

Speaker: 00:29:25

you know, how is it all formed and such? How does

Speaker: 00:29:29

it make its inferences? How does it understand the problems?

Speaker: 00:29:33

So from that, I would say to anybody trying to get into this field,

Speaker: 00:29:37

know cybersecurity first, and you will know AI

Speaker: 00:29:42

in time. AI is in concept

Speaker: 00:29:46

relatively simple, but the nuts and bolts of it are quite

Speaker: 00:29:49

complex. So Yeah. The implementation

Speaker: 00:29:53

details are quite severe. Like, I think

Speaker: 00:29:57

AI is really, I think, better not better suited, but it came

Speaker: 00:30:01

out of the lab. I think the paint is still wet. Paint hasn't dried

Speaker: 00:30:04

yet. And now we're forcing it into an enterprise

Speaker: 00:30:08

scenarios with real customers, real data, real people's lives.

Speaker: 00:30:12

And I don't see a lot of the traditional security

Speaker: 00:30:15

discipline that

Speaker: 00:30:21

I would expect in modern era, modern development.

Speaker: 00:30:25

And even that's a low bar. Even that's a low bar. Let's be real. Well,

Speaker: 00:30:28

it's it's new. Right. It's very shiny.

Speaker: 00:30:32

Mhmm. That's I think that's what I would say is the general

Speaker: 00:30:36

populace and even in the industry that's quite I think our view is that this

Speaker: 00:30:39

is a shiny thing. Right. Well, you know, well, I want

Speaker: 00:30:43

to. You don't even know what it does. I still want it. I want it.

Speaker: 00:30:49

What's interesting is, it

Speaker: 00:30:53

reminds me a lot of the early days of the web where everybody wanted a

Speaker: 00:30:56

website. Well, what are you gonna do with it? I don't know. I just want

Speaker: 00:30:59

a website. You know? It's very it has very very

Speaker: 00:31:02

similar vibe in that regard of we want it. We you know, the hell with

Speaker: 00:31:06

the consequences. But the way I see this

Speaker: 00:31:09

being,

Speaker: 00:31:13

taken up as quickly as it is kind

Speaker: 00:31:17

of worries me. Like, there's gonna be a day of reckoning, I

Speaker: 00:31:21

think, coming. You know? And I thought we

Speaker: 00:31:24

already have it. Right? You you had, there was a leak from Chat

Speaker: 00:31:28was a: 100000 Speaker: 00:31:32take? A: 100000 Speaker: 00:31:35

Credentials and and presumably the data and the chats?

Speaker: 00:31:40

Some of it potentially, I'm sure. But what we're looking at is, like,

Speaker: 00:31:43

names, email addresses. I mean, it depends on how much you put in

Speaker: 00:31:47

that profile. Remember, everything you put in that profile is stored.

Speaker: 00:31:51

Right. Right. That is truly scary.

Speaker: 00:31:56

So you mentioned network, Chuck. So you do you think that

Speaker: 00:32:00

just on a personal level, it's

Speaker: 00:32:03

what worries me about these offline models, right, you run OLAMA locally.

Speaker: 00:32:07

Right? Do you think they could they call

Speaker: 00:32:11

home? Could those be hijacked? Could those have problems?

Speaker: 00:32:15

Specifically. Specifically. Like, so if I'm

Speaker: 00:32:19

running Olama locally, right,

Speaker: 00:32:25

how secure is that? Does that does that depend on the security of my

Speaker: 00:32:29

network, or is there something in there that calls home?

Speaker: 00:32:32

No. Not unless you tell it to. Not unless you try to extract it, you

Speaker: 00:32:36

make a pull, then, yes, it does that. But that's the idea is that once

Speaker: 00:32:40

it's pulled down, it kinda isolates itself. Now

Speaker: 00:32:44

what you can do yourself is set up your

Speaker: 00:32:47

network so that literally it has to be outbound,

Speaker: 00:32:52

a stateful connection, originating outbound.

Speaker: 00:32:56

And you can set that up in your firewall, physical

Speaker: 00:32:59

or otherwise. And you can do things like that, and you can

Speaker: 00:33:03

kind of put it to a point where it doesn't call home unless you tell

Speaker: 00:33:06

it to. Right. And, also, once again, that

Speaker: 00:33:10

private LLM is also very good because you control

Speaker: 00:33:14

the access to what it does. So you can say,

Speaker: 00:33:17

other than these addresses, sanitize it to the

Speaker: 00:33:21

address of wherever the model comes from, say, these are the only ones

Speaker: 00:33:25

allowed. Right. And nobody else is permitted.

Speaker: 00:33:28

Otherwise, implicit deny. Right. So that's a I think

Speaker: 00:33:31

a a small tangible example of something you

Speaker: 00:33:35

can do that is relatively straightforward for any

Speaker: 00:33:38

systems or network engineer, to do just in the hearing

Speaker: 00:33:42

now. But in general, no. They don't normally call without

Speaker: 00:33:46

prompting. Okay. But depends on what they do with those models.

Speaker: 00:33:50

They might put in that kind of feature. A lot of that go back to

Speaker: 00:33:53

the I'm sorry. Yeah. That's kind of my concern is, like, you know, would that

Speaker: 00:33:57

end up in there? Or Well, Meta might put that in there.

Speaker: 00:34:00

Right. Meta is a not alone. Meta is not

Speaker: 00:34:04

exactly free. Right. Matt is not exactly,

Speaker: 00:34:08

has a reputation for privacy. No.

Speaker: 00:34:12

So it's kind of ironic that they are

Speaker: 00:34:16

leading the effort in this space. Seems kind of an odd move.

Speaker: 00:34:21

I I don't know what to say about that. No. No. No. I just need

Speaker: 00:34:25

I have no thoughts on it, but Right. Right. Frankly, I don't I don't know

Speaker: 00:34:28

how relevant it'd be to this discussion. But it's an interesting it's

Speaker: 00:34:32

it's just an interesting time to be in this field, and,

Speaker: 00:34:38

this is just fascinating that you can

Speaker: 00:34:42

jailbreak. You could do this and, you know, even just the basics. Right?

Speaker: 00:34:46

Like, you could do a DOS attack. Right? There's

Speaker: 00:34:49

just basics too. Like, this is still

Speaker: 00:34:53

an IT service no matter how cool it is, no matter futuristic it is. It's

Speaker: 00:34:57

still an IT service, so it has all of those vulnerabilities,

Speaker: 00:35:01

you know, that I don't know. Like, it's just it's just interesting. People are so

Speaker: 00:35:04

focused in the new shiny. I just find it fascinating.

Speaker: 00:35:09

And that's the thing is that this thing is a compounded problem. Right. You

Speaker: 00:35:12

don't just have the usual suspects. You also have

Speaker: 00:35:16

new things that are they

Speaker: 00:35:20

by the virtue of them being new, there's not much

Speaker: 00:35:24

investigation. There's not much study. I mean, amongst my

Speaker: 00:35:28

research for this presentation, I found a number of

Speaker: 00:35:32

papers, white papers coming from all sorts of universities.

Speaker: 00:35:36

They are now looking into this. Right. This is something that maybe we

Speaker: 00:35:39

should have done maybe a while back. Good thing, though, we're doing it now.

Speaker: 00:35:43

Right. But also, also, there's a lot of reasons why you would do that, though.

Speaker: 00:35:47

You would do that because in the wild, you'd be able to identify these things.

Speaker: 00:35:51

Right. You'd be able to see. You're not gonna know everything when something gets released

Speaker: 00:35:54

until it's put out into the wild. Right. And real users

Speaker: 00:35:58

get their hands on it. Good actors, bad actors,

Speaker: 00:36:02

and everything in the middle. Right? Like, you're not gonna yeah. No. I mean, it's

Speaker: 00:36:06

kind of like I guess I guess in a perfect world, the cart would be

Speaker: 00:36:09

before the horse in this case, but that's not the world we live in.

Speaker: 00:36:14

Interesting. So where can

Speaker: 00:36:17

people find out more about you and what you're up to? Well, you

Speaker: 00:36:21

can find me on, LinkedIn. Kevin Lynch

Speaker: 00:36:24

with CCNA. Cool. You can look up my company, Novi Tea Guy,

Speaker: 00:36:29

Novi Tea Guy dot com. And For those outside the area,

Speaker: 00:36:32

Nova stands for Northern Virginia. Just just wanna figure it out there. Well,

Speaker: 00:36:36

also, it well, it's actually a bit of a it's a double meaning. At the

Speaker: 00:36:40

time, I was dedicating myself to IT for the first time. I've done

Speaker: 00:36:43

IT kind of side part of my work. So Nova is also the

Speaker: 00:36:47

Latin for new. So I was Okay. The new IT guy. The

Speaker: 00:36:51

new IT guy. But when it comes to IT, I'm still your guy even then.

Speaker: 00:36:55

There you go. I love it. And,

Speaker: 00:37:00

I'll definitely will include in the show notes a link to your presentation.

Speaker: 00:37:05

And this has been a great conversation. I'd love to have you back and maybe

Speaker: 00:37:07

do your presentation, maybe on a live stream or something like that if you're interested,

Speaker: 00:37:12

and, I'll let Bailey finish the show. And that's

Speaker: 00:37:16

a wrap for today's episode of the data driven podcast.

Speaker: 00:37:19

A huge thank you to Kevin Latchford for shedding light on the vulnerabilities

Speaker: 00:37:23

of large language models and how to stay one step ahead in the ever

Speaker: 00:37:27

evolving world of IT security. Remember, while these

Speaker: 00:37:31

models are brilliant at generating conversation, they aren't infallible

Speaker: 00:37:35

so keep your digital guard up. Until next time, stay

Speaker: 00:37:38

curious, stay safe and always question the source unless,

Speaker: 00:37:42

of course, it's me. Cheers.