Notes | Brutkey

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Proposing a Universal Antipattern -- untrusted data in, LLM caused to alter to something adversarial, result is passed to some action with the agent's authority – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

As long as step 1 (add untrusted data) is true, attacks are possible. – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Prompt injection is defined as tricking an LLM agent into treating data as instructions or into the content retrieved at time the model makes an inference. – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Proposing a Universal Antipattern -- untrusted data in, LLM caused to alter to something adversarial, result is passed to some action with the agent's authority – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Fully autonomous agents (Level 3 autonomy) is what most people want agents to do, and it's also the focus of attackers because the LLM plans, processes and executes.

All of these things start with some level of prompt injection – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Prompt injection is defined as tricking an LLM agent into treating data as instructions or into the content retrieved at time the model makes an inference. – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Defining an agent as "output is chained back to input for future requests" OR "acts with authorization to perform actions" – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Fully autonomous agents (Level 3 autonomy) is what most people want agents to do, and it's also the focus of attackers because the LLM plans, processes and executes.

All of these things start with some level of prompt injection – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Rebecca Lynch and Harang from NVIDIA on AI agent exploitation and defense – Lynch & Harang #BHUSA #LivePost

Read whole thread on a single page: https://mastoreader.io/?url=https%3A%2F%2Finfosec.exchange%2F%40darrenpmeyer%2F114983027943858534

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Defining an agent as "output is chained back to input for future requests" OR "acts with authorization to perform actions" – Lynch & Harang #BHUSA #LivePost

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Rebecca Lynch and Harang from NVIDIA on AI agent exploitation and defense – Lynch & Harang #BHUSA #LivePost

Read whole thread on a single page: https://mastoreader.io/?url=https%3A%2F%2Finfosec.exchange%2F%40darrenpmeyer%2F114983027943858534

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

I'll be occasionally doing a #LivePost of talks if I have the spoons and they're interesting. Block #LivePost if you don't wish to see those updates.

Darren Meyer :donor:
@darrenpmeyer@infosec.exchange

Updating my #introduction: I’m what happens when the weird kid who kept breaking into the school mainframe grows up.

I’m fascinated by designing resilience into socio-technical systems, and the ways in which our default assumptions about doing that tend to break. On my spare time, I bike and tinker (mostly Arduino these days), and mildly obsess over getting interesting coffee to come out of my brewing equipment.

Work-wise, I’m deeply interested in software engineering as a practice and system. I tend to apply this to further #AppSec goals and programs, among many other things.