
DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents

Hao Li, Xiaogeng Liu, Hung-Chun Chiu, Dianqi Li, Ning Zhang, Chaowei Xiao.


Framework overview (figure)

The official implementation of NeurIPS 2025 paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents".

Updates

  • [2026.4.19] 🛠️ Update the evaluation on AgentDyn.
  • [2026.1.30] 🛠️ Support the evaluation on more agents.
  • [2026.1.30] 🛠️ Update the evaluation code on ASB.

How to Start

We provide evaluation code for DRIFT; you can reproduce the results as follows.

Evaluating on AgentDojo

Construct Your Environment

conda create -n drift python=3.11
conda activate drift
pip install "agentdojo==0.1.35"
pip install -r requirements.txt

Set Your API KEY

We support three API providers: OpenAI, Google, and OpenRouter. Set the API key for the provider(s) you need.

export OPENAI_API_KEY=your_key
export GOOGLE_API_KEY=your_key
export OPENROUTER_API_KEY=your_key
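Before launching a run, it can help to confirm that at least one provider key is actually exported. The `check_keys` function below is an illustrative helper, not part of the repository:

```shell
# Illustrative helper (not part of DRIFT): succeed if at least one
# supported provider key is exported, fail otherwise.
check_keys() {
  for k in OPENAI_API_KEY GOOGLE_API_KEY OPENROUTER_API_KEY; do
    if [ -n "$(printenv "$k")" ]; then
      echo "found: $k"
      return 0
    fi
  done
  echo "no API key set" >&2
  return 1
}
```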

Run a task with no attack

python pipeline_main.py \
--model gpt-4o-mini-2024-07-18 \
--build_constraints --injection_isolation --dynamic_validation \
--suites banking,slack,travel,workspace

Run a task under attack

python pipeline_main.py \
--model gpt-4o-mini-2024-07-18 --do_attack \
--attack_type important_instructions \
--build_constraints --injection_isolation --dynamic_validation \
--suites banking,slack,travel,workspace

You can evaluate any model from the supported providers by passing its model identifier (e.g., gemini-2.5-pro) to the --model flag. To evaluate under an adaptive attack, add the --adaptive_attack flag.
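For instance, an adaptive-attack run with a Google model could be invoked as follows (same flags as the commands above; gemini-2.5-pro is just one example identifier):

```shell
python pipeline_main.py \
  --model gemini-2.5-pro --do_attack \
  --attack_type important_instructions --adaptive_attack \
  --build_constraints --injection_isolation --dynamic_validation \
  --suites banking,slack,travel,workspace
```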

Evaluating on AgentDyn

To evaluate on AgentDyn, replace the AgentDojo dependency with the AgentDyn version. First, run:

git clone git@github.com:SaFo-Lab/AgentDyn.git
cd AgentDyn
pip install -e .
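To confirm the swap took effect, one quick check (assuming AgentDyn installs under the same agentdojo package name, which the editable install above suggests) is:

```shell
# Should print a path inside your AgentDyn checkout after the editable install
python -c "import agentdojo; print(agentdojo.__file__)"
```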

The AgentDojo dependency is now replaced with the AgentDyn version, which additionally supports the shopping, github, and dailylife suites. You can evaluate on these three suites with the same commands as for AgentDojo, as shown below:

Run a task with no attack

python pipeline_main.py \
--model gpt-4o-mini-2024-07-18 \
--build_constraints --injection_isolation --dynamic_validation \
--suites shopping,github,dailylife

Run a task under attack

python pipeline_main.py \
--model gpt-4o-mini-2024-07-18 --do_attack \
--attack_type important_instructions \
--build_constraints --injection_isolation --dynamic_validation \
--suites shopping,github,dailylife

Evaluating on ASB

Please refer to ASB_DRIFT/README.md.

Inspect Results

You can find the cached results in runs/.
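The exact log layout is not documented here, but a small helper like the sketch below can enumerate whatever is cached (ASSUMPTION: runs/ contains per-task JSON logs; adjust the pattern to the layout you see locally):

```shell
# Illustrative helper (not part of DRIFT): list cached result logs.
# ASSUMPTION: the results directory holds JSON logs; adjust as needed.
list_results() {
  dir="${1:-runs}"
  if [ -d "$dir" ]; then
    find "$dir" -type f -name '*.json' | sort
  else
    echo "no cached results in $dir" >&2
    return 1
  fi
}
```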

References

If you find this work useful in your research or applications, please kindly cite:

@article{DRIFT,
  title={DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents},
  author={Hao Li and Xiaogeng Liu and Hung-Chun Chiu and Dianqi Li and Ning Zhang and Chaowei Xiao},
  journal={NeurIPS},
  year={2025}
}
