feat: supported eval investigation and ran attacks in isolation by sdhossain · Pull Request #115 · criticalml-uw/TamperBench

sdhossain · 2026-03-29T14:34:12Z

Changes

Summarize the changes in this PR and describe the context or motivation for
them.

Add a title, prepending the tag [attack], [defense], [evaluation], or [infra] if
appropriate.

Testing

Describe how you tested the changes in this PR. E.g., added tests, or ran
command foo and checked the results looked good.

tomtseng · 2026-04-11T01:51:36Z

+
+        if EvalName.MT_BENCH in self.attack_config.evals:
+            results = pl.concat([results, self.evaluate_mt_bench()])
+


can we throw an error if results is empty?

Otherwise if we add an eval and forget to add it here to evaluate(), it silently fails and we just don't get the evaluation results. It'd be nice for things to fail earlier.

Or another idea is to make use of the evaluation registry here so that we don't have to manually add each eval here in evaluate()

feat: supported eval investigation and ran attacks in isolation

fb208c7

tomtseng reviewed Apr 11, 2026

View reviewed changes

This was referenced Apr 13, 2026

defenses: Add original TAR implementation #113

Merged

policy_eval: Switch PolicyEval default dataset to official dataset #120

Merged

stashing changes

8d5cbfe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: supported eval investigation and ran attacks in isolation#115

feat: supported eval investigation and ran attacks in isolation#115
sdhossain wants to merge 2 commits into
mainfrom
sh/eval_investigation

sdhossain commented Mar 29, 2026

Uh oh!

tomtseng Apr 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants


		if EvalName.MT_BENCH in self.attack_config.evals:
		results = pl.concat([results, self.evaluate_mt_bench()])

Conversation

sdhossain commented Mar 29, 2026

Changes

Testing

Uh oh!

tomtseng Apr 11, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants