Skip to content
/ D-STEER Public

A Preference Alignment Techniques Learn to Behave, not to Believe-Beneath the Surface, DPO as Steering Vector Perturbation in Activation Space

License

Notifications You must be signed in to change notification settings

pps121/D-STEER

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

D-STEER

A Preference Alignment Techniques Learn to Behave, not to Believe-Beneath the Surface, DPO as Steering Vector Perturbation in Activation Space

About

A Preference Alignment Techniques Learn to Behave, not to Believe-Beneath the Surface, DPO as Steering Vector Perturbation in Activation Space

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published