Here I develop a framework using inspect-ai for evaluating LLMs on the ETHICS dataset.
This project is a foundation for evaluating the behaviors of complex LLMs such as moral parliaments on ethical questions.
In machine ethics, the moral parliament is a leading idea among ethical decision-making algorithms.
Such algorithms are of particular interest in future LLMs.
aaron-sandoval/ethics_eval
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|