Steered Llama-v2-7b towards Effective Arguments for Liberal Readers
This is the steered Llama-v2-7b-chat-hf model.
We used the processed debateorg dataset to create the steering vectors:
- We first extracted the hidden states of effective arguments and ineffective arguments.
- For each of layers 18 to 20, we calculated the median of the hidden vectors in each group.
- We subtracted the median of the effective arguments from the median of the ineffective arguments.
- We added the resulting vector to the activations of the corresponding layer.
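The steps above can be sketched roughly as follows. This is a minimal illustration on synthetic NumPy arrays, not the code used to build this model: the function names `steering_vectors` and `apply_steering`, the toy hidden size, and the reading of "18-20" as inclusive are all assumptions. In practice the arrays would hold hidden states collected from the Llama model for each group of arguments.

```python
import numpy as np

# Assumption: "18-20" is read as layers 18, 19 and 20 inclusive.
LAYERS = (18, 19, 20)


def steering_vectors(effective, ineffective, layers=LAYERS):
    """Compute one steering vector per layer.

    effective / ineffective: dict mapping layer index to an array of
    shape (n_examples, hidden_size) holding one hidden vector per argument.
    """
    vectors = {}
    for layer in layers:
        med_eff = np.median(effective[layer], axis=0)
        med_ineff = np.median(ineffective[layer], axis=0)
        # Direction as stated in the list above:
        # median(ineffective) minus median(effective).
        vectors[layer] = med_ineff - med_eff
    return vectors


def apply_steering(hidden, vector):
    """Add the steering vector to every token position.

    hidden: array of shape (seq_len, hidden_size).
    """
    return hidden + vector


# Toy data standing in for real hidden states (hypothetical sizes).
rng = np.random.default_rng(0)
hidden_size = 8
effective = {l: rng.normal(1.0, 0.1, size=(5, hidden_size)) for l in LAYERS}
ineffective = {l: rng.normal(-1.0, 0.1, size=(5, hidden_size)) for l in LAYERS}

vectors = steering_vectors(effective, ineffective)
steered = apply_steering(np.zeros((3, hidden_size)), vectors[18])
```

The median is taken per hidden dimension (`axis=0`), so each layer gets a single vector of length `hidden_size` that is broadcast across all token positions when added.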