notaphoenix
/

argument-transfer-liberal_l0.2_median

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Edit model card

Steered Llama-v2-7b towards Effective Arguments for Liberal Readers

This is the steered Llama-v2-7b-chat-hf model.

We used the processed debateorg dataset to create the steering vectors:

We first extracted the hidden layers of effective arguments and ineffective arguments.
For each layer, from 18-20,
1. we calculate the median of the hidden vectors.
2. We substract the median of effective arguments from the median of ineffective arguments
3. We add the result to each corresponding activation layer

Downloads last month: 13

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train notaphoenix/argument-transfer-liberal_l0.2_median