Model Card for Model ID

This LoRA adapter was extracted from mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated and uses meta-llama/Llama-3.1-8B-Instruct as a base.

Model Details

The model was extracted by running mlabonne/harmful_behaviors through the original abliterated model to generate a dataset of prompt/completion pairs, and was trained for 15 epochs on an H100 with Unsloth.

Model Description

Developed by: @reissbaker
Funded by: Synthetic Lab
License: Apache 2.0
Finetuned from model: Llama 3.1 8B Instruct

How to Get Started with the Model

Run the model with one click on glhf.chat.

Training Hyperparameters

BF16 mixed-precision 2e-4 LR Linear LR schedule AdamW 8-bit optimizer

reissbaker
/

llama-3.1-8b-abliterated-lora

Model Card for Model ID

Model Details

Model Description

How to Get Started with the Model

Training Hyperparameters

Model tree for reissbaker/llama-3.1-8b-abliterated-lora