Model Card for Model ID
This LoRA adapter was extracted from mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated and uses meta-llama/Llama-3.1-8B-Instruct as a base.
Model Details
The model was extracted by running mlabonne/harmful_behaviors through the original abliterated model to generate a dataset of prompt/completion pairs, and was trained for 15 epochs on an H100 with Unsloth.
Model Description
- Developed by: @reissbaker
- Funded by: Synthetic Lab
- License: Apache 2.0
- Finetuned from model: Llama 3.1 8B Instruct
How to Get Started with the Model
Run the model with one click on glhf.chat.
Training Hyperparameters
BF16 mixed-precision 2e-4 LR Linear LR schedule AdamW 8-bit optimizer
- Downloads last month
- 226
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
HF Inference API was unable to determine this model’s pipeline type.
Model tree for reissbaker/llama-3.1-8b-abliterated-lora
Base model
meta-llama/Llama-3.1-8B
Finetuned
meta-llama/Llama-3.1-8B-Instruct