nbeerbower/Dumpling-Mistral-Nemo-8B

Text Generation

text-generation-inference

Inference Endpoints

🧪 Experimental

An attempt to recover intelligence lost to pruning with a quick training run. Results are meh.

Dumpling-Mistral-Nemo-8B

nbeerbower/mistral-nemo-kartoffel-PRUNE3 finetuned on:

Method

QLoRA ORPO fine-tune on 2x RTX 3090 for 2 epochs.
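
For readers unfamiliar with ORPO, the objective it optimizes can be sketched in a few lines of plain Python. This is an illustration of the published ORPO formulation (supervised loss on the chosen response plus a weighted odds-ratio penalty), not the training script used for this model. The function names and the weight `lam=0.1` are assumptions made for the example; the card does not state the hyperparameters used.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) where p = exp(avg_logp).

    avg_logp is the length-normalized log-probability of a response,
    so it must be strictly negative (0 < p < 1).
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(avg_logp_chosen: float, avg_logp_rejected: float, lam: float = 0.1) -> float:
    """Sketch of the ORPO objective for one preference pair.

    lam is a hypothetical weight chosen for illustration only.
    """
    # Supervised term: negative log-likelihood of the chosen response.
    sft_term = -avg_logp_chosen
    # Odds-ratio term: -log(sigmoid(z)), with z the log odds ratio
    # between the chosen and the rejected response.
    z = log_odds(avg_logp_chosen) - log_odds(avg_logp_rejected)
    or_term = math.log1p(math.exp(-z))
    return sft_term + lam * or_term


# The penalty shrinks as the rejected response becomes less likely
# relative to the chosen one.
close_pair = orpo_loss(-0.5, -0.6)
far_pair = orpo_loss(-0.5, -2.0)
```

Because the odds-ratio term is computed from the policy model alone, ORPO needs no separate reference model, which helps when fitting a tune onto consumer GPUs.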

Downloads last month: 24

Safetensors

Model size: 8.43B params

Tensor type: BF16
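
As a rough guide to what the model size and tensor type mean in practice, the weight footprint can be estimated from the parameter count and the bytes stored per parameter. The sketch below is back-of-the-envelope arithmetic only: the helper names are hypothetical, the 4-bit figure ignores quantization metadata, and the estimate covers weights alone (no activations, KV cache, or optimizer state).

```python
# Parameter count as reported on the card.
PARAMS = 8.43e9

# Approximate storage cost per parameter. The 4-bit entry is an
# idealized figure that ignores per-block quantization constants.
BYTES_PER_PARAM = {"bf16": 2.0, "int8": 1.0, "4bit": 0.5}


def weight_bytes(n_params: float, dtype: str) -> float:
    """Estimate the bytes needed to hold the weights in the given format."""
    return n_params * BYTES_PER_PARAM[dtype]


def to_gb(n_bytes: float) -> float:
    """Convert bytes to decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_bytes / 1e9


for dtype in BYTES_PER_PARAM:
    print(f"{dtype}: ~{to_gb(weight_bytes(PARAMS, dtype)):.2f} GB")
```

In BF16 this comes to about 16.86 GB for the weights alone, which is why a 4-bit base model is attractive for QLoRA-style tuning on 24 GB cards such as the RTX 3090.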

Model tree for nbeerbower/Dumpling-Mistral-Nemo-8B

Base model: nbeerbower/Mahou-1.5-mistral-nemo-12B-lorablated

Finetuned: nbeerbower/mistral-nemo-kartoffel-12B

Finetuned: nbeerbower/mistral-nemo-kartoffel-PRUNE3

Finetuned (1): this model

Datasets used to train nbeerbower/Dumpling-Mistral-Nemo-8B