Text Generation
Transformers
Safetensors
llama
galore
text-generation-inference
Inference Endpoints

Basic Model Info

1 epoch on adamo1139/uninstruct-v1-experimental-chatml and then 1 epoch on adamo1139/HESOYAM_v0.3. I used GaLore for both stages.

This is a model trained on only human data, finetuned to behave like a person on 4chan board /x/ or redditor. Data used has comments from 1 4chan board "paranormal" and about 10 reddit subreddits. There's also a pippa in case you want to roleplay. Have a look at dataset to know what to expect.

Use ChatML prompt format with a system prompt like those in adamo1139/HESOYAM_v0.3, so A chat on 4chan or A chat on subreddit /r/wallstreetbets. It behaves like OpenAI slopped model with system prompt A chat so I advise you to avoid using that.

Downloads last month
15
Safetensors
Model size
34.4B params
Tensor type
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Datasets used to train adamo1139/Yi-34B-200K-HESOYAM-2206