RegularizedSelfPlay
/

sppo_reversekl-0.05-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter3

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sppo_reversekl-0.05-PromptABC-LLAMA-3-8B-Instruct-SPPO-Iter3

1 contributor

History: 3 commits

angelahzyuan's picture

Upload tokenizer

4c79ae9 verified 17 days ago