view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 153
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q2_K-GGUF Text Generation • Updated 19 days ago • 1.22k • 5
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q2_K-GGUF Text Generation • Updated 19 days ago • 1.22k • 5
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q8_0-GGUF Text Generation • Updated 19 days ago • 200 • 2
AlicanKiraz0/Seneca-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Safe-Q8_0-GGUF Text Generation • Updated 19 days ago • 200 • 2
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 19 days ago • 273 • 3
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 19 days ago • 273 • 3
AlicanKiraz0/SenecaLLM-x-DeepSeek-R1-Distill-Qwen-32B-v1.3-Q4_K_M-GGUF Text Generation • Updated 19 days ago • 273 • 3