Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
cyberagent
/
DeepSeek-R1-Distill-Qwen-14B-Japanese
like
67
Follow
CyberAgent
463
Text Generation
Safetensors
Japanese
qwen2
japanese
conversational
arxiv:
2501.12948
License:
mit
Model card
Files
Files and versions
Community
main
DeepSeek-R1-Distill-Qwen-14B-Japanese
1 contributor
History:
4 commits
rishigami
Update README.md
cc1ebcc
verified
10 days ago
.gitattributes
Safe
1.57 kB
Upload tokenizer
10 days ago
README.md
Safe
5.16 kB
Update README.md
10 days ago
config.json
Safe
746 Bytes
Upload Qwen2ForCausalLM
10 days ago
generation_config.json
Safe
181 Bytes
Upload Qwen2ForCausalLM
10 days ago
model-00001-of-00006.safetensors
Safe
4.99 GB
LFS
Upload Qwen2ForCausalLM
10 days ago
model-00002-of-00006.safetensors
Safe
4.95 GB
LFS
Upload Qwen2ForCausalLM
10 days ago
model-00003-of-00006.safetensors
Safe
4.95 GB
LFS
Upload Qwen2ForCausalLM
10 days ago
model-00004-of-00006.safetensors
Safe
4.95 GB
LFS
Upload Qwen2ForCausalLM
10 days ago
model-00005-of-00006.safetensors
Safe
4.95 GB
LFS
Upload Qwen2ForCausalLM
10 days ago
model-00006-of-00006.safetensors
Safe
4.73 GB
LFS
Upload Qwen2ForCausalLM
10 days ago
model.safetensors.index.json
Safe
47.5 kB
Upload Qwen2ForCausalLM
10 days ago
special_tokens_map.json
Safe
485 Bytes
Upload tokenizer
10 days ago
tokenizer.json
Safe
11.4 MB
LFS
Upload tokenizer
10 days ago
tokenizer_config.json
Safe
6.75 kB
Upload tokenizer
10 days ago