DeepSeek-V3-lite naming conventions?
#76 opened about 1 hour ago
by
AlphaGaO
torch.distributed.DistNetworkError
#75 opened 4 days ago
by
yu19920006607
remove reference to deprecated transformers code
2
#74 opened 8 days ago
by
winglian
Update README.md
#73 opened 9 days ago
by
SamimSaikia
DeepSeek R1 answer ChatGPT ??
4
#72 opened 9 days ago
by
valerebron
ValueError: Unrecognized configuration class <class 'transformers_modules.configuration_deepseek.DeepseekV3Config'> to build an AutoTokenizer.
3
#69 opened 10 days ago
by
ajtakto
Parallelized script
#67 opened 10 days ago
by
ajtakto
I am getting an error message while executing pip install -r requirements.txt
5
#64 opened 14 days ago
by
yu19920006607
Does deepseek allow adding new data?
#63 opened 18 days ago
by
JoshuaBontor
`aux_loss_alpha` should be 1e-4 instead of 1e-3?
#61 opened 23 days ago
by
cuichenx
captcha not loading on edge
#60 opened 25 days ago
by
leo-smi
Upload shreya.zip
#59 opened 25 days ago
by
Msdthala
Upload IMG_20250111_184317.jpg
#58 opened 26 days ago
by
Sajalhero
Auxiliary-loss-free expert routing
1
#56 opened 27 days ago
by
qing9
AI Games
#55 opened 28 days ago
by
ChickenUJHAYIUSGU
Upload IMG_0509 4.HEIC
#54 opened 28 days ago
by
borhanrabbany
how to inference with mtp?
#53 opened 28 days ago
by
duanyu
Does it support ollama
2
#52 opened 28 days ago
by
sminbb
Create gngn
#49 opened 29 days ago
by
axingd
Missing tool call in system prompt
1
#48 opened 29 days ago
by
bchenfireworks
Update config.json
#47 opened about 1 month ago
by
STATIKwitak
Rename figures/benchmark.png to figures/𓇋𓀀𓍿.png
#46 opened about 1 month ago
by
STATIKwitak
Rename figures/benchmark.png to figures/𓇋𓀀𓍿.png
#45 opened about 1 month ago
by
STATIKwitak
Upload IMG_0295.HEIC
#42 opened about 1 month ago
by
Umarkhan499
vLLM on A100s
6
#41 opened about 1 month ago
by
fsaudm
When do you plan to integrate with Hugging Face Transformers?
#40 opened about 1 month ago
by
echooooooooo
Deciphering messages
1
#39 opened about 1 month ago
by
DoctorDonald
Update README.md
#38 opened about 1 month ago
by
chaitanyayerroju
Update README.md
1
#37 opened about 1 month ago
by
TomGrc
Training problem
3
#29 opened about 1 month ago
by
DonGan13
Update README.md
1
#28 opened about 1 month ago
by
Wisnet
Update README.md
2
#27 opened about 1 month ago
by
Aikun7777777
Failed to run the model with 4 nodes of 8 4090
17
#25 opened about 1 month ago
by
aisensiy
kill openai, come on
#24 opened about 1 month ago
by
chaochaoli
Update modeling_deepseek.py
1
#23 opened about 1 month ago
by
erichartford
is_torch_greater_or_equal_than_1_13 deprecated
#22 opened about 1 month ago
by
erichartford
Request: DOI
#21 opened about 1 month ago
by
TheDandyMan
Has anyone tried running this model on Ollama?
6
#20 opened about 1 month ago
by
Yuxin362
vLLM on A100s
4
#19 opened about 1 month ago
by
fsaudm
Fine-tuning roadmap
4
#18 opened about 1 month ago
by
RonanMcGovern
CUDA out of memory error during fp8 to bf16 model conversion + fix
1
#17 opened about 1 month ago
by
sszymczyk
when llm leaderboard?
3
#14 opened about 1 month ago
by
blazespinnaker
Update README.md
#13 opened about 1 month ago
by
BANblongz
Please make V3-lite
3
#12 opened about 1 month ago
by
rombodawg
minimum vram?
11
#9 opened about 1 month ago
by
CHNtentes
Update README.md
#7 opened about 1 month ago
by
Spestly
Converted bf16 Model on Hugging Face
2
#5 opened about 1 month ago
by
OpenSourceRonin
Update README.md
#3 opened about 1 month ago
by
reach-vb
Smaller version for Home User GPUs
10
#2 opened about 1 month ago
by
apcameron