Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
19
1
Eric Hartford
erichartford
Follow
bemer12's profile picture
21world's profile picture
Brandyatorrez's profile picture
4 followers
·
1 following
AI & ML interests
None yet
Recent Activity
new
activity
5 days ago
mistralai/Mistral-Small-24B-Instruct-2501:
updated model_max_length from to 1000000000000000019884624838656 to 32768
new
activity
16 days ago
unsloth/DeepSeek-R1-BF16:
how did you create it?
new
activity
22 days ago
mlx-community/DeepSeek-V3-4bit:
VRAM Requirements for Running the Model
View all activity
Organizations
erichartford
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
mistralai/Mistral-Small-24B-Instruct-2501
5 days ago
updated model_max_length from to 1000000000000000019884624838656 to 32768
2
#21 opened 5 days ago by
LHC88
New activity in
unsloth/DeepSeek-R1-BF16
16 days ago
how did you create it?
1
#1 opened 17 days ago by
erichartford
New activity in
mlx-community/DeepSeek-V3-4bit
22 days ago
VRAM Requirements for Running the Model
3
#1 opened 27 days ago by
wilfoderek
New activity in
hexgrad/Kokoro-82M
25 days ago
[FAQ] Alternatives to Finetuning Kokoro
10
#19 opened about 1 month ago by
hexgrad
New activity in
huihui-ai/Dolphin3.0-Llama3.1-8B-abliterated
28 days ago
Question
2
#1 opened 29 days ago by
VicoIlFicoFigo
New activity in
mlx-community/DeepSeek-V3-3bit-bf16
30 days ago
3bit-bf16
4
#1 opened about 1 month ago by
ehartford
updated
a model
about 1 month ago
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
13 days ago
•
1.05M
•
•
3.22k
New activity in
deepseek-ai/DeepSeek-V3
about 1 month ago
Update modeling_deepseek.py
1
#23 opened about 1 month ago by
erichartford
updated
a model
about 1 month ago
deepseek-ai/DeepSeek-V3-Base
Updated
13 days ago
•
28.8k
•
1.52k
New activity in
deepseek-ai/DeepSeek-V3-Base
about 1 month ago
Update modeling_deepseek.py
#47 opened about 1 month ago by
erichartford
New activity in
deepseek-ai/DeepSeek-V3
about 1 month ago
is_torch_greater_or_equal_than_1_13 deprecated
#22 opened about 1 month ago by
erichartford
New activity in
tencent/Tencent-Hunyuan-Large
about 1 month ago
MLX quants
6
#19 opened 2 months ago by
ehartford
New activity in
inarikami/DeepSeek-V3-int4-TensorRT
about 1 month ago
Could you please tell how to inference this model?
3
#4 opened about 1 month ago by
carlosbdw
New activity in
OPEA/DeepSeek-V2.5-1210-int4-sym-inc
about 1 month ago
alternative serving framework
2
#1 opened about 1 month ago by
erichartford
New activity in
OpenCoder-LLM/opc-sft-stage1
about 1 month ago
realuser_instruct_filtered contains ERP data
2
#8 opened about 1 month ago by
erichartford
New activity in
deepseek-ai/DeepSeek-V3-Base
about 1 month ago
我嘞个dou,这么大
9
#1 opened about 1 month ago by
mrwkd123
New activity in
deepseek-ai/DeepSeek-V2.5-1210
about 2 months ago
test code doesn't work
1
#4 opened about 2 months ago by
erichartford
New activity in
meta-metrics/MetaMetrics-RM-v1.0
about 2 months ago
weights?
2
#1 opened about 2 months ago by
erichartford
New activity in
deepseek-ai/DeepSeek-V2.5-1210
about 2 months ago
fp8
#3 opened about 2 months ago by
erichartford
updated
a model
about 2 months ago
mistral-community/mixtral-8x22B-v0.3
Text Generation
•
Updated
Dec 14, 2024
•
59
•
3
Load more