MrDragonFox

https://discord.gg/foxengine-ai

AI & ML interests

None yet

Recent Activity

updated a model 7 days ago

MrDragonFox/mistral_small-grpo-600-step-adaptor

published a model 7 days ago

MrDragonFox/mistral_small-grpo-600-step-adaptor

commented on an article 23 days ago

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

Organizations

MrDragonFox's activity

updated a model 7 days ago

MrDragonFox/mistral_small-grpo-600-step-adaptor

Updated 7 days ago • 5

published a model 7 days ago

MrDragonFox/mistral_small-grpo-600-step-adaptor

Updated 7 days ago • 5

commented on The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... 23 days ago

It's all just attention patching, really old stuff. nnsight (https://nnsight.net/, https://github.com/ndif-team/nnsight) is the best toolkit for that.

commented on The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... 23 days ago

nnsight has a good toolkit for that

commented on The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... 24 days ago

Just limit vllm to 1 GPU and run the rest on another one, or use -gmu.
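
One way to read the suggestion above, as a rough sketch rather than the commenter's actual setup: pin each process to its own GPU with the `CUDA_VISIBLE_DEVICES` environment variable. The helper name `pinned_command`, the model id `your-org/your-model`, and the script `run_tts.py` are all hypothetical placeholders. The sketch also assumes, without confirmation from the comment, that `-gmu` is shorthand for vLLM's `--gpu-memory-utilization` option.

```python
import os
import shlex


def pinned_command(gpu_index, argv, base_env=None):
    """Return (env, argv) so the process only sees one GPU.

    CUDA_VISIBLE_DEVICES hides every other device from CUDA programs.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_index)
    return env, list(argv)


# Hypothetical model id and script name; substitute your own.
# Assumption: "-gmu" in the comment means --gpu-memory-utilization.
vllm_env, vllm_cmd = pinned_command(
    0,
    ["vllm", "serve", "your-org/your-model", "--gpu-memory-utilization", "0.5"],
    base_env={},
)
rest_env, rest_cmd = pinned_command(1, ["python", "run_tts.py"], base_env={})

for env, cmd in ((vllm_env, vllm_cmd), (rest_env, rest_cmd)):
    print(f"CUDA_VISIBLE_DEVICES={env['CUDA_VISIBLE_DEVICES']} {shlex.join(cmd)}")

# To actually launch, pass both values to subprocess, for example:
#   subprocess.Popen(vllm_cmd, env={**os.environ, **vllm_env})
```

Nothing is launched here; the block only builds and prints the two command lines, so it runs without vLLM or a GPU installed.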

commented on The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... 24 days ago

The 8B repo is empty and the dataset is empty too. Well, it's a little off from SOTA. To be honest, glm4voice had better results, but it's certainly an "OK" PoC.

The GitHub repo is empty and there is no paper.

updated a dataset 24 days ago

MrDragonFox/vtube

Updated 24 days ago • 7

liked 2 models 25 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • Updated 5 days ago • 710k • 1.04k

unsloth/DeepSeek-R1-Distill-Qwen-7B-GGUF

Updated 20 days ago • 170k • 70

published a dataset 25 days ago

MrDragonFox/vtube

Updated 24 days ago • 7

replied to mitkox's post about 1 month ago

Is that with DDR4 or DDR5?

replied to mitkox's post about 1 month ago

With 250 GB of RAM used ^^ it's probably running at a 2-bit quant.

New activity in deepseek-ai/DeepSeek-V3 about 2 months ago

When GGUF?

#6 opened about 2 months ago by

ChuckMcSneed

New activity in byroneverson/glm-4-9b-chat-abliterated about 2 months ago

GLM

#2 opened 2 months ago by

MrDragonFox

New activity in THUDM/glm-4-voice-9b 2 months ago

nnsight output logits nan

#1 opened 2 months ago by

MrDragonFox

New activity in mistralai/Mistral-Large-Instruct-2411 2 months ago

Disappointing

#11 opened 2 months ago by

ChuckMcSneed

reacted to danielhanchen's post with 🔥 3 months ago

Vision finetuning is in 🦥Unsloth! You can now finetune Llama 3.2, Qwen2 VL, Pixtral and all Llava variants up to 2x faster and with up to 70% less VRAM usage! Colab to finetune Llama 3.2: https://colab.research.google.com/drive/1j0N4XTY1zXXy7mPAhOC1_gMYZ2F2EBlk?usp=sharing

1 reply

New activity in m-a-p/MIO-7B-Instruct 3 months ago

code still missing ... at least give us examples

#1 opened 3 months ago by

MrDragonFox

New activity in LanguageBind/Open-Sora-Plan-v1.3.0 4 months ago

prompt refiner misses part 3 of the model

#1 opened 4 months ago by

MrDragonFox

New activity in kyutai/mimi 5 months ago

Training code

#1 opened 5 months ago by

ChristophSchuhmann