Thrilled to introduce Adam-mini, an optimizer that achieves on-par or better performance than AdamW with a 45% to 50% smaller memory footprint. Adam-mini can also achieve 49.5% higher throughput than AdamW on Llama2-7B pre-training.
The design of Adam-mini is inspired by certain Hessian structures we observed on Transformers.
Feel free to try it out! Switch to Adam-mini with the same hyperparameters as AdamW and it should work with only half the memory. Hope Adam-mini can help save time, cost, and energy in your tasks!
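For illustration, here is a minimal sketch of the drop-in swap in PyTorch. The import path and constructor arguments are assumptions based on the announcement, not a verified API; check the official Adam-mini repository for the exact signature.

```python
import torch
from adam_mini import Adam_mini  # assumed import path; check the official repo

# Toy model and data purely for illustration.
model = torch.nn.Linear(16, 4)
x, y = torch.randn(32, 16), torch.randn(32, 4)

# Before: the usual AdamW setup.
# optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4,
#                               betas=(0.9, 0.95), weight_decay=0.1)

# After: the same hyperparameters, passed to Adam-mini instead.
# The keyword arguments below are assumptions, not a verified signature.
optimizer = Adam_mini(
    named_parameters=model.named_parameters(),
    lr=1e-4,
    betas=(0.9, 0.95),
    weight_decay=0.1,
)

# The training loop itself is unchanged.
for _ in range(10):
    optimizer.zero_grad()
    loss = torch.nn.functional.mse_loss(model(x), y)
    loss.backward()
    optimizer.step()
```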
We are happy to introduce InstantStyle, a framework that employs simple yet effective techniques to disentangle style and content from reference images.
After giving GPU Programming a hands-on try, I have come to appreciate the level of complexity in AI compute:
- Existing/leading frameworks (CUDA, OpenCL, DSLs, even Triton) are still at the mercy of low-level compute details that demand deep understanding and experience.
- Optimization methods are often ambiguous and will drive you mad.
- Triton is cool but not cool enough: its high-level abstractions fall back to low-level compute issues as you build more specialized kernels (see the sketch after this list).
- As for CUDA, optimization requires considering all the major components of the GPU (DRAM, SRAM, ALUs).
- Models today require custom, hand-written GPU kernels to reduce storage and compute cost.
- GPTQ was a big save.
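To make the Triton point concrete: even a trivial fused element-wise kernel makes you manage program IDs, block sizes, and masked loads/stores explicitly, which are the same DRAM/SRAM questions CUDA raises. A minimal sketch, assuming a standard Triton install and a CUDA GPU; the kernel and wrapper names are just illustrative.

```python
import torch
import triton
import triton.language as tl

@triton.jit
def fused_add_relu_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    # Each program instance handles one BLOCK_SIZE-wide slice of the tensors.
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements  # guard against out-of-bounds accesses
    # Loads and stores map directly onto DRAM traffic; the block size controls
    # how much work (and how many registers) each program instance uses.
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, tl.maximum(x + y, 0.0), mask=mask)

def fused_add_relu(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = x.numel()
    grid = (triton.cdiv(n, 1024),)  # one program instance per 1024 elements
    fused_add_relu_kernel[grid](x, y, out, n, BLOCK_SIZE=1024)
    return out

# Usage (requires a CUDA GPU):
# a = torch.randn(10_000, device="cuda")
# b = torch.randn(10_000, device="cuda")
# c = fused_add_relu(a, b)
```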
@karpathy is right: expertise in this area is scarce, and the reason is quite obvious. There is still so much uncertainty: we are still struggling to get peak performance from interconnected GPUs while maintaining precision and reducing cost.