Jaward Sesay

Jaward

AI & ML interests

I like to train large deep neural nets too 🧠🤖💥 | First Paper (AutoAgents: A Framework for Automatic Agent Generation) Accepted @ IJCAI 2024 | Role Model Karpathy

Recent Activity

posted an update 2 days ago

ByteDance drops OmniHuman🔥 This is peak SOTA performance - flawless natural gestures with perfect lip sync and facial expressions. This is the second time they've released SOTA level talking-heads only this time with hands and body motion. Project: https://omnihuman-lab.github.io/

upvoted a paper 2 days ago

Process Reinforcement through Implicit Rewards

upvoted a paper 2 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

View all activity

Organizations

Jaward's activity

upvoted 2 papers 2 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 3 days ago • 53

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 3 days ago • 149

upvoted an article 6 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 646

upvoted a paper 17 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 21 days ago • 105

upvoted a collection about 1 month ago

Cosmos

Collection

The collection of Cosmos models • 31 items • Updated 20 days ago • 254

upvoted 4 papers 3 months ago

Multimodal Autoregressive Pre-training of Large Vision Encoders

Paper • 2411.14402 • Published Nov 21, 2024 • 43

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 113

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24, 2024 • 32

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 83

upvoted 4 papers 4 months ago

upvoted a collection 4 months ago

Emu3

Collection

Emu3: Next-Token Prediction is All You Need • 7 items • Updated 24 days ago • 68

upvoted a paper 4 months ago

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Paper • 2409.18125 • Published Sep 26, 2024 • 34

upvoted 3 papers 5 months ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 140

Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4, 2024 • 93

Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 123

upvoted a paper 6 months ago

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 80

upvoted a paper 7 months ago

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Paper • 2407.14057 • Published Jul 19, 2024 • 45