SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model • Paper • arXiv:2502.02737
ACECODER: Acing Coder RL via Automated Test-Case Synthesis • Paper • arXiv:2502.01718
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models • Paper • arXiv:2502.01061
TinySwallow • Collection • Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models • Paper • arXiv:2501.16937
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer • Paper • arXiv:2501.15570
Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 3 items
Qwen2.5-1M • Collection • The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items