fireblade2534

AI & ML interests

None yet

Recent Activity

reacted to lin-tan's post with 🔥 about 24 hours ago

🚀 Excited to share that our paper, "SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models", has been accepted to #ICRA2025! 🔗 Preprint: https://arxiv.org/pdf/2409.19471 We introduce SELP (Safe Efficient LLM Planner), a novel approach for generating plans that adhere to user-specified constraints while optimizing for time-efficient execution. By leveraging linear temporal logic (LTL) to interpret natural language commands, SELP effectively handles complex commands and long-horizon tasks. 🤖 💡SELP presents three key insights: 1️⃣ Equivalence Voting: Ensures robust translations from natural language instructions into LTL specifications. 2️⃣ Constrained Decoding: Uses the generated LTL formula to guide the autoregressive inference of plans, ensuring the generated plans conform to the LTL. 3️⃣ Domain-Specific Fine-Tuning: Customizes LLMs for specific robotic tasks, boosting both safety and efficiency. 📊 Experiment: Our experiments demonstrate SELP’s effectiveness and generalizability across diverse tasks. In drone navigation, SELP outperforms state-of-the-art LLM planners by 10.8% in safety rate and by 19.8% in plan efficiency. For robot manipulation, SELP achieves a 20.4% improvement in safety rate. @yiwu @jiang719 #ICRA2025 #LLM #Robotics #Agent #LLMPlanner

liked a model about 24 hours ago

xwen-team/Xwen-7B-Chat

upvoted an article 1 day ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

View all activity

Organizations

None yet

fireblade2534's activity

reacted to lin-tan's post with 🔥 about 24 hours ago

Post

1582

🚀 Excited to share that our paper, "SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models", has been accepted to #ICRA2025! 🔗 Preprint: https://arxiv.org/pdf/2409.19471

We introduce SELP (Safe Efficient LLM Planner), a novel approach for generating plans that adhere to user-specified constraints while optimizing for time-efficient execution. By leveraging linear temporal logic (LTL) to interpret natural language commands, SELP effectively handles complex commands and long-horizon tasks. 🤖

💡SELP presents three key insights:
1️⃣ Equivalence Voting: Ensures robust translations from natural language instructions into LTL specifications.
2️⃣ Constrained Decoding: Uses the generated LTL formula to guide the autoregressive inference of plans, ensuring the generated plans conform to the LTL.
3️⃣ Domain-Specific Fine-Tuning: Customizes LLMs for specific robotic tasks, boosting both safety and efficiency.

📊 Experiment: Our experiments demonstrate SELP’s effectiveness and generalizability across diverse tasks. In drone navigation, SELP outperforms state-of-the-art LLM planners by 10.8% in safety rate and by 19.8% in plan efficiency. For robot manipulation, SELP achieves a 20.4% improvement in safety rate.

@yiwu @jiang719

#ICRA2025 #LLM #Robotics #Agent #LLMPlanner

liked a model about 24 hours ago

xwen-team/Xwen-7B-Chat

Text Generation • Updated 2 days ago • 409 • 18

upvoted 3 articles 1 day ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

15 days ago

• 119

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 525

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 662

liked a Space 2 days ago

Joy Caption Alpha Two Vqa Test One

🚀

Ask questions about images and get detailed answers

liked a Space 7 days ago

Paper Impact

🐢

AI-Powered Research Impact Predictor

liked a model 7 days ago

open-thoughts/OpenThinker-7B

Text Generation • Updated 18 minutes ago • 2.06k • 50

liked a dataset 7 days ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 19 minutes ago • 114k • 27.9k • 303

liked a model 7 days ago

ozone-ai/0x-lite

Text Generation • Updated 10 days ago • 552 • 53

reacted to hexgrad's post with ❤️🔥 8 days ago

Post

8214

hexgrad/Kokoro-82M got an upgrade! ⬆️ More voices, more languages, pip install kokoro, and still 82M parameters.

GitHub: https://github.com/hexgrad/kokoro
PyPI: https://pypi.org/project/kokoro/
Space: hexgrad/Kokoro-TTS

11 replies

liked a Space 8 days ago

Shuttle Jaguar

🖼

upvoted an article 8 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

10 days ago

• 649

liked a model 9 days ago

adlb/Audialab_EDM_Elements

Updated Dec 5, 2024 • 44

liked 3 Spaces 10 days ago

236

FitDiT

🦀

FitDiT is a high-fidelity virtual try-on model.

220

Llama 3.2 Reasoning WebGPU

🧠

Small and powerful reasoning LLM that runs in your browser

131

ViTPose Transformers

⚡

Detect and annotate poses in images and videos

reacted to hexgrad's post with ❤️ 10 days ago

Post

3895

IMHO, being able & willing to defeat CAPTCHA, hCaptcha, or any other reasoning puzzle is a must-have for any Web-Browsing / Computer-Using Agent (WB/CUA).

I realize it subverts the purpose of CAPTCHA, but I do not think you can claim to be building AGI/agents without smoothly passing humanity checks. It would be like getting in a self-driving car that requires human intervention over speed bumps. Claiming AGI or even "somewhat powerful AI" seems hollow if you are halted by a mere CAPTCHA.

I imagine OpenAI's Operator is *able* but *not willing* to defeat CAPTCHA. Like their non-profit status, I expect that policy to evolve over time—and if not, rival agent-builders will attack that opening to offer a better product.

2 replies

liked a Space 10 days ago

DeepSeekR1 Search

🚀

Generate detailed answers to queries with web search and voice response