fireblade2534

AI & ML interests

None yet

Recent Activity

liked a model about 24 hours ago
xwen-team/Xwen-7B-Chat

Organizations

None yet

fireblade2534's activity

reacted to lin-tan's post with 🔥 about 24 hours ago
Post
1582
🚀 Excited to share that our paper, "SELP: Generating Safe and Efficient Task Plans for Robot Agents with Large Language Models", has been accepted to #ICRA2025! 🔗 Preprint: https://arxiv.org/pdf/2409.19471

We introduce SELP (Safe Efficient LLM Planner), a novel approach for generating plans that adhere to user-specified constraints while optimizing for time-efficient execution. By leveraging linear temporal logic (LTL) to interpret natural language commands, SELP effectively handles complex commands and long-horizon tasks. 🤖

💡 SELP presents three key insights:
1️⃣ Equivalence Voting: Ensures robust translations from natural language instructions into LTL specifications.
2️⃣ Constrained Decoding: Uses the generated LTL formula to guide the autoregressive inference of plans, ensuring the generated plans conform to the LTL.
3️⃣ Domain-Specific Fine-Tuning: Customizes LLMs for specific robotic tasks, boosting both safety and efficiency.

📊 Experiment: Our experiments demonstrate SELP's effectiveness and generalizability across diverse tasks. In drone navigation, SELP outperforms state-of-the-art LLM planners by 10.8% in safety rate and by 19.8% in plan efficiency. For robot manipulation, SELP achieves a 20.4% improvement in safety rate.

@yiwu @jiang719

#ICRA2025 #LLM #Robotics #Agent #LLMPlanner
upvoted 3 articles 1 day ago
Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

• 119
Article

Finally, a Replacement for BERT: Introducing ModernBERT

• 525
Article

Open-source DeepResearch – Freeing our search agents

• 662
reacted to hexgrad's post with ❤️🔥 8 days ago
upvoted an article 8 days ago
Article

Open-R1: a fully open reproduction of DeepSeek-R1

• 649
reacted to hexgrad's post with ❤️ 10 days ago
Post
3895
IMHO, being able & willing to defeat CAPTCHA, hCaptcha, or any other reasoning puzzle is a must-have for any Web-Browsing / Computer-Using Agent (WB/CUA).

I realize it subverts the purpose of CAPTCHA, but I do not think you can claim to be building AGI/agents without smoothly passing humanity checks. It would be like getting in a self-driving car that requires human intervention over speed bumps. Claiming AGI or even "somewhat powerful AI" seems hollow if you are halted by a mere CAPTCHA.

I imagine OpenAI's Operator is *able* but *not willing* to defeat CAPTCHA. Like their non-profit status, I expect that policy to evolve over time, and if not, rival agent-builders will attack that opening to offer a better product.
  • 2 replies