Aadesh

neo-9981

AI & ML interests

NLP

Recent Activity

liked a model 12 days ago

mistralai/Mistral-Small-24B-Instruct-2501

liked a model 15 days ago

NovaSky-AI/Sky-T1-32B-Preview

liked a model 16 days ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

None yet

neo-9981's activity

upvoted an article 27 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

28 days ago

• 142

upvoted a paper about 2 months ago

Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces

Paper • 2412.14171 • Published Dec 18, 2024 • 24

upvoted a collection 4 months ago

Llama-3.1-Nemotron-70B

Collection

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 26 days ago • 151

upvoted an article 4 months ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

and 1 other •

Oct 14, 2024

• 69

upvoted an article 5 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 199

upvoted a collection 5 months ago

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18, 2024 • 227

upvoted a paper 6 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 157

upvoted an article 7 months ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

May 24, 2023

• 112

upvoted a paper 8 months ago

Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks

Paper • 2311.06242 • Published Nov 10, 2023 • 90

upvoted a collection 8 months ago

Florence

Collection

9 items • Updated Jan 8 • 163

upvoted 3 articles 9 months ago

Article

Let's talk about LLM evaluation

•

May 23, 2024

• 151

Article

License to Call: Introducing Transformers Agents 2.0

May 13, 2024

• 128

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1, 2024

• 72

upvoted a paper 9 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 120

upvoted an article 10 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 283

upvoted a paper 11 months ago

Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding

Paper • 2401.04398 • Published Jan 9, 2024 • 23

upvoted a paper about 1 year ago

ChatQA: Building GPT-4 Level Conversational QA Models

Paper • 2401.10225 • Published Jan 18, 2024 • 35