view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference 22 days ago • 63
Qwen2-VL Collection Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 202
view article Article 🤗 Serve any model with Inference Endpoints + Custom Handlers By alvarobartt • Nov 22, 2024 • 3
Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations Paper • 2411.00640 • Published Nov 1, 2024 • 3
JudgeBench: A Benchmark for Evaluating LLM-based Judges Paper • 2410.12784 • Published Oct 16, 2024 • 44
view article Article Releasing Outlines-core 0.1.0: structured generation in Rust and Python Oct 22, 2024 • 44