SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model • Paper • 2502.02737 • Published 2 days ago • 65
Contrastive Sparse Autoencoders for Interpreting Planning of Chess-Playing Agents • Paper • 2406.04028 • Published Jun 6, 2024 • 1
Extending the Massive Text Embedding Benchmark to French • Paper • 2405.20468 • Published May 30, 2024 • 2