SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper
โข
2502.02737
โข
Published
โข
61
2048x2048
. If your images are mostly larger than 1024x1024
, use BiRefNet_HR for better results! Thanks to
@Freepik
for the kind support of H200s for this huge training.1024x1024
on val set: