Snowflake/snowflake-arctic-embed-l-v2.0 Sentence Similarity • Updated Dec 14, 2024 • 89.3k • 112
Post: Hovorte po slovensky? (Do you speak Slovak?) Help build better AI for Slovak! We only need 90 more annotations to include Slovak in the next Hugging Face FineWeb2-C dataset (data-is-better-together/fineweb-c) release! Your contribution will help create better language models for 5+ million Slovak speakers. Annotate here: data-is-better-together/fineweb-c. Read more about why we're doing it: https://huggingface.co/blog/davanstrien/fineweb2-community
Article: FineWeb2-C: Help Build Better Language Models in Your Language • By davanstrien and 5 others • Dec 23, 2024 • 18
Falcon3 Collection: The Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. • 40 items • Updated Jan 8 • 80
FineWeb-c - Annotation • Launch Argilla for data labeling and annotation • Running on CPU Upgrade • 34
EU20-Benchmarks Collection: Evaluation Benchmarks for 20 European languages. • 5 items • Updated Oct 11, 2024 • 7