Tom Aarsen
tomaarsen
AI & ML interests
NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification
Recent Activity
liked a Space about 6 hours ago: argilla/synthetic-data-generator
upvoted an article 1 day ago: Open-source DeepResearch – Freeing our search agents
liked a model 1 day ago: dragonkue/BGE-m3-ko
tomaarsen's activity
Entering on MTEB
3
#12 opened 7 days ago
by
tomaarsen
Clarification regarding dimensions for gtr-t5-large embedding model
5
#3 opened 3 days ago
by
ksridhar-123
nan or 0.0 loss when training with flash attention
16
#59 opened 6 days ago
by
roadtoagi
Unable to load sentence transformer (was previously working)
1
#98 opened 6 days ago
by
avifin19
Clean up README slightly
1
#7 opened 13 days ago
by
tomaarsen
NaN values when input is longer than context window?
3
#11 opened 7 days ago
by
AHuguet
Add Sentence Transformers integration
5
#7 opened 17 days ago
by
tomaarsen
Librarian Bot: Add language metadata for dataset
#2 opened 9 days ago
by
librarian-bot
Import fails on AWS Lambda instance.
4
#55 opened 15 days ago
by
obeijbom
ModernBERT fails to work without FlashAttention!
3
#56 opened 13 days ago
by
benhachem
How to load ONNX version with CrossEncoder class?
1
#7 opened 14 days ago
by
hveigz
Update `base_model_relation` to `finetune`
#11 opened 13 days ago
by
tomaarsen
Update `base_model_relation` to `finetune`
#2 opened 13 days ago
by
tomaarsen
Update `base_model_relation` to `finetune`
#8 opened 13 days ago
by
tomaarsen
Update `base_model_relation` to `finetune`
#10 opened 13 days ago
by
tomaarsen
max_seq_length seems not to be properly reported in sentence_bert_config.json
1
#35 opened 13 days ago
by
yjoonjang
Convert git-lfs md, py, json files to normal git files
1
#8 opened 15 days ago
by
tomaarsen
patch inference on CPU & Windows + Update README snippets
#2 opened 15 days ago
by
tomaarsen
Complete Sentence Transformers integration + patch inference on CPU & Windows
1
#4 opened 15 days ago
by
tomaarsen