Julien Abadji
uj
AI & ML interests
oscar :)
Recent Activity
liked
a model
about 7 hours ago
nomic-ai/modernbert-embed-base
liked
a dataset
2 months ago
oscar-corpus/community-oscar
new activity
7 months ago
HuggingFaceTB/SmolLM-360M-Instruct: fix typo
Organizations
uj's activity
fix typo
1
#2 opened 7 months ago
by
uj
About the number of documents
6
#6 opened over 1 year ago
by
lixin4ever
Add info about virus warnings in README.md
#13 opened over 1 year ago
by
uj
Unsafe Files
20
#12 opened almost 2 years ago
by
GetzPro
Deduplicated English Corpus
2
#3 opened almost 2 years ago
by
conceptofmind
The data size of Chinese is only 385GB
2
#4 opened almost 2 years ago
by
zxs1997zju
Data hosting on Huggingface
1
#2 opened almost 2 years ago
by
hieuhocnlp
How to download only one language?
2
#1 opened almost 2 years ago
by
musabg
how to use it
1
#2 opened over 2 years ago
by
graybyte
Fix typo in dataset card
#9 opened about 2 years ago
by
albertvillanova
Issue : Dataset "doesn't exist on the Hub"
2
#1 opened over 2 years ago
by
RomanCast
mwparserfromhell: KeyError: "000nbsp [while running 'train/Clean content']" while cleaning Arabic data from 20/09/2022
1
#4 opened over 2 years ago
by
uj
Progression feedback on Beam related processing?
4
#1 opened over 2 years ago
by
uj
Using the Corpus
3
#1 opened over 2 years ago
by
vitvit