Ali El Filali

alielfilali01

AI & ML interests

AI Psychometrician ? | NLP (mainly for Arabic) | Other interests include Reinforcement Learning and Cognitive sciences among others

Recent Activity

updated a dataset about 14 hours ago

OALL/requests

updated a Space 1 day ago

inceptionai/X-Risks-Leaderboard

upvoted an article 1 day ago

Open-source DeepResearch – Freeing our search agents

Posts 29

Post

1923

The 3C3H AraGen Leaderboard today welcomes deepseek-ai/DeepSeek-V3 and 12 other models (including the late gpt-3.5 💀) to its ranking of the best LLMs in Arabic!

Observations:
- DeepSeek-V3 ranked 3rd and is the only open model among the top 5!

- A 14B open model (Qwen/Qwen2.5-14B-Instruct) outperforms gpt-3.5-turbo-0125 (from last year). This shows how far we have come in advancing and supporting the Arabic presence within the LLM ecosystem!

- Contrary to what is observed on likelihood-based accuracy leaderboards (like OALL/Open-Arabic-LLM-Leaderboard), further finetuned models like maldv/Qwentile2.5-32B-Instruct actually scored lower than the original model, Qwen/Qwen2.5-32B-Instruct.
It is worth noting that the decrease is statistically insignificant, which implies that, at best, out-of-domain finetuning does not really hurt the model's original capabilities acquired during pretraining.
Previous work has addressed this (finetuning vs. pretraining), but more investigation is required (any PhDs here? This could be your question...)

Check out the latest rankings: inceptionai/AraGen-Leaderboard

Articles 2

Article

31

Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard

Collections 4

Papers 1

arxiv:2404.00565

Spaces 15

Jupyter Lab

LLM Training Cost Calculator

aya-101

jais-13b-chat

SambaLingo-Arabic-Chat

AceGPT-7B-chat

Models 44

alielfilali01/Toubkla-fineweb2-arb_Arab-adapter-small-test

Updated Dec 26, 2024

alielfilali01/dallah-llama

Visual Question Answering • Updated Dec 19, 2024 • 32

alielfilali01/AceGPT-v2-32B-Chat

Updated Dec 19, 2024 • 8

alielfilali01/PG7BB

Text Generation • Updated Jun 24, 2024 • 2.91k

alielfilali01/Q2AW1M-1001

Text Generation • Updated Jun 21, 2024 • 2.91k

alielfilali01/Q2AW1M-1111

Text Generation • Updated Jun 21, 2024 • 2.91k

alielfilali01/Q2AW1M-0000

Text Generation • Updated Jun 21, 2024 • 2.91k

alielfilali01/Q2AW1M-1000

Text Generation • Updated Jun 21, 2024 • 2.91k

alielfilali01/Q2AW1M-0100

Text Generation • Updated Jun 21, 2024 • 2.91k

alielfilali01/Q2AW1M-1100

Text Generation • Updated Jun 21, 2024 • 2.9k

Datasets 31

alielfilali01/AE8B-AraTrust

Viewer • Updated 8 days ago • 521 • 13 • 1

alielfilali01/R7B-AraTrust

Viewer • Updated 8 days ago • 464 • 34

alielfilali01/fineweb-2-arb_Arab-text-only-2

Viewer • Updated Dec 26, 2024 • 57.8M • 47

alielfilali01/fineweb-2-arb_Arab-text-only

Viewer • Updated Dec 26, 2024 • 57.8M • 208 • 1

alielfilali01/fineweb-2-arb_Arab

Viewer • Updated Dec 26, 2024 • 57.8M • 182

alielfilali01/Bactrian-X-ar-SFT

Viewer • Updated Jun 24, 2024 • 67k • 44

alielfilali01/wikipedia-20231101.ar-100k

Viewer • Updated May 20, 2024 • 100k • 41

alielfilali01/MA-Culture-Vision-v0.2

Viewer • Updated May 18, 2024 • 93 • 38 • 1

alielfilali01/MA-Culture-Vision-v0.1

Viewer • Updated May 18, 2024 • 120 • 35 • 4

alielfilali01/ary-wikipedia-20231101-MT-PC

Viewer • Updated May 5, 2024 • 8k • 36