Benjamin Therien

btherien

https://bentherien.github.io/

AI & ML interests

Passionate about machine learning research! Currently working on efficient foundation model pre-training.

Recent Activity

updated a dataset 17 days ago

btherien/png

published a dataset 17 days ago

btherien/png

updated a dataset 18 days ago

btherien/tfds_2

View all activity

Organizations

btherien's activity

updated a dataset 17 days ago

btherien/png

Updated 17 days ago • 39

published a dataset 17 days ago

btherien/png

Updated 17 days ago • 39

updated a dataset 18 days ago

btherien/tfds_2

Updated 18 days ago • 14

published a dataset 18 days ago

btherien/tfds_2

Updated 18 days ago • 14

updated a dataset 18 days ago

btherien/lo_tf_datset

Updated 18 days ago • 25

published a dataset 18 days ago

btherien/lo_tf_datset

Updated 18 days ago • 25

upvoted a collection 7 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 647

updated a collection 7 months ago

Continual Pre-training

Collection

Models from Simple and Scalable Strategies to Continually Pre-train Large Language Models • 10 items • Updated Jul 4, 2024

upvoted a paper 8 months ago

μLO: Compute-Efficient Meta-Generalization of Learned Optimizers

Paper • 2406.00153 • Published May 31, 2024 • 11

authored 2 papers 8 months ago

Continual Pre-Training of Large Language Models: How to (re)warm your model?

Paper • 2308.04014 • Published Aug 8, 2023 • 2

$μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers

Paper • 2406.00153 • Published May 31, 2024 • 11

updated a collection 10 months ago

Continual Pre-training

Collection

Models from Simple and Scalable Strategies to Continually Pre-train Large Language Models • 10 items • Updated Jul 4, 2024

updated 2 models 10 months ago

cerc-aai/405m_pile-sp_replay1

Updated Apr 24, 2024

cerc-aai/405m_pile-sp_replay5

Updated Apr 24, 2024