Stefan Josef

stefan-jo

stefan-jo

AI & ML interests

language modeling, text classification, translation, summarization

Recent Activity

upvoted an article about 21 hours ago

Open-source DeepResearch – Freeing our search agents

upvoted an article 3 days ago

Open-R1: Update #1

reacted to psinger's post with 👍 about 1 year ago

Happy to share H2O-Danube-1.8b, a small 1.8b model based trained on only 1T natural language tokens showing competitive metrics across benchmarks in the <2B model space. Base weights: https://huggingface.co/h2oai/h2o-danube-1.8b-base Chat weights: https://huggingface.co/h2oai/h2o-danube-1.8b-chat Technical report: https://huggingface.co/papers/2401.16818

View all activity

Organizations

stefan-jo's activity

upvoted an article about 21 hours ago

Article

Open-source DeepResearch – Freeing our search agents

3 days ago

• 648

upvoted an article 3 days ago

Article

Open-R1: Update #1

and 7 others •

5 days ago

• 237

reacted to psinger's post with 👍 about 1 year ago

Post

Happy to share H2O-Danube-1.8b, a small 1.8b model based trained on only 1T natural language tokens showing competitive metrics across benchmarks in the <2B model space.

Base weights: h2oai/h2o-danube-1.8b-base
Chat weights: h2oai/h2o-danube-1.8b-chat
Technical report: H2O-Danube-1.8B Technical Report (2401.16818)

reacted to clem's post with 👍 about 1 year ago

Post

Re-posting @karpathy 's blogpost here because it's down on https://karpathy.github.io/2024/01/21/selfdriving-agi. What do you all think?

4 replies

reacted to philschmid's post with ❤️ about 1 year ago

Post

What's the best way to fine-tune open LLMs in 2024? Look no further! 👀 I am excited to share “How to Fine-Tune LLMs in 2024 with Hugging Face” using the latest research techniques, including Flash Attention, Q-LoRA, OpenAI dataset formats (messages), ChatML, Packing, all built with Hugging Face TRL. 🚀

It is created for consumer-size GPUs (24GB) covering the full end-to-end lifecycle with:
💡Define and understand use cases for fine-tuning
🧑🏻‍💻 Setup of the development environment
🧮 Create and prepare dataset (OpenAI format)
🏋️‍♀️ Fine-tune LLM using TRL and the SFTTrainer
🥇 Test and evaluate the LLM
🚀 Deploy for production with TGI

👉 https://www.philschmid.de/fine-tune-llms-in-2024-with-trl

Coming soon: Advanced Guides for multi-GPU/multi-Node full fine-tuning and alignment using DPO & KTO. 🔜

4 replies

updated a collection over 1 year ago

reading-list

Collection

1 item • Updated Nov 2, 2023

updated a model about 3 years ago

stefan-jo/bert-finetuned-ner

Token Classification • Updated Jan 2, 2022 • 5