Philipp Schmid

philschmid

Recent Activity

updated a dataset about 2 hours ago
philschmid/pdf-samples
published a dataset about 17 hours ago
philschmid/pdf-samples

Organizations

Google, Amazon SageMaker Community, GermanT5, Libre Euro Lingua-Alliance, Language Tools, Hugging Face H4 Community, Phind, gg-hf, Zeitgeist, Social Post Explorers, hsramall, gg-tt, LLHF, blhf

Posts 2

New state-of-the-art open LLM! 🚀 Databricks just released DBRX, a 132B MoE trained on 12T tokens. Databricks claims it surpasses OpenAI GPT-3.5 and is competitive with Google Gemini 1.0 Pro. 🤯

TL;DR
🧮 132B MoE with 16 experts, 4 of them active during generation
🪟 32,000-token context window
📈 Outperforms other open LLMs on common benchmarks, including MMLU
🚀 Up to 2x faster inference than Llama 2 70B
💻 Trained on 12T tokens
🔡 Uses the GPT-4 tokenizer
📜 Custom license, commercially usable

Collection: databricks/dbrx-6601c0852a0cdd3c59f71962
Demo: https://huggingface.co/spaces/databricks/dbrx-instruct

Kudos to the team at Databricks and MosaicML for this strong release to the open community! 🤗

Articles 48

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial