James PRO

jtatman

AI & ML interests

improving domain specific models and re-sampling data, refining datasets for use in different modalities, small scale micro-llm clusters using quantized and smoothed models, and all emerging llm stack connecting technologies. Small models rock.

Recent Activity

liked a dataset 2 days ago
Locutusque/hercules-v6.9
liked a model 3 days ago
deepseek-ai/Janus-Pro-1B
liked a model 19 days ago
nvidia/AceInstruct-1.5B
View all activity

Organizations

ZeroGPU Explorers's profile picture The Hydra Project's profile picture Tatman ML Technologies's profile picture M4-ai's profile picture

jtatman's activity

upvoted an article 7 months ago
view article
Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

226
upvoted an article 8 months ago
view article
Article

Welcome Gemma - Google's new open LLM

21
upvoted an article 9 months ago
view article
Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

33