-
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 107 -
sDPO: Don't Use Your Data All at Once
Paper • 2403.19270 • Published • 41 -
ViTAR: Vision Transformer with Any Resolution
Paper • 2403.18361 • Published • 54 -
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Paper • 2403.18814 • Published • 47
Phuong Pham
mp1704
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 6 hours ago
cognitivecomputations/dolphin-r1
liked
a dataset
1 day ago
HumanLLMs/Human-Like-DPO-Dataset
liked
a model
9 days ago
qnguyen3/r1-res-stream
Organizations
Collections
1
models
15
mp1704/tora_7b_sft_ckpt_200
Text Generation
•
Updated
•
6
mp1704/tora_7b_pt
Text Generation
•
Updated
•
8
mp1704/gpt-neo-sft-v2.1
Text Generation
•
Updated
•
107
mp1704/gpt-neo-sft-v2
Text Generation
•
Updated
•
108
mp1704/gpt-neo-sft
Text Generation
•
Updated
•
108
mp1704/gpt-neo-pt
Text Generation
•
Updated
•
107
mp1704/gemma_2b_sft
Text Generation
•
Updated
•
4
mp1704/gemma_2b_pt
Text Generation
•
Updated
•
10
mp1704/qwen_1.8b_sft_full_3
Text Generation
•
Updated
•
126
mp1704/qwen_1.8b_sft_full_2
Feature Extraction
•
Updated
•
104