Haihao Shen
Haihao
·
AI & ML interests
LLM quantization, sparsity, and acceleration
Recent Activity
Organizations
Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon
Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding