view article Article Accelerating LLM Code Generation Through Mask Store Streamlining By vivien โข 20 days ago โข 1
view article Article Fast, High-Fidelity LLM Decoding with Regex Constraints By vivien โข Feb 23, 2024 โข 6
view article Article An Optimal Lossy Variant of Speculative Decoding By vivien โข Jun 12, 2024 โข 2