Low-Rank Adapters Meet Neural Architecture Search for LLM Compression Paper • 2501.16372 • Published 15 days ago • 7
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 9 days ago • 32
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Paper • 2501.17433 • Published 8 days ago • 8
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 8 days ago • 50
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 7 days ago • 49
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 8 days ago • 22
Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published 6 days ago • 32