Low-Rank Adapters Meet Neural Architecture Search for LLM Compression Paper • 2501.16372 • Published 15 days ago • 7
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 9 days ago • 32
Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation Paper • 2501.17433 • Published 8 days ago • 8
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 8 days ago • 50
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published 7 days ago • 49
Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Paper • 2501.18119 • Published 8 days ago • 22
Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published 6 days ago • 32
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 11 days ago • 322
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario Paper • 2501.10132 • Published 20 days ago • 17
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 16 days ago • 22
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 16 days ago • 48