Token-level and sequence-level loss smoothing for RNN language models Paper • 1805.05062 • Published May 14, 2018
Efficient Wait-k Models for Simultaneous Machine Translation Paper • 2005.08595 • Published May 18, 2020
Added Toxicity Mitigation at Inference Time for Multimodal and Massively Multilingual Translation Paper • 2311.06532 • Published Nov 11, 2023
Large Concept Models: Language Modeling in a Sentence Representation Space Paper • 2412.08821 • Published Dec 11, 2024 • 14
Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks Paper • 2310.19909 • Published Oct 30, 2023 • 21
Does Progress On Object Recognition Benchmarks Improve Real-World Generalization? Paper • 2307.13136 • Published Jul 24, 2023 • 1
PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning Paper • 2308.03977 • Published Aug 8, 2023
A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others Paper • 2212.04825 • Published Dec 9, 2022
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling Paper • 2408.04810 • Published Aug 9, 2024 • 23
DynamicStereo: Consistent Dynamic Depth from Stereo Videos Paper • 2305.02296 • Published May 3, 2023
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos Paper • 2410.11831 • Published Oct 15, 2024 • 9