LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training Paper • 2509.23661 • Published Sep 28, 2025 • 49
Sidon: Fast and Robust Open-Source Multilingual Speech Restoration for Large-scale Dataset Cleansing Paper • 2509.17052 • Published Sep 21, 2025 • 3
OmniRetarget: Interaction-Preserving Data Generation for Humanoid Whole-Body Loco-Manipulation and Scene Interaction Paper • 2509.26633 • Published Sep 30, 2025 • 7
Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets Paper • 2505.15517 • Published May 21, 2025 • 5
TruthfulQA: Measuring How Models Mimic Human Falsehoods Paper • 2109.07958 • Published Sep 8, 2021 • 2
Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus Paper • 2010.14571 • Published Oct 27, 2020 • 2
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Paper • 2505.20411 • Published May 26, 2025 • 94
SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks Paper • 2506.10954 • Published Jun 12, 2025 • 53
FunCineForge: A Unified Dataset Toolkit and Model for Zero-Shot Movie Dubbing in Diverse Cinematic Scenes Paper • 2601.14777 • Published Jan 21, 2026 • 2
MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages Paper • 2204.08582 • Published Apr 18, 2022 • 2
SQuAD: 100,000+ Questions for Machine Comprehension of Text Paper • 1606.05250 • Published Jun 16, 2016 • 4
Know What You Don't Know: Unanswerable Questions for SQuAD Paper • 1806.03822 • Published Jun 11, 2018 • 1