Submitted by akhaliq 31 Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks · 6 authors 96 2
Submitted by akhaliq 18 CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases · 8 authors 4.07k 2
Submitted by davanstrien 18 WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models · 11 authors 40 3
Submitted by akhaliq 14 Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields · 5 authors 74 3
Submitted by akhaliq 13 Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling · 12 authors 17 2
Submitted by akhaliq 10 Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond · 5 authors 24 2
Submitted by akhaliq 10 RayGauss: Volumetric Gaussian-Based Ray Casting for Photorealistic Novel View Synthesis · 3 authors 2
Submitted by akhaliq 8 Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source Separation · 3 authors 36 2