PRiSM: Benchmarking Phone Realization in Speech Models Paper • 2601.14046 • Published about 17 hours ago • 3
Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects Paper • 2601.07274 • Published 9 days ago • 1
PWESuite: Phonetic Word Embeddings and Tasks They Facilitate Paper • 2304.02541 • Published Apr 5, 2023 • 2
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks Paper • 2411.05361 • Published Nov 8, 2024 • 3
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model Paper • 2510.24992 • Published Oct 28, 2025 • 2
ChiKhaPo: A Large-Scale Multilingual Benchmark for Evaluating Lexical Comprehension and Generation in Large Language Models Paper • 2510.16928 • Published Oct 19, 2025 • 4
OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder Paper • 2507.14129 • Published Jul 18, 2025 • 9
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model Paper • 2510.24992 • Published Oct 28, 2025 • 2