AudioX is a unified framework for multimodal-conditioned audio and music generation with superior instruction-following capabilities.