| title: Multimodal AI Taxonomy | |
| emoji: π | |
| colorFrom: red | |
| colorTo: red | |
| sdk: gradio | |
| sdk_version: 5.49.1 | |
| app_file: app.py | |
| pinned: false | |
| # Multimodal AI Taxonomy | |
| An attempt to define a structured taxonomy for multimodal generative AI capabilities, organized by output modality and operation type. | |
| Dataset repository: https://huggingface.co/datasets/danielrosehill/multimodal-ai-taxonomy | |
| This Space provides an interactive explorer for browsing and comparing different multimodal AI capabilities across: | |
| - Video Generation | |
| - Audio Generation | |
| - Image Generation | |
| - Text Generation | |
| - 3D Generation | |
| Each modality is categorized into Creation (generating new content) and Editing (modifying existing content) operations. | |