A newer version of the Gradio SDK is available:
6.1.0
metadata
title: Multimodal AI Taxonomy
emoji: π
colorFrom: red
colorTo: red
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
Multimodal AI Taxonomy
An attempt to define a structured taxonomy for multimodal generative AI capabilities, organized by output modality and operation type.
Dataset repository: https://huggingface.co/datasets/danielrosehill/multimodal-ai-taxonomy
This Space provides an interactive explorer for browsing and comparing different multimodal AI capabilities across:
- Video Generation
- Audio Generation
- Image Generation
- Text Generation
- 3D Generation
Each modality is categorized into Creation (generating new content) and Editing (modifying existing content) operations.