File size: 724 Bytes
5f609a0 597e3a5 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 |
---
title: Multimodal AI Taxonomy
emoji: π
colorFrom: red
colorTo: red
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
---
# Multimodal AI Taxonomy
An attempt to define a structured taxonomy for multimodal generative AI capabilities, organized by output modality and operation type.
Dataset repository: https://huggingface.co/datasets/danielrosehill/multimodal-ai-taxonomy
This Space provides an interactive explorer for browsing and comparing different multimodal AI capabilities across:
- Video Generation
- Audio Generation
- Image Generation
- Text Generation
- 3D Generation
Each modality is categorized into Creation (generating new content) and Editing (modifying existing content) operations.
|