File size: 724 Bytes
5f609a0
 
 
 
 
 
 
 
 
 
 
597e3a5
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
---
title: Multimodal AI Taxonomy
emoji: 🌍
colorFrom: red
colorTo: red
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
pinned: false
---

# Multimodal AI Taxonomy

An attempt to define a structured taxonomy for multimodal generative AI capabilities, organized by output modality and operation type.

Dataset repository: https://huggingface.co/datasets/danielrosehill/multimodal-ai-taxonomy

This Space provides an interactive explorer for browsing and comparing different multimodal AI capabilities across:
- Video Generation
- Audio Generation
- Image Generation
- Text Generation
- 3D Generation

Each modality is categorized into Creation (generating new content) and Editing (modifying existing content) operations.