Improve model card: Add pipeline tag, links, abstract, features, and usage instructions

by nielsr HF Staff - opened Oct 5

nielsr

Oct 5

This PR significantly enhances the model card for Ovi, an 8-bit quantized audio-video generation model.

Key improvements include:

- Adding the `pipeline_tag: any-to-any` to accurately reflect its multimodal generative capabilities, making it more discoverable on the Hugging Face Hub.
- Expanding the model description by including the full paper abstract and "Key Features" from the GitHub README.
- Providing direct links to the paper (Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation), the project page (https://aaxwaz.github.io/Ovi), the GitHub repository (https://github.com/character-ai/Ovi), and a Hugging Face Space demo.
- Including a detailed "Quick Start" section with installation steps, weight download instructions, configuration options, prompt formatting, and command-line usage examples for single GPU, multi-GPU inference, and Gradio, all directly sourced from the official GitHub repository.
- Embedding the video demo from the GitHub README.
- Adding Acknowledgements and Citation sections for proper academic practice.

These updates ensure the model card is informative, user-friendly, and compliant with best practices for documenting AI artifacts.
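
For readers unfamiliar with how the pipeline tag is stored: Hub model cards keep metadata in a front-matter block delimited by `---` lines at the top of the README. The sketch below is only an illustration of checking such a block for `pipeline_tag`. The helper `read_front_matter` and the sample card text are hypothetical and are not taken from this PR's diff, and the parser handles only flat `key: value` lines rather than full YAML.

```python
def read_front_matter(card_text: str) -> dict[str, str]:
    """Return flat `key: value` pairs from a leading `---` block.

    Hypothetical helper for illustration only: nested YAML, lists, and
    quoting are ignored, so use a real YAML parser for production cards.
    """
    lines = card_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # the card has no front-matter block
    meta: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            break  # closing delimiter ends the metadata block
        # Skip indented lines and list items; keep top-level pairs only.
        if line.startswith((" ", "\t", "-")):
            continue
        key, sep, value = line.partition(":")
        if sep:
            meta[key.strip()] = value.strip()
    return meta


# Assumed sample card; the real README contains much more than this.
EXAMPLE_CARD = """---
pipeline_tag: any-to-any
---
# Ovi
"""

if __name__ == "__main__":
    print(read_front_matter(EXAMPLE_CARD).get("pipeline_tag"))
```

A check like this could run before opening a metadata PR to confirm that the tag is present and spelled as intended.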

rkfg changed pull request status to merged Oct 27

rkfg

Owner Oct 27

Thank you! I didn't get an email notification for some reason and only noticed this PR now.