Improve model card: Add metadata, paper, toolkit links, and usage example

by nielsr HF Staff - opened Aug 14

←

nielsr

Aug 14

This PR significantly enhances the model card for llama31-405b-quant by providing more comprehensive and discoverable information.

Key updates include:

Addition of license: apache-2.0, library_name: transformers, and pipeline_tag: text-generation to the metadata, improving categorization and clarity.
Linking the model to its associated research paper: LLMC+: Benchmarking Vision-Language Model Compression with a Plug-and-play Toolkit.
Including direct links to the project's official GitHub repository (https://github.com/ModelTC/llmc) and its documentation (https://llmc-en.readthedocs.io/en/latest/).
Expanding the model description to provide context about its origin and the LightCompress (formerly LLMC) toolkit used for quantization.
Providing a Python code snippet for loading and inferring with the model using the transformers library, including a chat template example suitable for Llama models.

These changes will make the model more accessible, informative, and usable for the Hugging Face community.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment