Hugging Face
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base
Language Technology Lab at Alibaba DAMO Academy
Visual Question Answering
Transformers
OpenGVLab/VideoChat2-IT
Lin-Chen/ShareGPT4V
liuhaotian/LLaVA-Instruct-150K
English
videollama2_qwen2
text-generation
multimodal large language model
large video-language model
arxiv: 2406.07476
arxiv: 2306.02858
License: apache-2.0
Branch: main
Repository size: 756 MB
3 contributors
History: 4 commits
Latest commit: d3289bb (verified), "Update README.md", by lixin4ever, about 1 year ago
File                 Size          Last commit                     Updated
.gitattributes       1.52 kB       initial commit                  about 1 year ago
README.md            7.51 kB       Update README.md                about 1 year ago
config.json          1.16 kB       Upload projector model files.   about 1 year ago
mm_projector.bin     754 MB (xet)  Upload projector model files.   about 1 year ago
trainer_state.json   1.93 MB       Upload projector model files.   about 1 year ago
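As a quick sanity check on the listing above, the per-file sizes can be summed and compared with the 756 MB shown for the repository. The sketch below is illustrative only: the helper names (`parse_size`, `total_megabytes`, `fetch_listed_file`) are hypothetical, and it assumes the page uses decimal units (1 kB = 1000 bytes). The optional download helper assumes the `huggingface_hub` package is installed and network access is available; it is not called by default.

```python
# Sizes copied from the file listing above.
# Assumption: the page uses decimal units (1 kB = 1000 B, 1 MB = 10**6 B).
UNITS = {"B": 1, "kB": 10**3, "MB": 10**6, "GB": 10**9}

REPO_ID = "DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base"

LISTED_FILES = {
    ".gitattributes": "1.52 kB",
    "README.md": "7.51 kB",
    "config.json": "1.16 kB",
    "mm_projector.bin": "754 MB",
    "trainer_state.json": "1.93 MB",
}


def parse_size(text: str) -> float:
    """Convert a display string such as '1.52 kB' into a byte count."""
    value, unit = text.split()
    return float(value) * UNITS[unit]


def total_megabytes(files: dict) -> float:
    """Sum the listed sizes and express the result in megabytes."""
    return sum(parse_size(size) for size in files.values()) / UNITS["MB"]


def fetch_listed_file(filename: str = "config.json") -> str:
    """Download one listed file and return its local path.

    Hypothetical helper: requires network access and huggingface_hub,
    which is imported lazily so the size check runs without it.
    """
    from huggingface_hub import hf_hub_download

    return hf_hub_download(repo_id=REPO_ID, filename=filename)


if __name__ == "__main__":
    print(f"Listed files total about {total_megabytes(LISTED_FILES):.2f} MB")
```

Nearly all of the total comes from `mm_projector.bin`, which is consistent with the commit message "Upload projector model files." on the three most recently added files.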