AI & ML interests

Video Understanding, Audio-Visual, Multimodal LLMs, Video Captioning, Instruction Tuning, Dataset Curation, Qwen-based, Open-source, Fully-Open-MLLMs

Recent Activity

lyhisme  updated a model about 10 hours ago
AudioVisual-Caption/ASID-Captioner-3B
lyhisme  updated a model about 10 hours ago
AudioVisual-Caption/ASID-Captioner-7B
lyhisme  updated a Space 2 days ago
AudioVisual-Caption/README
View all activity

lyhisme 
updated a Space 2 days ago
lyhisme 
published a Space 15 days ago