10 8

Boqiang Zhang

Cyril666

https://cyrilsterling.github.io/

CyrilSterling

AI & ML interests

Multi-modal Large Language Models Vision-Language-Action Models

Recent Activity

authored a paper about 18 hours ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

authored a paper about 18 hours ago

What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness

authored a paper about 18 hours ago

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

View all activity

Organizations

Papers 7

spaces 4

models 6

Cyril666/av_model_v3

3B • Updated Jan 16

Cyril666/whisper-large-v3-encoder

Automatic Speech Recognition • 0.6B • Updated Dec 24, 2025 • 35

Cyril666/Qwen2-Audio-Encoder

0.6B • Updated Dec 8, 2025 • 4

Cyril666/SFL-Encoder-Pretrained-Qwen3

Text Generation • 0.4B • Updated Nov 17, 2025 • 94

Cyril666/exp003-softvq-l-64

Updated Jul 8, 2025 • 1

Cyril666/exp002-softvq-l-64

Updated Jul 8, 2025 • 1

datasets 0

None public yet

Boqiang Zhang

AI & ML interests

Recent Activity

Organizations

Papers 7

spaces 4 Sort: Recently updated

ContourNet

ContourNet

My_abi

Demo

models 6 Sort: Recently updated

datasets 0

spaces 4

models 6