K Mohammed Irfan

k-m-irfan

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

upvoted a paper 9 days ago

A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

updated a dataset 9 days ago

MBZUAI/longshot-bench

upvoted a paper 9 days ago

A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

Paper • 2512.16978 • Published 13 days ago • 4

upvoted a paper 13 days ago

Robust and Calibrated Detection of Authentic Multimedia Content

Paper • 2512.15182 • Published 14 days ago • 15

upvoted a paper 7 months ago

VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos

Paper • 2506.05349 • Published Jun 5 • 24

upvoted a paper 10 months ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6 • 72

upvoted 2 papers about 1 year ago

UniMed-CLIP: Towards a Unified Image-Text Pretraining Paradigm for Diverse Medical Imaging Modalities

Paper • 2412.10372 • Published Dec 13, 2024 • 3

BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities

Paper • 2412.07769 • Published Dec 10, 2024 • 30

upvoted a paper over 1 year ago

GLaMM: Pixel Grounding Large Multimodal Model

Paper • 2311.03356 • Published Nov 6, 2023 • 36