OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding Paper • 2406.07471 • Published Jun 11, 2024 • 1
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining Paper • 2411.15421 • Published Nov 23, 2024
MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation Paper • 2502.11903 • Published Feb 17, 2025
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Paper • 2508.21148 • Published Aug 28, 2025 • 142
Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding Paper • 2603.13366 • Published 12 days ago • 90
Reasoning Models Struggle to Control their Chains of Thought Paper • 2603.05706 • Published 16 days ago • 34
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models Paper • 2505.21523 • Published May 23, 2025 • 13