LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs Paper • 2602.00462 • Published 12 days ago • 15
Controlling Multimodal LLMs via Reward-guided Decoding Paper • 2508.11616 • Published Aug 15, 2025 • 7
Consistency-diversity-realism Pareto fronts of conditional image generative models Paper • 2406.10429 • Published Jun 14, 2024
Scaling test-time compute 📈 Space • Running • 592 • Run advanced LLM search strategies to boost problem solving
meta-llama/Llama-3.2-90B-Vision-Instruct Image-Text-to-Text • 89B • Updated Mar 4, 2025 • 2.52k • 349
LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning? Article • Jul 25, 2024 • 17