Mishal

mishalalal

mishaal79

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

liked a model about 1 month ago

ibm-granite/granite-docling-258M

liked a model 2 months ago

hexgrad/Kokoro-82M

upvoted a paper 19 days ago

Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning

Paper • 2512.15687 • Published 20 days ago • 18

upvoted an article 3 months ago

What is test-time compute and how to scale it?

Article • Feb 6, 2025 • 110