M Saad Salman
MSS444
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 hour ago
Detecting RLVR Training Data via Structural Convergence of Reasoning
upvoted a paper about 1 hour ago
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
upvoted a paper about 1 hour ago
FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching
Organizations
None yet