Nagori
MohammedNaeem
AI & ML interests
None yet
Recent Activity
upvoted a paper about 16 hours ago
Back to Basics: Revisiting REINFORCE Style Optimization for Learning from Human Feedback in LLMs
upvoted a collection about 16 hours ago
Ming-V2
upvoted a paper 1 day ago
iGRPO: Self-Feedback-Driven LLM Reasoning
Organizations
None yet