RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models Paper • 2602.17053 • Published 20 days ago • 1