Picture for Maheed H. Ahmed

Maheed H. Ahmed

Multi-User Dueling Bandits: A Fair Approach using Nash Social Welfare

Add code
May 03, 2026
Viaarxiv icon

Reinforcement Learning from Multi-level and Episodic Human Feedback

Add code
Apr 20, 2025
Figure 1 for Reinforcement Learning from Multi-level and Episodic Human Feedback
Figure 2 for Reinforcement Learning from Multi-level and Episodic Human Feedback
Figure 3 for Reinforcement Learning from Multi-level and Episodic Human Feedback
Viaarxiv icon