Claire Chen

Offline Two-Player Zero-Sum Markov Games with KL Regularization

May 13, 2026
Via arXiv

AstroAlertBench: Evaluating the Accuracy, Reasoning, and Honesty of Multimodal LLMs in Astronomical Classification

May 07, 2026
Via arXiv

Instructing LLMs to Negotiate using Reinforcement Learning with Verifiable Rewards

Apr 10, 2026
Via arXiv

Beyond Pessimism: Offline Learning in KL-regularized Games

Apr 08, 2026
Via arXiv

MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics

Jan 30, 2026
Via arXiv

Causal-PIK: Causality-based Physical Reasoning with a Physics-Informed Kernel

May 28, 2025
Via arXiv

DexForce: Extracting Force-informed Actions from Kinesthetic Demonstrations for Dexterous Manipulation

Jan 17, 2025
Via arXiv

Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning

Oct 08, 2024
Via arXiv

Doubly Optimal Policy Evaluation for Reinforcement Learning

Oct 03, 2024
Via arXiv

AO-Grasp: Articulated Object Grasp Generation

Oct 24, 2023
Via arXiv