Picture for Ming Jin

Ming Jin

David

Data-Centric Human Preference Optimization with Rationales

Add code
Jul 19, 2024
Viaarxiv icon

Fluid Antenna-Assisted Simultaneous Wireless Information and Power Transfer Systems

Add code
Jul 16, 2024
Viaarxiv icon

A Framework of FAS-RIS Systems: Performance Analysis and Throughput Optimization

Add code
Jul 11, 2024
Viaarxiv icon

Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization?

Add code
Jun 25, 2024
Viaarxiv icon

InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States

Add code
Jun 17, 2024
Figure 1 for InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Figure 2 for InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Figure 3 for InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Figure 4 for InternalInspector $I^2$: Robust Confidence Estimation in LLMs through Internal States
Viaarxiv icon

Fairness-Aware Meta-Learning via Nash Bargaining

Add code
Jun 11, 2024
Figure 1 for Fairness-Aware Meta-Learning via Nash Bargaining
Figure 2 for Fairness-Aware Meta-Learning via Nash Bargaining
Figure 3 for Fairness-Aware Meta-Learning via Nash Bargaining
Figure 4 for Fairness-Aware Meta-Learning via Nash Bargaining
Viaarxiv icon

Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation

Add code
May 31, 2024
Figure 1 for Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Figure 2 for Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Figure 3 for Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Figure 4 for Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation
Viaarxiv icon

A CMDP-within-online framework for Meta-Safe Reinforcement Learning

Add code
May 26, 2024
Viaarxiv icon

Safe and Balanced: A Framework for Constrained Multi-Objective Reinforcement Learning

Add code
May 26, 2024
Viaarxiv icon

Pausing Policy Learning in Non-stationary Reinforcement Learning

Add code
May 25, 2024
Viaarxiv icon