Picture for Kai Yang

Kai Yang

Sherman

A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation

Add code
Jun 29, 2024
Viaarxiv icon

Learning Multi-view Molecular Representations with Structured and Unstructured Knowledge

Add code
Jun 14, 2024
Viaarxiv icon

CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning

Add code
Jun 11, 2024
Viaarxiv icon

Triadic-OCD: Asynchronous Online Change Detection with Provable Robustness, Optimality, and Convergence

Add code
May 03, 2024
Viaarxiv icon

Do Efficient Transformers Really Save Computation?

Add code
Feb 21, 2024
Figure 1 for Do Efficient Transformers Really Save Computation?
Figure 2 for Do Efficient Transformers Really Save Computation?
Figure 3 for Do Efficient Transformers Really Save Computation?
Figure 4 for Do Efficient Transformers Really Save Computation?
Viaarxiv icon

BATON: Aligning Text-to-Audio Model with Human Preference Feedback

Add code
Feb 01, 2024
Figure 1 for BATON: Aligning Text-to-Audio Model with Human Preference Feedback
Figure 2 for BATON: Aligning Text-to-Audio Model with Human Preference Feedback
Figure 3 for BATON: Aligning Text-to-Audio Model with Human Preference Feedback
Figure 4 for BATON: Aligning Text-to-Audio Model with Human Preference Feedback
Viaarxiv icon

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

Add code
Jan 29, 2024
Figure 1 for Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation
Figure 2 for Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation
Figure 3 for Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation
Figure 4 for Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation
Viaarxiv icon

Exploration and Anti-Exploration with Distributional Random Network Distillation

Add code
Jan 25, 2024
Figure 1 for Exploration and Anti-Exploration with Distributional Random Network Distillation
Figure 2 for Exploration and Anti-Exploration with Distributional Random Network Distillation
Figure 3 for Exploration and Anti-Exploration with Distributional Random Network Distillation
Figure 4 for Exploration and Anti-Exploration with Distributional Random Network Distillation
Viaarxiv icon

Robust Beamforming for Downlink Multi-Cell Systems: A Bilevel Optimization Perspective

Add code
Jan 21, 2024
Figure 1 for Robust Beamforming for Downlink Multi-Cell Systems: A Bilevel Optimization Perspective
Figure 2 for Robust Beamforming for Downlink Multi-Cell Systems: A Bilevel Optimization Perspective
Figure 3 for Robust Beamforming for Downlink Multi-Cell Systems: A Bilevel Optimization Perspective
Figure 4 for Robust Beamforming for Downlink Multi-Cell Systems: A Bilevel Optimization Perspective
Viaarxiv icon

Provably Convergent Federated Trilevel Learning

Add code
Dec 19, 2023
Viaarxiv icon