Alert button
Picture for Yu Bai

Yu Bai

Alert button

Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning

Add code
Bookmark button
Alert button
Apr 08, 2024
Ruiqi Zhang, Licong Lin, Yu Bai, Song Mei

Viaarxiv icon

Text2Data: Low-Resource Data Generation with Textual Control

Add code
Bookmark button
Alert button
Feb 08, 2024
Shiyu Wang, Yihao Feng, Tian Lan, Ning Yu, Yu Bai, Ran Xu, Huan Wang, Caiming Xiong, Silvio Savarese

Viaarxiv icon

Analyzing Task-Encoding Tokens in Large Language Models

Add code
Bookmark button
Alert button
Jan 20, 2024
Yu Bai, Heyan Huang, Cesare Spinoso-Di Piano, Marc-Antoine Rondeau, Sanxing Chen, Yang Gao, Jackie Chi Kit Cheung

Viaarxiv icon

Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning?

Add code
Bookmark button
Alert button
Nov 29, 2023
Lei Zhao, Mengdi Wang, Yu Bai

Viaarxiv icon

How Do Transformers Learn In-Context Beyond Simple Functions? A Case Study on Learning with Representations

Add code
Bookmark button
Alert button
Oct 16, 2023
Tianyu Guo, Wei Hu, Song Mei, Huan Wang, Caiming Xiong, Silvio Savarese, Yu Bai

Viaarxiv icon

Transformers as Decision Makers: Provable In-Context Reinforcement Learning via Supervised Pretraining

Add code
Bookmark button
Alert button
Oct 12, 2023
Licong Lin, Yu Bai, Song Mei

Viaarxiv icon

An Empirical Study of NetOps Capability of Pre-Trained Large Language Models

Add code
Bookmark button
Alert button
Sep 19, 2023
Yukai Miao, Yu Bai, Li Chen, Dan Li, Haifeng Sun, Xizheng Wang, Ziqiu Luo, Yanyu Ren, Dapeng Sun, Xiuting Xu, Qi Zhang, Chao Xiang, Xinchi Li

Figure 1 for An Empirical Study of NetOps Capability of Pre-Trained Large Language Models
Figure 2 for An Empirical Study of NetOps Capability of Pre-Trained Large Language Models
Figure 3 for An Empirical Study of NetOps Capability of Pre-Trained Large Language Models
Figure 4 for An Empirical Study of NetOps Capability of Pre-Trained Large Language Models
Viaarxiv icon

What can a Single Attention Layer Learn? A Study Through the Random Features Lens

Add code
Bookmark button
Alert button
Jul 21, 2023
Hengyu Fu, Tianyu Guo, Yu Bai, Song Mei

Figure 1 for What can a Single Attention Layer Learn? A Study Through the Random Features Lens
Figure 2 for What can a Single Attention Layer Learn? A Study Through the Random Features Lens
Figure 3 for What can a Single Attention Layer Learn? A Study Through the Random Features Lens
Figure 4 for What can a Single Attention Layer Learn? A Study Through the Random Features Lens
Viaarxiv icon

Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight

Add code
Bookmark button
Alert button
Jul 06, 2023
Jiacheng Guo, Minshuo Chen, Huan Wang, Caiming Xiong, Mengdi Wang, Yu Bai

Viaarxiv icon