Picture for Yekun Chai

Yekun Chai

EvolKV: Evolutionary KV Cache Compression for LLM Inference

Add code
Sep 10, 2025
Viaarxiv icon

Understanding Subword Compositionality of Large Language Models

Add code
Aug 25, 2025
Viaarxiv icon

Debiasing Multilingual LLMs in Cross-lingual Latent Space

Add code
Aug 25, 2025
Viaarxiv icon

Curiosity-Driven Reinforcement Learning from Human Feedback

Add code
Jan 20, 2025
Figure 1 for Curiosity-Driven Reinforcement Learning from Human Feedback
Figure 2 for Curiosity-Driven Reinforcement Learning from Human Feedback
Figure 3 for Curiosity-Driven Reinforcement Learning from Human Feedback
Figure 4 for Curiosity-Driven Reinforcement Learning from Human Feedback
Viaarxiv icon

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Add code
Oct 03, 2024
Figure 1 for MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Figure 2 for MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Figure 3 for MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Figure 4 for MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Viaarxiv icon

Tokenization Falling Short: The Curse of Tokenization

Add code
Jun 17, 2024
Viaarxiv icon

Dual Modalities of Text: Visual and Textual Generative Pre-training

Add code
Apr 17, 2024
Figure 1 for Dual Modalities of Text: Visual and Textual Generative Pre-training
Figure 2 for Dual Modalities of Text: Visual and Textual Generative Pre-training
Figure 3 for Dual Modalities of Text: Visual and Textual Generative Pre-training
Figure 4 for Dual Modalities of Text: Visual and Textual Generative Pre-training
Viaarxiv icon

On Training Data Influence of GPT Models

Add code
Apr 11, 2024
Figure 1 for On Training Data Influence of GPT Models
Figure 2 for On Training Data Influence of GPT Models
Figure 3 for On Training Data Influence of GPT Models
Figure 4 for On Training Data Influence of GPT Models
Viaarxiv icon

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Add code
Mar 30, 2024
Figure 1 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 2 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 3 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Figure 4 for Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order
Viaarxiv icon

StarCoder 2 and The Stack v2: The Next Generation

Add code
Feb 29, 2024
Figure 1 for StarCoder 2 and The Stack v2: The Next Generation
Figure 2 for StarCoder 2 and The Stack v2: The Next Generation
Figure 3 for StarCoder 2 and The Stack v2: The Next Generation
Figure 4 for StarCoder 2 and The Stack v2: The Next Generation
Viaarxiv icon