Picture for Yu Yang

Yu Yang

Celine

S$^4$C: Speculative Sampling with Syntactic and Semantic Coherence for Efficient Inference of Large Language Models

Add code
Jun 17, 2025
Viaarxiv icon

X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability

Add code
Jun 16, 2025
Figure 1 for X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability
Figure 2 for X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability
Figure 3 for X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability
Figure 4 for X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability
Viaarxiv icon

How to Provably Improve Return Conditioned Supervised Learning?

Add code
Jun 10, 2025
Viaarxiv icon

MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning

Add code
Jun 10, 2025
Viaarxiv icon

LLM-Guided Reinforcement Learning: Addressing Training Bottlenecks through Policy Modulation

Add code
May 27, 2025
Viaarxiv icon

O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering

Add code
May 22, 2025
Viaarxiv icon

Predicting Student Dropout Risk With A Dual-Modal Abrupt Behavioral Changes Approach

Add code
May 16, 2025
Figure 1 for Predicting Student Dropout Risk With A Dual-Modal Abrupt Behavioral Changes Approach
Figure 2 for Predicting Student Dropout Risk With A Dual-Modal Abrupt Behavioral Changes Approach
Figure 3 for Predicting Student Dropout Risk With A Dual-Modal Abrupt Behavioral Changes Approach
Figure 4 for Predicting Student Dropout Risk With A Dual-Modal Abrupt Behavioral Changes Approach
Viaarxiv icon

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Add code
Apr 17, 2025
Figure 1 for CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Figure 2 for CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Figure 3 for CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Figure 4 for CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Viaarxiv icon

AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration

Add code
Mar 20, 2025
Figure 1 for AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration
Figure 2 for AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration
Figure 3 for AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration
Figure 4 for AutoRedTeamer: Autonomous Red Teaming with Lifelong Attack Integration
Viaarxiv icon

Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency

Add code
Feb 07, 2025
Figure 1 for Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency
Figure 2 for Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency
Figure 3 for Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency
Figure 4 for Near-Optimal Online Learning for Multi-Agent Submodular Coordination: Tight Approximation and Communication Efficiency
Viaarxiv icon