Picture for Wayne Xin Zhao

Wayne Xin Zhao

Towards Effective and Efficient Continual Pre-training of Large Language Models

Add code
Jul 26, 2024
Figure 1 for Towards Effective and Efficient Continual Pre-training of Large Language Models
Figure 2 for Towards Effective and Efficient Continual Pre-training of Large Language Models
Figure 3 for Towards Effective and Efficient Continual Pre-training of Large Language Models
Figure 4 for Towards Effective and Efficient Continual Pre-training of Large Language Models
Viaarxiv icon

Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment

Add code
Jul 15, 2024
Viaarxiv icon

LLMBox: A Comprehensive Library for Large Language Models

Add code
Jul 08, 2024
Figure 1 for LLMBox: A Comprehensive Library for Large Language Models
Figure 2 for LLMBox: A Comprehensive Library for Large Language Models
Figure 3 for LLMBox: A Comprehensive Library for Large Language Models
Figure 4 for LLMBox: A Comprehensive Library for Large Language Models
Viaarxiv icon

YuLan: An Open-source Large Language Model

Add code
Jun 28, 2024
Figure 1 for YuLan: An Open-source Large Language Model
Figure 2 for YuLan: An Open-source Large Language Model
Figure 3 for YuLan: An Open-source Large Language Model
Figure 4 for YuLan: An Open-source Large Language Model
Viaarxiv icon

Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning

Add code
Jun 20, 2024
Figure 1 for Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning
Figure 2 for Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning
Figure 3 for Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning
Figure 4 for Investigating the Pre-Training Dynamics of In-Context Learning: Task Recognition vs. Task Learning
Viaarxiv icon

Towards Event-oriented Long Video Understanding

Add code
Jun 20, 2024
Figure 1 for Towards Event-oriented Long Video Understanding
Figure 2 for Towards Event-oriented Long Video Understanding
Figure 3 for Towards Event-oriented Long Video Understanding
Figure 4 for Towards Event-oriented Long Video Understanding
Viaarxiv icon

CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration

Add code
Jun 19, 2024
Viaarxiv icon

Low-Redundant Optimization for Large Language Model Alignment

Add code
Jun 18, 2024
Figure 1 for Low-Redundant Optimization for Large Language Model Alignment
Figure 2 for Low-Redundant Optimization for Large Language Model Alignment
Figure 3 for Low-Redundant Optimization for Large Language Model Alignment
Figure 4 for Low-Redundant Optimization for Large Language Model Alignment
Viaarxiv icon

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models

Add code
Jun 18, 2024
Viaarxiv icon

Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector

Add code
Jun 17, 2024
Figure 1 for Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Figure 2 for Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Figure 3 for Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Figure 4 for Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector
Viaarxiv icon