Picture for Weizhu Chen

Weizhu Chen

Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena

Add code
Jul 15, 2024
Viaarxiv icon

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Add code
Jun 11, 2024
Viaarxiv icon

Automatic Instruction Evolving for Large Language Models

Add code
Jun 02, 2024
Viaarxiv icon

Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment

Add code
May 31, 2024
Figure 1 for Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
Figure 2 for Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
Figure 3 for Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
Figure 4 for Self-Augmented Preference Optimization: Off-Policy Paradigms for Language Model Alignment
Viaarxiv icon

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Add code
Apr 23, 2024
Figure 1 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 2 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 3 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Figure 4 for Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Viaarxiv icon

Rho-1: Not All Tokens Are What You Need

Add code
Apr 11, 2024
Figure 1 for Rho-1: Not All Tokens Are What You Need
Figure 2 for Rho-1: Not All Tokens Are What You Need
Figure 3 for Rho-1: Not All Tokens Are What You Need
Figure 4 for Rho-1: Not All Tokens Are What You Need
Viaarxiv icon

A Note on LoRA

Add code
Apr 07, 2024
Viaarxiv icon

Exploring the Mystery of Influential Data for Mathematical Reasoning

Add code
Apr 01, 2024
Figure 1 for Exploring the Mystery of Influential Data for Mathematical Reasoning
Figure 2 for Exploring the Mystery of Influential Data for Mathematical Reasoning
Figure 3 for Exploring the Mystery of Influential Data for Mathematical Reasoning
Figure 4 for Exploring the Mystery of Influential Data for Mathematical Reasoning
Viaarxiv icon

Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning

Add code
Mar 04, 2024
Figure 1 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 2 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 3 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Figure 4 for Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Viaarxiv icon

Multi-LoRA Composition for Image Generation

Add code
Feb 26, 2024
Figure 1 for Multi-LoRA Composition for Image Generation
Figure 2 for Multi-LoRA Composition for Image Generation
Figure 3 for Multi-LoRA Composition for Image Generation
Figure 4 for Multi-LoRA Composition for Image Generation
Viaarxiv icon