Picture for Quan Wei

Quan Wei

Katie

Scalable Parameter and Memory Efficient Pretraining for LLM: Recent Algorithmic Advances and Benchmarking

Add code
May 28, 2025
Viaarxiv icon

Reinforcing Multi-Turn Reasoning in LLM Agents via Turn-Level Credit Assignment

Add code
May 17, 2025
Viaarxiv icon

RoSTE: An Efficient Quantization-Aware Supervised Fine-Tuning Approach for Large Language Models

Add code
Feb 13, 2025
Viaarxiv icon