Picture for Qipeng Guo

Qipeng Guo

Eric

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Add code
Jun 17, 2025
Viaarxiv icon

Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law

Add code
Jun 16, 2025
Viaarxiv icon

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache

Add code
Jun 13, 2025
Viaarxiv icon

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Add code
Jun 08, 2025
Viaarxiv icon

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

Add code
Apr 12, 2025
Viaarxiv icon

Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences

Add code
Mar 17, 2025
Viaarxiv icon

How to Mitigate Overfitting in Weak-to-strong Generalization?

Add code
Mar 06, 2025
Viaarxiv icon

CritiQ: Mining Data Quality Criteria from Human Preferences

Add code
Feb 26, 2025
Viaarxiv icon

Thus Spake Long-Context Large Language Model

Add code
Feb 24, 2025
Viaarxiv icon

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

Add code
Feb 20, 2025
Viaarxiv icon