Picture for Zhen Qin

Zhen Qin

On the Convergence of Gradient Descent on Learning Transformers with Residual Connections

Add code
Jun 05, 2025
Viaarxiv icon

Hybrid Latent Reasoning via Reinforcement Learning

Add code
May 24, 2025
Viaarxiv icon

EVADE: Multimodal Benchmark for Evasive Content Detection in E-Commerce Applications

Add code
May 23, 2025
Viaarxiv icon

Accelerate TarFlow Sampling with GS-Jacobi Iteration

Add code
May 19, 2025
Viaarxiv icon

Optimizing Compound Retrieval Systems

Add code
Apr 16, 2025
Viaarxiv icon

Can Pre-training Indicators Reliably Predict Fine-tuning Outcomes of LLMs?

Add code
Apr 16, 2025
Viaarxiv icon

Rankers, Judges, and Assistants: Towards Understanding the Interplay of LLMs in Information Retrieval Evaluation

Add code
Mar 24, 2025
Viaarxiv icon

Vertical Federated Learning in Practice: The Good, the Bad, and the Ugly

Add code
Feb 12, 2025
Viaarxiv icon

LLM Alignment as Retriever Optimization: An Information Retrieval Perspective

Add code
Feb 06, 2025
Figure 1 for LLM Alignment as Retriever Optimization: An Information Retrieval Perspective
Figure 2 for LLM Alignment as Retriever Optimization: An Information Retrieval Perspective
Figure 3 for LLM Alignment as Retriever Optimization: An Information Retrieval Perspective
Figure 4 for LLM Alignment as Retriever Optimization: An Information Retrieval Perspective
Viaarxiv icon

MiniMax-01: Scaling Foundation Models with Lightning Attention

Add code
Jan 14, 2025
Viaarxiv icon