Picture for Huawei Shen

Huawei Shen

From Outcomes to Processes: Guiding PRM Learning from ORM for Inference-Time Alignment

Add code
Jun 14, 2025
Viaarxiv icon

Inference-time Alignment in Continuous Space

Add code
May 26, 2025
Viaarxiv icon

Too Consistent to Detect: A Study of Self-Consistent Errors in LLMs

Add code
May 23, 2025
Viaarxiv icon

Distilling the Implicit Multi-Branch Structure in LLMs' Reasoning via Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon

InfoNCE is a Free Lunch for Semantically guided Graph Contrastive Learning

Add code
May 07, 2025
Viaarxiv icon

Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models

Add code
Apr 01, 2025
Viaarxiv icon

MIGE: A Unified Framework for Multimodal Instruction-Based Image Generation and Editing

Add code
Feb 28, 2025
Viaarxiv icon

ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models

Add code
Feb 17, 2025
Viaarxiv icon

Revisiting Robust RAG: Do We Still Need Complex Robust Training in the Era of Powerful LLMs?

Add code
Feb 17, 2025
Viaarxiv icon

Following the Autoregressive Nature of LLM Embeddings via Compression and Alignment

Add code
Feb 17, 2025
Viaarxiv icon