Picture for Wenbo Su

Wenbo Su

SpiralFormer: Looped Transformers Can Learn Hierarchical Dependencies via Multi-Resolution Recursion

Add code
Feb 12, 2026
Viaarxiv icon

COMI: Coarse-to-fine Context Compression via Marginal Information Gain

Add code
Feb 02, 2026
Viaarxiv icon

Read As Human: Compressing Context via Parallelizable Close Reading and Skimming

Add code
Feb 02, 2026
Viaarxiv icon

CoMeT: Collaborative Memory Transformer for Efficient Long Context Modeling

Add code
Feb 02, 2026
Viaarxiv icon

PretrainRL: Alleviating Factuality Hallucination of Large Language Models at the Beginning

Add code
Feb 02, 2026
Viaarxiv icon

Dissecting Outlier Dynamics in LLM NVFP4 Pretraining

Add code
Feb 02, 2026
Viaarxiv icon

Data Distribution Matters: A Data-Centric Perspective on Context Compression for Large Language Model

Add code
Feb 02, 2026
Viaarxiv icon

CE-RM: A Pointwise Generative Reward Model Optimized via Two-Stage Rollout and Unified Criteria

Add code
Jan 28, 2026
Viaarxiv icon

ShopSimulator: Evaluating and Exploring RL-Driven LLM Agent for Shopping Assistants

Add code
Jan 26, 2026
Viaarxiv icon

Logics-STEM: Empowering LLM Reasoning via Failure-Driven Post-Training and Document Knowledge Enhancement

Add code
Jan 08, 2026
Viaarxiv icon