Zhen Fang

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Feb 10, 2026

ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning

Feb 10, 2026

Delving into Spectral Clustering with Vision-Language Representations

Feb 10, 2026

NSC-SL: A Bandwidth-Aware Neural Subspace Compression for Communication-Efficient Split Learning

Feb 02, 2026

Vision-DeepResearch Benchmark: Rethinking Visual and Textual Search for Multimodal Large Language Models

Feb 02, 2026

Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models

Jan 29, 2026

Beyond In-Domain Detection: SpikeScore for Cross-Domain Hallucination Detection

Jan 27, 2026

How Do Transformers Learn to Associate Tokens: Gradient Leading Terms Bring Mechanistic Interpretability

Jan 27, 2026

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Jan 08, 2026

Learning Robust Spectral Dynamics for Temporal Domain Generalization

May 19, 2025