Picture for Fan Yang

Fan Yang

refer to the report for detailed contributions

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Add code
Jun 10, 2025
Figure 1 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 2 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 3 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Figure 4 for SeerAttention-R: Sparse Attention Adaptation for Long Reasoning
Viaarxiv icon

Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning

Add code
Jun 06, 2025
Figure 1 for Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning
Figure 2 for Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning
Figure 3 for Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning
Figure 4 for Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning
Viaarxiv icon

NGA: Non-autoregressive Generative Auction with Global Externalities for Advertising Systems

Add code
Jun 06, 2025
Figure 1 for NGA: Non-autoregressive Generative Auction with Global Externalities for Advertising Systems
Figure 2 for NGA: Non-autoregressive Generative Auction with Global Externalities for Advertising Systems
Figure 3 for NGA: Non-autoregressive Generative Auction with Global Externalities for Advertising Systems
Viaarxiv icon

Understand, Think, and Answer: Advancing Visual Reasoning with Large Multimodal Models

Add code
May 27, 2025
Viaarxiv icon

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

Add code
May 27, 2025
Figure 1 for Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
Figure 2 for Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
Figure 3 for Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
Figure 4 for Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
Viaarxiv icon

rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset

Add code
May 27, 2025
Viaarxiv icon

FieldWorkArena: Agentic AI Benchmark for Real Field Work Tasks

Add code
May 26, 2025
Viaarxiv icon

Beyond Cascaded Architectures: An End-to-end Generative Framework for Industrial Advertising

Add code
May 26, 2025
Figure 1 for Beyond Cascaded Architectures: An End-to-end Generative Framework for Industrial Advertising
Figure 2 for Beyond Cascaded Architectures: An End-to-end Generative Framework for Industrial Advertising
Figure 3 for Beyond Cascaded Architectures: An End-to-end Generative Framework for Industrial Advertising
Figure 4 for Beyond Cascaded Architectures: An End-to-end Generative Framework for Industrial Advertising
Viaarxiv icon

Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering

Add code
May 21, 2025
Viaarxiv icon

SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science

Add code
May 19, 2025
Viaarxiv icon