Picture for Chenxin An

Chenxin An

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Add code
Sep 30, 2025
Viaarxiv icon

Effective Length Extrapolation via Dimension-Wise Positional Embeddings Manipulation

Add code
Apr 26, 2025
Viaarxiv icon

PowerAttention: Exponentially Scaling of Receptive Fields for Effective Sparse Attention

Add code
Mar 05, 2025
Viaarxiv icon

VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models

Add code
Nov 26, 2024
Figure 1 for VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models
Figure 2 for VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models
Figure 3 for VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models
Figure 4 for VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models
Viaarxiv icon

Why Does the Effective Context Length of LLMs Fall Short?

Add code
Oct 24, 2024
Figure 1 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 2 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 3 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 4 for Why Does the Effective Context Length of LLMs Fall Short?
Viaarxiv icon

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Add code
Oct 23, 2024
Figure 1 for Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Figure 2 for Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Figure 3 for Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Figure 4 for Scaling Diffusion Language Models via Adaptation from Autoregressive Models
Viaarxiv icon

Temporal Reasoning Transfer from Text to Video

Add code
Oct 08, 2024
Figure 1 for Temporal Reasoning Transfer from Text to Video
Figure 2 for Temporal Reasoning Transfer from Text to Video
Figure 3 for Temporal Reasoning Transfer from Text to Video
Figure 4 for Temporal Reasoning Transfer from Text to Video
Viaarxiv icon

Training-Free Long-Context Scaling of Large Language Models

Add code
Feb 27, 2024
Viaarxiv icon

Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective

Add code
Oct 17, 2023
Figure 1 for Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
Figure 2 for Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
Figure 3 for Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
Figure 4 for Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective
Viaarxiv icon

Scaling Laws of RoPE-based Extrapolation

Add code
Oct 08, 2023
Viaarxiv icon