Picture for Ning Ding

Ning Ding

Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Add code
Jun 09, 2025
Viaarxiv icon

MiniCPM4: Ultra-Efficient LLMs on End Devices

Add code
Jun 09, 2025
Figure 1 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 2 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 3 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Figure 4 for MiniCPM4: Ultra-Efficient LLMs on End Devices
Viaarxiv icon

Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation

Add code
May 28, 2025
Figure 1 for Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation
Figure 2 for Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation
Figure 3 for Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation
Figure 4 for Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation
Viaarxiv icon

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Add code
May 28, 2025
Figure 1 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 2 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 3 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Figure 4 for The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models
Viaarxiv icon

The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

Add code
May 25, 2025
Viaarxiv icon

Multi-agent Embodied AI: Advances and Future Directions

Add code
May 08, 2025
Viaarxiv icon

TTRL: Test-Time Reinforcement Learning

Add code
Apr 22, 2025
Figure 1 for TTRL: Test-Time Reinforcement Learning
Figure 2 for TTRL: Test-Time Reinforcement Learning
Figure 3 for TTRL: Test-Time Reinforcement Learning
Figure 4 for TTRL: Test-Time Reinforcement Learning
Viaarxiv icon

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Add code
Apr 04, 2025
Figure 1 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 2 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 3 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Figure 4 for AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset
Viaarxiv icon

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Add code
Mar 14, 2025
Figure 1 for Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
Figure 2 for Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
Figure 3 for Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
Figure 4 for Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models
Viaarxiv icon

Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping

Add code
Mar 10, 2025
Figure 1 for Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping
Figure 2 for Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping
Figure 3 for Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping
Figure 4 for Post-Training Quantization for Diffusion Transformer via Hierarchical Timestep Grouping
Viaarxiv icon