Picture for Jingren Zhou

Jingren Zhou

additional authors not shown

Qwen3 Technical Report

Add code
May 14, 2025
Figure 1 for Qwen3 Technical Report
Figure 2 for Qwen3 Technical Report
Figure 3 for Qwen3 Technical Report
Figure 4 for Qwen3 Technical Report
Viaarxiv icon

Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Add code
May 10, 2025
Viaarxiv icon

PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts

Add code
Apr 30, 2025
Viaarxiv icon

Tree-based Models for Vertical Federated Learning: A Survey

Add code
Apr 03, 2025
Viaarxiv icon

Wan: Open and Advanced Large-Scale Video Generative Models

Add code
Mar 26, 2025
Figure 1 for Wan: Open and Advanced Large-Scale Video Generative Models
Figure 2 for Wan: Open and Advanced Large-Scale Video Generative Models
Figure 3 for Wan: Open and Advanced Large-Scale Video Generative Models
Figure 4 for Wan: Open and Advanced Large-Scale Video Generative Models
Viaarxiv icon

Qwen2.5-1M Technical Report

Add code
Jan 26, 2025
Figure 1 for Qwen2.5-1M Technical Report
Figure 2 for Qwen2.5-1M Technical Report
Figure 3 for Qwen2.5-1M Technical Report
Figure 4 for Qwen2.5-1M Technical Report
Viaarxiv icon

Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models

Add code
Jan 21, 2025
Figure 1 for Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
Figure 2 for Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
Figure 3 for Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
Figure 4 for Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
Viaarxiv icon

Towards Robust and Realistic Human Pose Estimation via WiFi Signals

Add code
Jan 16, 2025
Figure 1 for Towards Robust and Realistic Human Pose Estimation via WiFi Signals
Figure 2 for Towards Robust and Realistic Human Pose Estimation via WiFi Signals
Figure 3 for Towards Robust and Realistic Human Pose Estimation via WiFi Signals
Figure 4 for Towards Robust and Realistic Human Pose Estimation via WiFi Signals
Viaarxiv icon

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Add code
Jan 13, 2025
Figure 1 for The Lessons of Developing Process Reward Models in Mathematical Reasoning
Figure 2 for The Lessons of Developing Process Reward Models in Mathematical Reasoning
Figure 3 for The Lessons of Developing Process Reward Models in Mathematical Reasoning
Figure 4 for The Lessons of Developing Process Reward Models in Mathematical Reasoning
Viaarxiv icon

ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling

Add code
Jan 07, 2025
Figure 1 for ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Figure 2 for ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Figure 3 for ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Figure 4 for ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling
Viaarxiv icon