Picture for Weinan E

Weinan E

GradPower: Powering Gradients for Faster Language Model Pre-Training

Add code
May 30, 2025
Viaarxiv icon

On the Expressive Power of Mixture-of-Experts for Structured Complex Tasks

Add code
May 30, 2025
Viaarxiv icon

Scalable Complexity Control Facilitates Reasoning Ability of LLMs

Add code
May 29, 2025
Viaarxiv icon

RARE: Retrieval-Augmented Reasoning Modeling

Add code
Mar 30, 2025
Viaarxiv icon

Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens

Add code
Mar 21, 2025
Viaarxiv icon

The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training

Add code
Feb 26, 2025
Viaarxiv icon

Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence

Add code
Feb 21, 2025
Figure 1 for Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence
Figure 2 for Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence
Viaarxiv icon

PaSa: An LLM Agent for Comprehensive Academic Paper Search

Add code
Jan 17, 2025
Figure 1 for PaSa: An LLM Agent for Comprehensive Academic Paper Search
Figure 2 for PaSa: An LLM Agent for Comprehensive Academic Paper Search
Figure 3 for PaSa: An LLM Agent for Comprehensive Academic Paper Search
Figure 4 for PaSa: An LLM Agent for Comprehensive Academic Paper Search
Viaarxiv icon

Intelligent System for Automated Molecular Patent Infringement Assessment

Add code
Dec 10, 2024
Figure 1 for Intelligent System for Automated Molecular Patent Infringement Assessment
Figure 2 for Intelligent System for Automated Molecular Patent Infringement Assessment
Figure 3 for Intelligent System for Automated Molecular Patent Infringement Assessment
Figure 4 for Intelligent System for Automated Molecular Patent Infringement Assessment
Viaarxiv icon

How Transformers Implement Induction Heads: Approximation and Optimization Analysis

Add code
Oct 15, 2024
Figure 1 for How Transformers Implement Induction Heads: Approximation and Optimization Analysis
Viaarxiv icon