Picture for Xiaozhe Ren

Xiaozhe Ren

CAPE: Context-Adaptive Positional Encoding for Length Extrapolation

Add code
May 23, 2024
Figure 1 for CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Figure 2 for CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Figure 3 for CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Figure 4 for CAPE: Context-Adaptive Positional Encoding for Length Extrapolation
Viaarxiv icon

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Add code
Mar 07, 2024
Figure 1 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 2 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 3 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Figure 4 for PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Viaarxiv icon

A Survey of Reasoning with Foundation Models

Add code
Dec 26, 2023
Figure 1 for A Survey of Reasoning with Foundation Models
Figure 2 for A Survey of Reasoning with Foundation Models
Figure 3 for A Survey of Reasoning with Foundation Models
Figure 4 for A Survey of Reasoning with Foundation Models
Viaarxiv icon

EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge

Add code
Nov 23, 2023
Figure 1 for EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge
Figure 2 for EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge
Figure 3 for EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge
Figure 4 for EdgeFM: Leveraging Foundation Model for Open-set Learning on the Edge
Viaarxiv icon

CAME: Confidence-guided Adaptive Memory Efficient Optimization

Add code
Jul 05, 2023
Figure 1 for CAME: Confidence-guided Adaptive Memory Efficient Optimization
Figure 2 for CAME: Confidence-guided Adaptive Memory Efficient Optimization
Figure 3 for CAME: Confidence-guided Adaptive Memory Efficient Optimization
Figure 4 for CAME: Confidence-guided Adaptive Memory Efficient Optimization
Viaarxiv icon

Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline

Add code
May 22, 2023
Figure 1 for Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
Figure 2 for Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
Figure 3 for Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
Figure 4 for Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline
Viaarxiv icon

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

Add code
Mar 20, 2023
Figure 1 for PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Figure 2 for PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Figure 3 for PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Figure 4 for PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
Viaarxiv icon

Deeper vs Wider: A Revisit of Transformer Configuration

Add code
May 24, 2022
Figure 1 for Deeper vs Wider: A Revisit of Transformer Configuration
Figure 2 for Deeper vs Wider: A Revisit of Transformer Configuration
Figure 3 for Deeper vs Wider: A Revisit of Transformer Configuration
Figure 4 for Deeper vs Wider: A Revisit of Transformer Configuration
Viaarxiv icon

One Student Knows All Experts Know: From Sparse to Dense

Add code
Jan 26, 2022
Figure 1 for One Student Knows All Experts Know: From Sparse to Dense
Figure 2 for One Student Knows All Experts Know: From Sparse to Dense
Figure 3 for One Student Knows All Experts Know: From Sparse to Dense
Figure 4 for One Student Knows All Experts Know: From Sparse to Dense
Viaarxiv icon

Large-Scale Deep Learning Optimizations: A Comprehensive Survey

Add code
Nov 02, 2021
Figure 1 for Large-Scale Deep Learning Optimizations: A Comprehensive Survey
Figure 2 for Large-Scale Deep Learning Optimizations: A Comprehensive Survey
Figure 3 for Large-Scale Deep Learning Optimizations: A Comprehensive Survey
Figure 4 for Large-Scale Deep Learning Optimizations: A Comprehensive Survey
Viaarxiv icon