Zangwei Zheng

Dataset Growth

May 28, 2024
Via arXiv

How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning?

Apr 19, 2024
Via arXiv

DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers

Mar 15, 2024
Via arXiv

Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization

Feb 23, 2024
Via arXiv

OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models

Jan 29, 2024
Via arXiv

CAME: Confidence-guided Adaptive Memory Efficient Optimization

Jul 05, 2023
Via arXiv

To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis

May 22, 2023
Via arXiv

Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline

May 22, 2023
Via arXiv

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models

Mar 12, 2023
Via arXiv

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Mar 08, 2023
Via arXiv