Picture for Dacheng Tao

Dacheng Tao

and Other Contributors

Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models

Add code
Oct 20, 2023
Figure 1 for Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models
Figure 2 for Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models
Figure 3 for Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models
Figure 4 for Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models
Viaarxiv icon

Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function

Add code
Oct 18, 2023
Figure 1 for Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function
Figure 2 for Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function
Figure 3 for Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function
Figure 4 for Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function
Viaarxiv icon

Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer

Add code
Oct 15, 2023
Viaarxiv icon

Learn From Model Beyond Fine-Tuning: A Survey

Add code
Oct 12, 2023
Viaarxiv icon

PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation

Add code
Oct 11, 2023
Figure 1 for PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Figure 2 for PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Figure 3 for PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Figure 4 for PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation
Viaarxiv icon

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages

Add code
Oct 11, 2023
Figure 1 for Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Figure 2 for Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Figure 3 for Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Figure 4 for Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
Viaarxiv icon

Parameter Efficient Multi-task Model Fusion with Partial Linearization

Add code
Oct 10, 2023
Figure 1 for Parameter Efficient Multi-task Model Fusion with Partial Linearization
Figure 2 for Parameter Efficient Multi-task Model Fusion with Partial Linearization
Figure 3 for Parameter Efficient Multi-task Model Fusion with Partial Linearization
Figure 4 for Parameter Efficient Multi-task Model Fusion with Partial Linearization
Viaarxiv icon

Which mode is better for federated learning? Centralized or Decentralized

Add code
Oct 05, 2023
Figure 1 for Which mode is better for federated learning? Centralized or Decentralized
Figure 2 for Which mode is better for federated learning? Centralized or Decentralized
Figure 3 for Which mode is better for federated learning? Centralized or Decentralized
Figure 4 for Which mode is better for federated learning? Centralized or Decentralized
Viaarxiv icon

AdaMerging: Adaptive Model Merging for Multi-Task Learning

Add code
Oct 04, 2023
Figure 1 for AdaMerging: Adaptive Model Merging for Multi-Task Learning
Figure 2 for AdaMerging: Adaptive Model Merging for Multi-Task Learning
Figure 3 for AdaMerging: Adaptive Model Merging for Multi-Task Learning
Figure 4 for AdaMerging: Adaptive Model Merging for Multi-Task Learning
Viaarxiv icon

Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models

Add code
Oct 04, 2023
Figure 1 for Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Figure 2 for Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Figure 3 for Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Figure 4 for Efficient Federated Prompt Tuning for Black-box Large Pre-trained Models
Viaarxiv icon