Tong Liu

Sherman

MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core

Apr 21, 2025
Via arXiv

Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification

Mar 14, 2025
Via arXiv

AVD2: Accident Video Diffusion for Accident Video Description

Feb 21, 2025
Via arXiv

Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs

Feb 21, 2025
Via arXiv

Multi-Class Traffic Assignment using Multi-View Heterogeneous Graph Attention Networks

Jan 15, 2025
Via arXiv

FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings

Jan 11, 2025
Via arXiv

All-domain Moveline Evolution Network for Click-Through Rate Prediction

Nov 18, 2024
Via arXiv

Collaborative Contrastive Network for Click-Through Rate Prediction

Nov 18, 2024
Via arXiv

Supply Chain Network Extraction and Entity Classification Leveraging Large Language Models

Oct 16, 2024
Via arXiv

Upcycling Large Language Models into Mixture of Experts

Oct 10, 2024
Via arXiv