Picture for Xiaolu Zhang

Xiaolu Zhang

Keypoint-based Progressive Chain-of-Thought Distillation for LLMs

Add code
May 25, 2024
Viaarxiv icon

Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations

Add code
Apr 24, 2024
Figure 1 for Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Figure 2 for Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Figure 3 for Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Figure 4 for Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations
Viaarxiv icon

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes

Add code
Apr 15, 2024
Viaarxiv icon

Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual User Behaviors

Add code
Mar 28, 2024
Figure 1 for Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual User Behaviors
Figure 2 for Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual User Behaviors
Figure 3 for Breaking the Length Barrier: LLM-Enhanced CTR Prediction in Long Textual User Behaviors
Viaarxiv icon

G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems

Add code
Jan 09, 2024
Figure 1 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 2 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 3 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 4 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Viaarxiv icon

An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training

Add code
Dec 19, 2023
Viaarxiv icon

One Model for All: Large Language Models are Domain-Agnostic Recommendation Systems

Add code
Oct 22, 2023
Figure 1 for One Model for All: Large Language Models are Domain-Agnostic Recommendation Systems
Figure 2 for One Model for All: Large Language Models are Domain-Agnostic Recommendation Systems
Figure 3 for One Model for All: Large Language Models are Domain-Agnostic Recommendation Systems
Figure 4 for One Model for All: Large Language Models are Domain-Agnostic Recommendation Systems
Viaarxiv icon

Rethinking Memory and Communication Cost for Efficient Large Language Model Training

Add code
Oct 09, 2023
Figure 1 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 2 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 3 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 4 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Viaarxiv icon

AntM$^{2}$C: A Large Scale Dataset For Multi-Scenario Multi-Modal CTR Prediction

Add code
Aug 31, 2023
Figure 1 for AntM$^{2}$C: A Large Scale Dataset For Multi-Scenario Multi-Modal CTR Prediction
Figure 2 for AntM$^{2}$C: A Large Scale Dataset For Multi-Scenario Multi-Modal CTR Prediction
Figure 3 for AntM$^{2}$C: A Large Scale Dataset For Multi-Scenario Multi-Modal CTR Prediction
Figure 4 for AntM$^{2}$C: A Large Scale Dataset For Multi-Scenario Multi-Modal CTR Prediction
Viaarxiv icon

Towards Open Temporal Graph Neural Networks

Add code
Mar 27, 2023
Figure 1 for Towards Open Temporal Graph Neural Networks
Figure 2 for Towards Open Temporal Graph Neural Networks
Figure 3 for Towards Open Temporal Graph Neural Networks
Figure 4 for Towards Open Temporal Graph Neural Networks
Viaarxiv icon