Luo Mai

MoE-Infinity: Activation-Aware Expert Offloading for Efficient MoE Serving

Jan 25, 2024
Leyang Xue, Yao Fu, Zhan Lu, Luo Mai, Mahesh Marina

ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models

Jan 25, 2024
Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai

TENPLEX: Changing Resources of Deep Learning Jobs using Parallelizable Tensor Collections

Dec 08, 2023
Marcel Wagenländer, Guo Li, Bo Zhao, Luo Mai, Peter Pietzuch

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models

Oct 08, 2023
Hanjing Wang, Man-Kit Sit, Congjie He, Ying Wen, Weinan Zhang, Jun Wang, Yaodong Yang, Luo Mai

Large Sequence Models for Sequential Decision-Making: A Survey

Jun 24, 2023
Muning Wen, Runji Lin, Hanjing Wang, Yaodong Yang, Ying Wen, Luo Mai, Jun Wang, Haifeng Zhang, Weinan Zhang

Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness

May 18, 2023
Zeyuan Tan, Xiulong Yuan, Congjie He, Man-Kit Sit, Guo Li, Xiaoze Liu, Baole Ai, Kai Zeng, Peter Pietzuch, Luo Mai

TorchOpt: An Efficient Library for Differentiable Optimization

Nov 13, 2022
Jie Ren, Xidong Feng, Bo Liu, Xuehai Pan, Yao Fu, Luo Mai, Yaodong Yang

MegBA: A High-Performance and Distributed Library for Large-Scale Bundle Adjustment

Dec 10, 2021
Jie Ren, Wenteng Liang, Ran Yan, Luo Mai, Shiwen Liu, Xiao Liu

Fast and Flexible Human Pose Estimation with HyperPose

Aug 26, 2021
Yixiao Guo, Jiawei Liu, Guo Li, Luo Mai, Hao Dong
