Ruisi Cai

Steepest Descent Density Control for Compact 3D Gaussian Splatting

May 08, 2025

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

Jan 01, 2025

Understanding and Mitigating Bottlenecks of State Space Models through the Lens of Recency and Over-smoothing

Dec 31, 2024

Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design

Oct 24, 2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild

Oct 07, 2024

Flextron: Many-in-One Flexible Large Language Model

Jun 11, 2024

LoCoCo: Dropping In Convolutions for Long Context Compression

Jun 08, 2024

Robust Mixture-of-Expert Training for Convolutional Neural Networks

Aug 19, 2023

H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Jul 19, 2023

Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?

Feb 24, 2023