Picture for Songwei Liu

Songwei Liu

Motion-Aware Caching for Efficient Autoregressive Video Generation

Add code
May 03, 2026
Viaarxiv icon

DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing

Add code
Mar 30, 2026
Viaarxiv icon

S2O: Early Stopping for Sparse Attention via Online Permutation

Add code
Feb 26, 2026
Viaarxiv icon

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Add code
Dec 23, 2025
Viaarxiv icon

GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference

Add code
Dec 23, 2024
Figure 1 for GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
Figure 2 for GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
Figure 3 for GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
Figure 4 for GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference
Viaarxiv icon

Timely reliable Bayesian decision-making enabled using memristors

Add code
Dec 07, 2024
Viaarxiv icon

ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models

Add code
Aug 16, 2024
Figure 1 for ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Figure 2 for ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Figure 3 for ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Figure 4 for ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models
Viaarxiv icon

Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models

Add code
Aug 13, 2024
Figure 1 for Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models
Figure 2 for Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models
Figure 3 for Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models
Figure 4 for Hybrid SD: Edge-Cloud Collaborative Inference for Stable Diffusion Models
Viaarxiv icon

FoldGPT: Simple and Effective Large Language Model Compression Scheme

Add code
Jul 01, 2024
Viaarxiv icon

Local stochastic computing using memristor-enabled stochastic logics

Add code
Feb 25, 2024
Viaarxiv icon