Fangxin Liu

Shanghai Jiao Tong University

SpecQuant: Spectral Decomposition and Adaptive Truncation for Ultra-Low-Bit LLMs Quantization

Nov 11, 2025

QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations

Nov 10, 2025

Flexible Operator Fusion for Fast Sparse Transformer with Diverse Masking on GPU

Jun 06, 2025

DASH: Input-Aware Dynamic Layer Skipping for Efficient LLM Inference with Markov Decision Policies

May 23, 2025

Phantom: Constraining Generative Artificial Intelligence Models for Practical Domain Specific Peripherals Trace Synthesizing

Nov 10, 2024

DVHN: A Deep Hashing Framework for Large-scale Vehicle Re-identification

Dec 09, 2021

TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval

May 05, 2021

SME: ReRAM-based Sparse-Multiplication-Engine to Squeeze-Out Bit Sparsity of Neural Network

Mar 02, 2021