Yanxuan Yu

CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving

Dec 11, 2025
Via arXiv

$π$-Attention: Periodic Sparse Transformers for Efficient Long-Context Modeling

Nov 12, 2025
Via arXiv

SemToken: Semantic-Aware Tokenization for Efficient Long-Context Language Modeling

Aug 21, 2025
Via arXiv

FastCache: Fast Caching for Diffusion Transformer Through Learnable Linear Approximation

May 26, 2025
Via arXiv

MobileNetV2: A lightweight classification model for home-based sleep apnea screening

Dec 28, 2024
Via arXiv