Sparse Autoencoder


When Truthful Representations Flip Under Deceptive Instructions?

Jul 29, 2025
Via arXiv

Model-Agnostic Gender Bias Control for Text-to-Image Generation via Sparse Autoencoder

Jul 28, 2025
Via arXiv

Latent Inter-User Difference Modeling for LLM Personalization

Jul 28, 2025
Via arXiv

Sparse Autoencoders for Sequential Recommendation Models: Interpretation and Flexible Control

Jul 16, 2025
Via arXiv

Teach Old SAEs New Domain Tricks with Boosting

Jul 17, 2025
Via arXiv

CytoSAE: Interpretable Cell Embeddings for Hematology

Jul 16, 2025
Via arXiv

Learning Representations of Event Time Series with Sparse Autoencoders for Anomaly Detection, Similarity Search, and Unsupervised Classification

Jul 15, 2025
Via arXiv

VITA: Vision-to-Action Flow Matching Policy

Jul 17, 2025
Via arXiv

LoViC: Efficient Long Video Generation with Context Compression

Jul 17, 2025
Via arXiv

Exploring Task Performance with Interpretable Models via Sparse Auto-Encoders

Jul 08, 2025
Via arXiv