Tokenization


MLLMs Know When Before Speaking: Revealing and Recovering Temporal Grounding via Attention Cues

Add code
May 21, 2026
Viaarxiv icon

Cambrian-P: Pose-Grounded Video Understanding

Add code
May 21, 2026
Viaarxiv icon

Gated DeltaNet-2: Decoupling Erase and Write in Linear Attention

Add code
May 21, 2026
Viaarxiv icon

DecQ: Detail-Condensing Queries for Enhanced Reconstruction and Generation in Representation Autoencoders

Add code
May 21, 2026
Viaarxiv icon

Uniform Diffusion Models Revisited: Leave-One-Out Denoiser and Absorbing State Reformulation

Add code
May 21, 2026
Viaarxiv icon

Post-Training is About States, Not Tokens: A State Distribution View of SFT, RL, and On-Policy Distillation

Add code
May 21, 2026
Viaarxiv icon

Reading Task Failure Off the Activations: A Sparse-Feature Audit of GPT-2 Small on Indirect Object Identification

Add code
May 21, 2026
Viaarxiv icon

WorldKV: Efficient World Memory with World Retrieval and Compression

Add code
May 21, 2026
Viaarxiv icon

AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild

Add code
May 21, 2026
Viaarxiv icon

AMEL: Accumulated Message Effects on LLM Judgments

Add code
May 21, 2026
Viaarxiv icon