Face


Speech Emotion Recognition Leveraging OpenAI's Whisper Representations and Attentive Pooling Methods

Add code
Feb 05, 2026
Viaarxiv icon

Regularized Calibration with Successive Rounding for Post-Training Quantization

Add code
Feb 05, 2026
Viaarxiv icon

RRAttention: Dynamic Block Sparse Attention via Per-Head Round-Robin Shifts for Long-Context Inference

Add code
Feb 05, 2026
Viaarxiv icon

OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention

Add code
Feb 05, 2026
Viaarxiv icon

OmniMoE: An Efficient MoE by Orchestrating Atomic Experts at Scale

Add code
Feb 05, 2026
Viaarxiv icon

ShapeUP: Scalable Image-Conditioned 3D Editing

Add code
Feb 05, 2026
Viaarxiv icon

GLASS: A Generative Recommender for Long-sequence Modeling via SID-Tier and Semantic Search

Add code
Feb 05, 2026
Viaarxiv icon

ROMAN: Reward-Orchestrated Multi-Head Attention Network for Autonomous Driving System Testing

Add code
Feb 05, 2026
Viaarxiv icon

Feature points evaluation on omnidirectional vision with a photorealistic fisheye sequence -- A report on experiments done in 2014

Add code
Feb 05, 2026
Viaarxiv icon

Reasoning under Ambiguity: Uncertainty-Aware Multilingual Emotion Classification under Partial Supervision

Add code
Feb 05, 2026
Viaarxiv icon