Text


A Variational Framework for Improving Naturalness in Generative Spoken Language Models

Jun 17, 2025
Via arXiv

GenerationPrograms: Fine-grained Attribution with Executable Programs

Jun 17, 2025
Via arXiv

Hyper-Local Deformable Transformers for Text Spotting on Historical Maps

Jun 17, 2025
Via arXiv

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Jun 16, 2025
Via arXiv

Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention

Jun 16, 2025
Via arXiv

Anomaly Object Segmentation with Vision-Language Models for Steel Scrap Recycling

Jun 16, 2025
Via arXiv

Fatigue-Aware Adaptive Interfaces for Wearable Devices Using Deep Learning

Jun 16, 2025
Via arXiv

Equitable Electronic Health Record Prediction with FAME: Fairness-Aware Multimodal Embedding

Jun 16, 2025
Via arXiv

Learning Event Completeness for Weakly Supervised Video Anomaly Detection

Jun 16, 2025
Via arXiv

Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective

Jun 16, 2025
Via arXiv