FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning

Jan 16, 2026
Via arXiv

Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning

Jan 16, 2026
Via arXiv

Membership Inference on LLMs in the Wild

Jan 16, 2026
Via arXiv

Scalable Music Cover Retrieval Using Lyrics-Aligned Audio Embeddings

Jan 16, 2026
Via arXiv

VidLeaks: Membership Inference Attacks Against Text-to-Video Models

Jan 16, 2026
Via arXiv

The Growing Gains and Pains of Iterative Web Corpora Crawling: Insights from South Slavic CLASSLA-web 2.0 Corpora

Jan 16, 2026
Via arXiv

CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation

Jan 16, 2026
Via arXiv

Generation of Chest CT pulmonary Nodule Images by Latent Diffusion Models using the LIDC-IDRI Dataset

Jan 16, 2026
Via arXiv

Steering Language Models Before They Speak: Logit-Level Interventions

Jan 16, 2026
Via arXiv

AviationLMM: A Large Multimodal Foundation Model for Civil Aviation

Jan 16, 2026
Via arXiv