Picture for Yossi Adi

Yossi Adi

Sid

Discrete Audio Tokens: More Than a Survey!

Add code
Jun 12, 2025
Viaarxiv icon

Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation

Add code
Jun 11, 2025
Viaarxiv icon

StressTest: Can YOUR Speech LM Handle the Stress?

Add code
May 28, 2025
Viaarxiv icon

WHISTRESS: Enriching Transcriptions with Sentence Stress Detection

Add code
May 25, 2025
Viaarxiv icon

Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning

Add code
May 23, 2025
Viaarxiv icon

PAST: Phonetic-Acoustic Speech Tokenizer

Add code
May 20, 2025
Viaarxiv icon

CAFA: a Controllable Automatic Foley Artist

Add code
Apr 15, 2025
Viaarxiv icon

On The Landscape of Spoken Language Models: A Comprehensive Survey

Add code
Apr 11, 2025
Viaarxiv icon

Controllable Automatic Foley Artist

Add code
Apr 09, 2025
Viaarxiv icon

Scaling Analysis of Interleaved Speech-Text Language Models

Add code
Apr 03, 2025
Viaarxiv icon