Picture for Cem Subakan

Cem Subakan

Investigating Faithfulness in Large Audio Language Models

Add code
Sep 26, 2025
Viaarxiv icon

FocalCodec-Stream: Streaming Low-Bitrate Speech Coding via Causal Distillation

Add code
Sep 19, 2025
Viaarxiv icon

Audio Prototypical Network For Controllable Music Recommendation

Add code
Jul 31, 2025
Viaarxiv icon

Discrete Audio Tokens: More Than a Survey!

Add code
Jun 12, 2025
Viaarxiv icon

ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs

Add code
May 26, 2025
Viaarxiv icon

LiSTEN: Learning Soft Token Embeddings for Neural Audio LLMs

Add code
May 24, 2025
Viaarxiv icon

Sample Compression for Continual Learning

Add code
Mar 13, 2025
Viaarxiv icon

ReTreever: Tree-based Coarse-to-Fine Representations for Retrieval

Add code
Feb 11, 2025
Viaarxiv icon

FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks

Add code
Feb 06, 2025
Figure 1 for FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks
Figure 2 for FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks
Figure 3 for FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks
Figure 4 for FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks
Viaarxiv icon

Planing It by Ear: Convolutional Neural Networks for Acoustic Anomaly Detection in Industrial Wood Planers

Add code
Jan 08, 2025
Figure 1 for Planing It by Ear: Convolutional Neural Networks for Acoustic Anomaly Detection in Industrial Wood Planers
Figure 2 for Planing It by Ear: Convolutional Neural Networks for Acoustic Anomaly Detection in Industrial Wood Planers
Figure 3 for Planing It by Ear: Convolutional Neural Networks for Acoustic Anomaly Detection in Industrial Wood Planers
Figure 4 for Planing It by Ear: Convolutional Neural Networks for Acoustic Anomaly Detection in Industrial Wood Planers
Viaarxiv icon