Picture for Ishaan Kumar

Ishaan Kumar

PoDAR: Power-Disentangled Audio Representation for Generative Modeling

Add code
May 11, 2026
Viaarxiv icon

UpstreamQA: A Modular Framework for Explicit Reasoning on Video Question Answering Tasks

Add code
Apr 25, 2026
Viaarxiv icon

High-Fidelity Audio Compression with Improved RVQGAN

Add code
Jun 11, 2023
Figure 1 for High-Fidelity Audio Compression with Improved RVQGAN
Figure 2 for High-Fidelity Audio Compression with Improved RVQGAN
Figure 3 for High-Fidelity Audio Compression with Improved RVQGAN
Figure 4 for High-Fidelity Audio Compression with Improved RVQGAN
Viaarxiv icon