Picture for Kyogu Lee

Kyogu Lee

Variable Bitrate Residual Vector Quantization for Audio Coding

Add code
Oct 08, 2024
Viaarxiv icon

Hear Your Face: Face-based voice conversion with F0 estimation

Add code
Aug 19, 2024
Figure 1 for Hear Your Face: Face-based voice conversion with F0 estimation
Figure 2 for Hear Your Face: Face-based voice conversion with F0 estimation
Figure 3 for Hear Your Face: Face-based voice conversion with F0 estimation
Figure 4 for Hear Your Face: Face-based voice conversion with F0 estimation
Viaarxiv icon

GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch

Add code
Aug 06, 2024
Figure 1 for GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
Figure 2 for GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
Figure 3 for GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
Viaarxiv icon

Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings

Add code
Jul 29, 2024
Figure 1 for Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings
Figure 2 for Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings
Figure 3 for Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings
Figure 4 for Practical and Reproducible Symbolic Music Generation by Large Language Models with Structural Embeddings
Viaarxiv icon

Wavespace: A Highly Explorable Wavetable Generator

Add code
Jul 29, 2024
Figure 1 for Wavespace: A Highly Explorable Wavetable Generator
Figure 2 for Wavespace: A Highly Explorable Wavetable Generator
Figure 3 for Wavespace: A Highly Explorable Wavetable Generator
Figure 4 for Wavespace: A Highly Explorable Wavetable Generator
Viaarxiv icon

Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation

Add code
Jul 07, 2024
Figure 1 for Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation
Figure 2 for Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation
Figure 3 for Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation
Figure 4 for Differentiable Modal Synthesis for Physical Modeling of Planar String Sound and Motion Simulation
Viaarxiv icon

Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation

Add code
Jun 12, 2024
Figure 1 for Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation
Figure 2 for Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation
Figure 3 for Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation
Figure 4 for Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation
Viaarxiv icon

Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation

Add code
May 01, 2024
Figure 1 for Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation
Figure 2 for Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation
Figure 3 for Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation
Figure 4 for Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation
Viaarxiv icon

Multidimensional Interpolants

Add code
Apr 22, 2024
Figure 1 for Multidimensional Interpolants
Figure 2 for Multidimensional Interpolants
Figure 3 for Multidimensional Interpolants
Figure 4 for Multidimensional Interpolants
Viaarxiv icon

Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling

Add code
Apr 01, 2024
Viaarxiv icon