Bohan Yu

TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured Table Question Answering

Jun 04, 2025
Via arXiv

Deep Speech Synthesis from Multimodal Articulatory Representations

Dec 17, 2024
Via arXiv

Fast, High-Quality and Parameter-Efficient Articulatory Synthesis using Differentiable DSP

Sep 04, 2024
Via arXiv

Towards EMG-to-Speech with a Necklace Form Factor

Jul 31, 2024
Via arXiv

E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors

Jul 11, 2024
Via arXiv

Multimodal Segmentation for Vocal Tract Modeling

Jun 22, 2024
Via arXiv

Towards Streaming Speech-to-Avatar Synthesis

Oct 25, 2023
Via arXiv

Towards an Interpretable Representation of Speaker Identity via Perceptual Voice Qualities

Oct 04, 2023
Via arXiv

Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe

Sep 12, 2022
Via arXiv

Doctor Imitator: A Graph-based Bone Age Assessment Framework Using Hand Radiographs

Feb 10, 2021
Via arXiv