Picture for Yifan Peng

Yifan Peng

Ret.

Multi-Convformer: Extending Conformer with Multiple Convolution Kernels

Add code
Jul 04, 2024
Viaarxiv icon

Towards Robust Speech Representation Learning for Thousands of Languages

Add code
Jul 02, 2024
Viaarxiv icon

Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss

Add code
Jun 23, 2024
Viaarxiv icon

On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models

Add code
Jun 13, 2024
Viaarxiv icon

4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders

Add code
Jun 05, 2024
Viaarxiv icon

Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation

Add code
May 22, 2024
Viaarxiv icon

Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling

Add code
May 14, 2024
Figure 1 for Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling
Figure 2 for Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling
Figure 3 for Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling
Figure 4 for Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling
Viaarxiv icon

Point Resampling and Ray Transformation Aid to Editable NeRF Models

Add code
May 12, 2024
Figure 1 for Point Resampling and Ray Transformation Aid to Editable NeRF Models
Figure 2 for Point Resampling and Ray Transformation Aid to Editable NeRF Models
Figure 3 for Point Resampling and Ray Transformation Aid to Editable NeRF Models
Figure 4 for Point Resampling and Ray Transformation Aid to Editable NeRF Models
Viaarxiv icon

Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory

Add code
May 07, 2024
Figure 1 for Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory
Figure 2 for Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory
Figure 3 for Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory
Figure 4 for Characterizing the Dilemma of Performance and Index Size in Billion-Scale Vector Search and Breaking It with Second-Tier Memory
Viaarxiv icon

A Literature Review and Framework for Human Evaluation of Generative Large Language Models in Healthcare

Add code
May 04, 2024
Viaarxiv icon