Jiaqi Song

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR

Apr 20, 2026

ARGen: Affect-Reinforced Generative Augmentation towards Vision-based Dynamic Emotion Perception

Apr 14, 2026

Rethinking Entropy Allocation in LLM-based ASR: Understanding the Dynamics between Speech Encoders and LLMs

Apr 09, 2026

FastAdaSP: Multitask-Adapted Efficient Inference for Large Speech Language Model

Oct 03, 2024

SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data

Aug 01, 2024

Temporal compressive edge imaging enabled by a lensless diffuser camera

Sep 13, 2023

Dual-mode adaptive-SVD ghost imaging

Feb 14, 2023