Picture for Jiajun Deng

Jiajun Deng

MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition

Add code
May 30, 2025
Viaarxiv icon

SpatialSplat: Efficient Semantic 3D from Sparse Unposed Images

Add code
May 29, 2025
Viaarxiv icon

On-the-fly Routing for Zero-shot MoE Speaker Adaptation of Speech Foundation Models for Dysarthric Speech Recognition

Add code
May 28, 2025
Viaarxiv icon

Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots

Add code
May 26, 2025
Viaarxiv icon

Self-Classification Enhancement and Correction for Weakly Supervised Object Detection

Add code
May 22, 2025
Viaarxiv icon

CrossMuSim: A Cross-Modal Framework for Music Similarity Retrieval with LLM-Powered Text Description Sourcing and Mining

Add code
Mar 29, 2025
Viaarxiv icon

GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions

Add code
Mar 20, 2025
Viaarxiv icon

Efficient Adapter Tuning for Joint Singing Voice Beat and Downbeat Tracking with Self-supervised Learning Features

Add code
Mar 13, 2025
Viaarxiv icon

S3R-GS: Streamlining the Pipeline for Large-Scale Street Scene Reconstruction

Add code
Mar 11, 2025
Viaarxiv icon

Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition

Add code
Jan 08, 2025
Figure 1 for Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition
Figure 2 for Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition
Figure 3 for Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition
Figure 4 for Phone-purity Guided Discrete Tokens for Dysarthric Speech Recognition
Viaarxiv icon