Jiajun Deng

Described Spatial-Temporal Video Detection

Jul 08, 2024
Via arXiv

Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation

Jul 08, 2024
Via arXiv

Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion

Jul 02, 2024
Via arXiv

Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition

Jun 14, 2024
Via arXiv

One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model

Jun 14, 2024
Via arXiv

Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask

Jun 14, 2024
Via arXiv

HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation

Mar 18, 2024
Via arXiv

Agent3D-Zero: An Agent for Zero-shot 3D Understanding

Mar 18, 2024
Via arXiv

PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest

Mar 14, 2024
Via arXiv

DeepEraser: Deep Iterative Context Mining for Generic Text Eraser

Feb 29, 2024
Via arXiv