Jiliang Hu

End-to-end Contrastive Language-Speech Pretraining Model For Long-form Spoken Question Answering

Nov 12, 2025
Via arXiv

Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding

Jan 13, 2025
Via arXiv

VHASR: A Multimodal Speech Recognition System With Vision Hotwords

Oct 01, 2024
Via arXiv