Picture for Hongxiang Li

Hongxiang Li

Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning

Add code
May 31, 2024
Viaarxiv icon

Textual Inversion and Self-supervised Refinement for Radiology Report Generation

Add code
May 31, 2024
Viaarxiv icon

Uncertainty-aware sign language video retrieval with probability distribution modeling

Add code
May 30, 2024
Figure 1 for Uncertainty-aware sign language video retrieval with probability distribution modeling
Figure 2 for Uncertainty-aware sign language video retrieval with probability distribution modeling
Figure 3 for Uncertainty-aware sign language video retrieval with probability distribution modeling
Figure 4 for Uncertainty-aware sign language video retrieval with probability distribution modeling
Viaarxiv icon

Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation

Add code
Apr 03, 2024
Figure 1 for Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation
Figure 2 for Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation
Figure 3 for Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation
Figure 4 for Cross-Modal Conditioned Reconstruction for Language-guided Medical Image Segmentation
Viaarxiv icon

Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction

Add code
Jan 25, 2024
Viaarxiv icon

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding

Add code
Nov 19, 2023
Viaarxiv icon

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

Add code
Aug 18, 2023
Figure 1 for G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
Figure 2 for G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
Figure 3 for G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
Figure 4 for G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
Viaarxiv icon

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

Add code
Apr 05, 2023
Figure 1 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 2 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 3 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 4 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Viaarxiv icon

SSVMR: Saliency-based Self-training for Video-Music Retrieval

Add code
Feb 18, 2023
Figure 1 for SSVMR: Saliency-based Self-training for Video-Music Retrieval
Figure 2 for SSVMR: Saliency-based Self-training for Video-Music Retrieval
Figure 3 for SSVMR: Saliency-based Self-training for Video-Music Retrieval
Figure 4 for SSVMR: Saliency-based Self-training for Video-Music Retrieval
Viaarxiv icon

Generating Templated Caption for Video Grounding

Add code
Jan 15, 2023
Figure 1 for Generating Templated Caption for Video Grounding
Figure 2 for Generating Templated Caption for Video Grounding
Figure 3 for Generating Templated Caption for Video Grounding
Figure 4 for Generating Templated Caption for Video Grounding
Viaarxiv icon