Picture for Hao Yang

Hao Yang

Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement

Add code
Jan 21, 2024
Figure 1 for Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement
Figure 2 for Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement
Figure 3 for Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement
Figure 4 for Enhancing the vision-language foundation model with key semantic knowledge-emphasized report refinement
Viaarxiv icon

Using Large Language Model for End-to-End Chinese ASR and NER

Add code
Jan 21, 2024
Viaarxiv icon

Deep Ensemble Shape Calibration: Multi-Field Post-hoc Calibration in Online Advertising

Add code
Jan 17, 2024
Viaarxiv icon

A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model

Add code
Jan 17, 2024
Figure 1 for A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model
Figure 2 for A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model
Figure 3 for A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model
Figure 4 for A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model
Viaarxiv icon

UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction

Add code
Jan 11, 2024
Viaarxiv icon

R-BI: Regularized Batched Inputs enhance Incremental Decoding Framework for Low-Latency Simultaneous Speech Translation

Add code
Jan 11, 2024
Viaarxiv icon

Can AI Write Classical Chinese Poetry like Humans? An Empirical Study Inspired by Turing Test

Add code
Jan 10, 2024
Viaarxiv icon

Generalizable vision-language pre-training for annotation-free pathology localization

Add code
Jan 04, 2024
Viaarxiv icon

MLIP: Medical Language-Image Pre-training with Masked Local Representation Learning

Add code
Jan 03, 2024
Figure 1 for MLIP: Medical Language-Image Pre-training with Masked Local Representation Learning
Figure 2 for MLIP: Medical Language-Image Pre-training with Masked Local Representation Learning
Figure 3 for MLIP: Medical Language-Image Pre-training with Masked Local Representation Learning
Figure 4 for MLIP: Medical Language-Image Pre-training with Masked Local Representation Learning
Viaarxiv icon

Multimodal self-supervised learning for lesion localization

Add code
Jan 03, 2024
Figure 1 for Multimodal self-supervised learning for lesion localization
Figure 2 for Multimodal self-supervised learning for lesion localization
Figure 3 for Multimodal self-supervised learning for lesion localization
Figure 4 for Multimodal self-supervised learning for lesion localization
Viaarxiv icon