Ngan Le

Multimodal Learning and Cognitive Processes in Radiology: MedGaze for Chest X-ray Scanpath Prediction

Jun 28, 2024
Via arXiv

Enhancing Radiological Diagnosis: A Collaborative Approach Integrating AI and Human Expertise for Visual Miss Correction

Jun 28, 2024
Via arXiv

CattleFace-RGBT: RGB-T Cattle Facial Landmark Benchmark

Jun 05, 2024
Via arXiv

HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model

Jun 01, 2024
Via arXiv

Accelerating Transformers with Spectrum-Preserving Token Merging

May 25, 2024
Via arXiv

S3Former: Self-supervised High-resolution Transformer for Solar PV Profiling

May 07, 2024
Via arXiv

CarcassFormer: An End-to-end Transformer-based Framework for Simultaneous Localization, Segmentation and Classification of Poultry Carcass Defect

Apr 17, 2024
Via arXiv

Unifying Global and Local Scene Entities Modelling for Precise Action Spotting

Apr 15, 2024
Via arXiv

ShapeFormer: Shape Prior Visible-to-Amodal Transformer-based Amodal Instance Segmentation

Mar 22, 2024
Via arXiv

WAVER: Writing-style Agnostic Video Retrieval via Distilling Vision-Language Models Through Open-Vocabulary Knowledge

Dec 27, 2023
Via arXiv