Antoni B. Chan

Threading Keyframe with Narratives: MLLMs as Strong Long Video Comprehenders

May 30, 2025, via arXiv

Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting

May 28, 2025, via arXiv

Advancing Multiple Instance Learning with Continual Learning for Whole Slide Imaging

May 15, 2025, via arXiv

Density-based Object Detection in Crowded Scenes

Apr 14, 2025, via arXiv

Embodied Crowd Counting

Mar 11, 2025, via arXiv

Grad-ECLIP: Gradient-based Visual and Textual Explanations for CLIP

Feb 26, 2025, via arXiv

Re-Attentional Controllable Video Diffusion Editing

Dec 16, 2024, via arXiv

DistinctAD: Distinctive Audio Description Generation in Contexts

Nov 27, 2024, via arXiv

GeneQuery: A General QA-based Framework for Spatial Gene Expression Predictions from Histology Images

Nov 27, 2024, via arXiv

Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization

Sep 03, 2024, via arXiv