Picture for Son Tran

Son Tran

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Add code
Jul 18, 2024
Viaarxiv icon

Open Vocabulary Multi-Label Video Classification

Add code
Jul 12, 2024
Viaarxiv icon

VidLA: Video-Language Alignment at Scale

Add code
Mar 21, 2024
Figure 1 for VidLA: Video-Language Alignment at Scale
Figure 2 for VidLA: Video-Language Alignment at Scale
Figure 3 for VidLA: Video-Language Alignment at Scale
Figure 4 for VidLA: Video-Language Alignment at Scale
Viaarxiv icon

UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with Geometric Topology Guidance

Add code
Sep 03, 2023
Figure 1 for UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with Geometric Topology Guidance
Figure 2 for UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with Geometric Topology Guidance
Figure 3 for UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with Geometric Topology Guidance
Figure 4 for UnsMOT: Unified Framework for Unsupervised Multi-Object Tracking with Geometric Topology Guidance
Viaarxiv icon

SurveyLM: A platform to explore emerging value perspectives in augmented language models' behaviors

Add code
Aug 01, 2023
Figure 1 for SurveyLM: A platform to explore emerging value perspectives in augmented language models' behaviors
Viaarxiv icon

Vision-Language Pre-Training with Triple Contrastive Learning

Add code
Mar 28, 2022
Figure 1 for Vision-Language Pre-Training with Triple Contrastive Learning
Figure 2 for Vision-Language Pre-Training with Triple Contrastive Learning
Figure 3 for Vision-Language Pre-Training with Triple Contrastive Learning
Figure 4 for Vision-Language Pre-Training with Triple Contrastive Learning
Viaarxiv icon

Multi-modal Alignment using Representation Codebook

Add code
Mar 28, 2022
Figure 1 for Multi-modal Alignment using Representation Codebook
Figure 2 for Multi-modal Alignment using Representation Codebook
Figure 3 for Multi-modal Alignment using Representation Codebook
Figure 4 for Multi-modal Alignment using Representation Codebook
Viaarxiv icon

Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection

Add code
Dec 19, 2021
Figure 1 for Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection
Figure 2 for Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection
Figure 3 for Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection
Figure 4 for Parallel Multi-Scale Networks with Deep Supervision for Hand Keypoint Detection
Viaarxiv icon

SLADE: A Self-Training Framework For Distance Metric Learning

Add code
Nov 20, 2020
Figure 1 for SLADE: A Self-Training Framework For Distance Metric Learning
Figure 2 for SLADE: A Self-Training Framework For Distance Metric Learning
Figure 3 for SLADE: A Self-Training Framework For Distance Metric Learning
Figure 4 for SLADE: A Self-Training Framework For Distance Metric Learning
Viaarxiv icon

CogniFNN: A Fuzzy Neural Network Framework for Cognitive Word Embedding Evaluation

Add code
Sep 24, 2020
Figure 1 for CogniFNN: A Fuzzy Neural Network Framework for Cognitive Word Embedding Evaluation
Figure 2 for CogniFNN: A Fuzzy Neural Network Framework for Cognitive Word Embedding Evaluation
Figure 3 for CogniFNN: A Fuzzy Neural Network Framework for Cognitive Word Embedding Evaluation
Figure 4 for CogniFNN: A Fuzzy Neural Network Framework for Cognitive Word Embedding Evaluation
Viaarxiv icon