Picture for Cong-Duy Nguyen

Cong-Duy Nguyen

Meta-optimized Angular Margin Contrastive Framework for Video-Language Representation Learning

Add code
Jul 04, 2024
Viaarxiv icon

Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives

Add code
Jun 09, 2024
Figure 1 for Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
Figure 2 for Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
Figure 3 for Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
Figure 4 for Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
Viaarxiv icon

KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning

Add code
Mar 26, 2024
Figure 1 for KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning
Figure 2 for KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning
Figure 3 for KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning
Figure 4 for KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning
Viaarxiv icon

On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

Add code
Feb 01, 2024
Figure 1 for On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling
Figure 2 for On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling
Figure 3 for On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling
Figure 4 for On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling
Viaarxiv icon

READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling

Add code
Dec 12, 2023
Figure 1 for READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling
Figure 2 for READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling
Figure 3 for READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling
Figure 4 for READ-PVLA: Recurrent Adapter with Partial Video-Language Alignment for Parameter-Efficient Transfer Learning in Low-Resource Video-Language Modeling
Viaarxiv icon

DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding

Add code
Dec 05, 2023
Figure 1 for DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding
Figure 2 for DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding
Figure 3 for DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding
Figure 4 for DemaFormer: Damped Exponential Moving Average Transformer with Energy-Based Modeling for Temporal Language Grounding
Viaarxiv icon

Improving Multimodal Sentiment Analysis: Supervised Angular Margin-based Contrastive Learning for Enhanced Fusion Representation

Add code
Dec 04, 2023
Figure 1 for Improving Multimodal Sentiment Analysis: Supervised Angular Margin-based Contrastive Learning for Enhanced Fusion Representation
Figure 2 for Improving Multimodal Sentiment Analysis: Supervised Angular Margin-based Contrastive Learning for Enhanced Fusion Representation
Figure 3 for Improving Multimodal Sentiment Analysis: Supervised Angular Margin-based Contrastive Learning for Enhanced Fusion Representation
Figure 4 for Improving Multimodal Sentiment Analysis: Supervised Angular Margin-based Contrastive Learning for Enhanced Fusion Representation
Viaarxiv icon

Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment

Add code
Dec 04, 2023
Figure 1 for Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Figure 2 for Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Figure 3 for Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Figure 4 for Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment
Viaarxiv icon

Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction

Add code
May 25, 2023
Figure 1 for Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction
Figure 2 for Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction
Figure 3 for Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction
Figure 4 for Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction
Viaarxiv icon

Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions

Add code
Nov 07, 2022
Figure 1 for Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions
Figure 2 for Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions
Figure 3 for Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions
Figure 4 for Adaptive Contrastive Learning on Multimodal Transformer for Review Helpfulness Predictions
Viaarxiv icon