Yongkang Wong

STAR: Skeleton-aware Text-based 4D Avatar Generation with In-Network Motion Retargeting

Jun 07, 2024
Via arXiv

TOPA: Extend Large Language Models for Video Understanding via Text-Only Pre-Alignment

May 22, 2024
Via arXiv

Bridging the Intent Gap: Knowledge-Enhanced Visual Generation

May 21, 2024
Via arXiv

Finetuning Text-to-Image Diffusion Models for Fairness

Nov 11, 2023
Via arXiv

ELIP: Efficient Language-Image Pre-training with Fewer Vision Tokens

Sep 28, 2023
Via arXiv

MCM: Multi-condition Motion Synthesis Framework for Multi-scenario

Sep 06, 2023
Via arXiv

A Study on Differentiable Logic and LLMs for EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2023

Jul 13, 2023
Via arXiv

Chairs Can be Stood on: Overcoming Object Bias in Human-Object Interaction Detection

Jul 06, 2022
Via arXiv

Distance Matters in Human-Object Interaction Detection

Jul 05, 2022
Via arXiv

A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA

Jun 30, 2022
Via arXiv