Jihwan Park

Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection

Mar 26, 2024
Via arXiv

Open-vocabulary Video Question Answering: A New Benchmark for Evaluating the Generalizability of Video Question Answering Models

Aug 18, 2023
Via arXiv

Joint unsupervised and supervised learning for context-aware language identification

Apr 14, 2023
Via arXiv

That's What I Said: Fully-Controllable Talking Face Generation

Apr 06, 2023
Via arXiv

Metric Learning for User-defined Keyword Spotting

Nov 01, 2022
Via arXiv

Consistency Learning via Decoding Path Augmentation for Transformers in Human Object Interaction Detection

Apr 11, 2022
Via arXiv

Deformable Graph Convolutional Networks

Dec 29, 2021
Via arXiv