Picture for Jonghwan Mun

Jonghwan Mun

General Item Representation Learning for Cold-start Content Recommendations

Apr 22, 2024
Figure 1 for General Item Representation Learning for Cold-start Content Recommendations
Figure 2 for General Item Representation Learning for Cold-start Content Recommendations
Figure 3 for General Item Representation Learning for Cold-start Content Recommendations
Figure 4 for General Item Representation Learning for Cold-start Content Recommendations
Viaarxiv icon

Honeybee: Locality-enhanced Projector for Multimodal LLM

Add code
Dec 11, 2023
Figure 1 for Honeybee: Locality-enhanced Projector for Multimodal LLM
Figure 2 for Honeybee: Locality-enhanced Projector for Multimodal LLM
Figure 3 for Honeybee: Locality-enhanced Projector for Multimodal LLM
Figure 4 for Honeybee: Locality-enhanced Projector for Multimodal LLM
Viaarxiv icon

Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection

Add code
Dec 04, 2023
Figure 1 for Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection
Figure 2 for Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection
Figure 3 for Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection
Figure 4 for Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection
Viaarxiv icon

NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

Add code
Sep 11, 2023
Figure 1 for NICE: CVPR 2023 Challenge on Zero-shot Image Captioning
Figure 2 for NICE: CVPR 2023 Challenge on Zero-shot Image Captioning
Figure 3 for NICE: CVPR 2023 Challenge on Zero-shot Image Captioning
Figure 4 for NICE: CVPR 2023 Challenge on Zero-shot Image Captioning
Viaarxiv icon

Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning

Add code
Dec 27, 2022
Figure 1 for Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning
Figure 2 for Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning
Figure 3 for Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning
Figure 4 for Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning
Viaarxiv icon

Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs

Add code
Dec 01, 2022
Figure 1 for Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs
Figure 2 for Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs
Figure 3 for Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs
Figure 4 for Learning to Generate Text-grounded Mask for Open-world Semantic Segmentation from Only Image-Text Pairs
Viaarxiv icon

MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection

Add code
Mar 28, 2022
Figure 1 for MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Figure 2 for MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Figure 3 for MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Figure 4 for MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
Viaarxiv icon

Boundary-aware Self-supervised Learning for Video Scene Segmentation

Add code
Jan 14, 2022
Figure 1 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 2 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 3 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Figure 4 for Boundary-aware Self-supervised Learning for Video Scene Segmentation
Viaarxiv icon

Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts

Add code
Oct 13, 2021
Figure 1 for Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Figure 2 for Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Figure 3 for Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Figure 4 for Winning the ICCV'2021 VALUE Challenge: Task-aware Ensemble and Transfer Learning with Visual Concepts
Viaarxiv icon

Local-Global Video-Text Interactions for Temporal Grounding

Add code
Apr 16, 2020
Figure 1 for Local-Global Video-Text Interactions for Temporal Grounding
Figure 2 for Local-Global Video-Text Interactions for Temporal Grounding
Figure 3 for Local-Global Video-Text Interactions for Temporal Grounding
Figure 4 for Local-Global Video-Text Interactions for Temporal Grounding
Viaarxiv icon