Yong Man Ro

MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection

Mar 22, 2024
Taeheon Kim, Sangyun Chung, Damin Yeom, Youngjoon Yu, Hak Gu Kim, Yong Man Ro

What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models

Mar 20, 2024
Junho Kim, Yeon Ju Kim, Yong Man Ro

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Mar 12, 2024
Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro

Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation

Mar 07, 2024
Seunghee Han, Se Jin Park, Chae Won Kim, Yong Man Ro

Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection

Mar 02, 2024
Taeheon Kim, Sebin Shin, Youngjoon Yu, Hak Gu Kim, Yong Man Ro

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Feb 25, 2024
Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro

Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing

Feb 23, 2024
Jeong Hun Yeo, Seunghee Han, Minsu Kim, Yong Man Ro

CoLLaVO: Crayon Large Language and Vision mOdel

Feb 20, 2024
Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro

Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units

Jan 18, 2024
Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Se Jin Park, Yong Man Ro

AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Dec 05, 2023
Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro
