Picture for Yong Man Ro

Yong Man Ro

TroL: Traversal of Layers for Large Language and Vision Models

Add code
Jun 18, 2024
Viaarxiv icon

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation

Add code
Jun 12, 2024
Viaarxiv icon

CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models

Add code
Jun 04, 2024
Figure 1 for CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models
Figure 2 for CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models
Figure 3 for CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models
Figure 4 for CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models
Viaarxiv icon

Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models

Add code
May 27, 2024
Viaarxiv icon

Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank

Add code
Apr 30, 2024
Figure 1 for Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Figure 2 for Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Figure 3 for Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Figure 4 for Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank
Viaarxiv icon

MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection

Add code
Mar 22, 2024
Viaarxiv icon

What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models

Add code
Mar 20, 2024
Figure 1 for What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models
Figure 2 for What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models
Figure 3 for What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models
Figure 4 for What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models
Viaarxiv icon

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Add code
Mar 12, 2024
Figure 1 for MoAI: Mixture of All Intelligence for Large Language and Vision Models
Figure 2 for MoAI: Mixture of All Intelligence for Large Language and Vision Models
Figure 3 for MoAI: Mixture of All Intelligence for Large Language and Vision Models
Figure 4 for MoAI: Mixture of All Intelligence for Large Language and Vision Models
Viaarxiv icon

Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation

Add code
Mar 07, 2024
Figure 1 for Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Figure 2 for Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Figure 3 for Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Figure 4 for Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Viaarxiv icon

Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection

Add code
Mar 02, 2024
Figure 1 for Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Figure 2 for Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Figure 3 for Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Figure 4 for Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection
Viaarxiv icon