Mengfan Dong

Kimi K2.5: Visual Agentic Intelligence

Feb 02, 2026

Kimi-VL Technical Report

Apr 10, 2025

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization

Nov 17, 2024

MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model

Aug 26, 2024

Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models

Feb 24, 2024

Hallucination Augmented Contrastive Learning for Multimodal Large Language Model

Dec 13, 2023