Picture for Jiayi Kuang

Jiayi Kuang

TangramPuzzle: Evaluating Multimodal Large Language Models with Compositional Spatial Reasoning

Add code
Jan 23, 2026
Viaarxiv icon

EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration

Add code
Jan 23, 2026
Viaarxiv icon

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Add code
Dec 31, 2025
Viaarxiv icon

Refine Knowledge of Large Language Models via Adaptive Contrastive Learning

Add code
Feb 11, 2025
Viaarxiv icon

Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey

Add code
Nov 26, 2024
Figure 1 for Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
Figure 2 for Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
Figure 3 for Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
Figure 4 for Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
Viaarxiv icon

FLEX-CLIP: Feature-Level GEneration Network Enhanced CLIP for X-shot Cross-modal Retrieval

Add code
Nov 26, 2024
Viaarxiv icon

Dynamic Demonstration Retrieval and Cognitive Understanding for Emotional Support Conversation

Add code
Apr 03, 2024
Viaarxiv icon