Kaixin Li

CrownGen: Patient-customized Crown Generation via Point Diffusion Model

Dec 26, 2025

MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique

Nov 12, 2025

Grounding Computer Use Agents on Human Demonstrations

Nov 10, 2025

MemeArena: Automating Context-Aware Unbiased Evaluation of Harmfulness Understanding for Multimodal Large Language Models

Oct 31, 2025

AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness

Jul 02, 2025

Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning

May 18, 2025

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning

May 18, 2025

MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation

Feb 18, 2025

Robi Butler: Remote Multimodal Interactions with Household Robot Assistant

Sep 30, 2024

Not All Samples Should Be Utilized Equally: Towards Understanding and Improving Dataset Distillation

Aug 22, 2024