Picture for Zhiqi Huang

Zhiqi Huang

TCDA: Thread-Constrained Discourse-Aware Modeling for Conversational Sentiment Quadruple Analysis

Add code
May 03, 2026
Viaarxiv icon

Learning to Route Queries to Heads for Attention-based Re-ranking with Large Language Models

Add code
Apr 27, 2026
Viaarxiv icon

Kimi K2.5: Visual Agentic Intelligence

Add code
Feb 02, 2026
Viaarxiv icon

WorldVQA: Measuring Atomic World Knowledge in Multimodal Large Language Models

Add code
Jan 28, 2026
Viaarxiv icon

Towards Pixel-Level VLM Perception via Simple Points Prediction

Add code
Jan 27, 2026
Viaarxiv icon

BabyVision: Visual Reasoning Beyond Language

Add code
Jan 10, 2026
Viaarxiv icon

MMErroR: A Benchmark for Erroneous Reasoning in Vision-Language Models

Add code
Jan 06, 2026
Viaarxiv icon

Kimi K2: Open Agentic Intelligence

Add code
Jul 28, 2025
Figure 1 for Kimi K2: Open Agentic Intelligence
Figure 2 for Kimi K2: Open Agentic Intelligence
Figure 3 for Kimi K2: Open Agentic Intelligence
Figure 4 for Kimi K2: Open Agentic Intelligence
Viaarxiv icon

G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

Add code
May 19, 2025
Figure 1 for G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Figure 2 for G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Figure 3 for G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Figure 4 for G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
Viaarxiv icon

Kimi-VL Technical Report

Add code
Apr 10, 2025
Figure 1 for Kimi-VL Technical Report
Figure 2 for Kimi-VL Technical Report
Figure 3 for Kimi-VL Technical Report
Figure 4 for Kimi-VL Technical Report
Viaarxiv icon