Xinyu Zhou

Seeing is Believing: Rich-Context Hallucination Detection for MLLMs via Backward Visual Grounding

Nov 15, 2025
Via arXiv

Text-based Aerial-Ground Person Retrieval

Nov 11, 2025
Via arXiv

Kimi Linear: An Expressive, Efficient Attention Architecture

Oct 30, 2025
Via arXiv

FieldGen: From Teleoperated Pre-Manipulation Trajectories to Field-Guided Data Generation

Oct 23, 2025
Via arXiv

Marine Chlorophyll Prediction and Driver Analysis based on LSTM-RF Hybrid Models

Aug 07, 2025
Via arXiv

Kimi K2: Open Agentic Intelligence

Jul 28, 2025
Via arXiv

Private Model Personalization Revisited

Jun 24, 2025
Via arXiv

VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?

May 29, 2025
Via arXiv

G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

May 19, 2025
Via arXiv

Private Transformer Inference in MLaaS: A Survey

May 15, 2025
Via arXiv