Picture for Jizhou Huang

Jizhou Huang

MapAgent: An Industrial-Grade Agentic Framework for City-scale Lane-level Map Generation

Add code
Jun 03, 2026
Viaarxiv icon

VISOR: Agentic Visual Retrieval-Augmented Generation via Iterative Search and Over-horizon Reasoning

Add code
Apr 10, 2026
Viaarxiv icon

Beyond End-to-End Video Models: An LLM-Based Multi-Agent System for Educational Video Generation

Add code
Feb 12, 2026
Viaarxiv icon

Video-MSR: Benchmarking Multi-hop Spatial Reasoning Capabilities of MLLMs

Add code
Jan 14, 2026
Viaarxiv icon

Adversarial Yet Cooperative: Multi-Perspective Reasoning in Retrieved-Augmented Language Models

Add code
Jan 08, 2026
Viaarxiv icon

Decide Then Retrieve: A Training-Free Framework with Uncertainty-Guided Triggering and Dual-Path Retrieval

Add code
Jan 07, 2026
Viaarxiv icon

BBox DocVQA: A Large Scale Bounding Box Grounded Dataset for Enhancing Reasoning in Document Visual Question Answer

Add code
Nov 19, 2025
Viaarxiv icon

Facial-R1: Aligning Reasoning and Recognition for Facial Emotion Analysis

Add code
Nov 13, 2025
Figure 1 for Facial-R1: Aligning Reasoning and Recognition for Facial Emotion Analysis
Figure 2 for Facial-R1: Aligning Reasoning and Recognition for Facial Emotion Analysis
Figure 3 for Facial-R1: Aligning Reasoning and Recognition for Facial Emotion Analysis
Figure 4 for Facial-R1: Aligning Reasoning and Recognition for Facial Emotion Analysis
Viaarxiv icon

Personalized Prediction By Learning Halfspace Reference Classes Under Well-Behaved Distribution

Add code
Sep 19, 2025
Figure 1 for Personalized Prediction By Learning Halfspace Reference Classes Under Well-Behaved Distribution
Figure 2 for Personalized Prediction By Learning Halfspace Reference Classes Under Well-Behaved Distribution
Figure 3 for Personalized Prediction By Learning Halfspace Reference Classes Under Well-Behaved Distribution
Viaarxiv icon

Cross-LoRA: A Data-Free LoRA Transfer Framework across Heterogeneous LLMs

Add code
Aug 07, 2025
Viaarxiv icon