Picture for Tao Zhang

Tao Zhang

MMhops-R1: Multimodal Multi-hop Reasoning

Add code
Dec 16, 2025
Figure 1 for MMhops-R1: Multimodal Multi-hop Reasoning
Figure 2 for MMhops-R1: Multimodal Multi-hop Reasoning
Figure 3 for MMhops-R1: Multimodal Multi-hop Reasoning
Figure 4 for MMhops-R1: Multimodal Multi-hop Reasoning
Viaarxiv icon

IF-Bench: Benchmarking and Enhancing MLLMs for Infrared Images with Generative Visual Prompting

Add code
Dec 10, 2025
Viaarxiv icon

Reflecting with Two Voices: A Co-Adaptive Dual-Strategy Framework for LLM-Based Agent Decision Making

Add code
Dec 09, 2025
Viaarxiv icon

From Narrow Unlearning to Emergent Misalignment: Causes, Consequences, and Containment in LLMs

Add code
Nov 18, 2025
Figure 1 for From Narrow Unlearning to Emergent Misalignment: Causes, Consequences, and Containment in LLMs
Figure 2 for From Narrow Unlearning to Emergent Misalignment: Causes, Consequences, and Containment in LLMs
Figure 3 for From Narrow Unlearning to Emergent Misalignment: Causes, Consequences, and Containment in LLMs
Figure 4 for From Narrow Unlearning to Emergent Misalignment: Causes, Consequences, and Containment in LLMs
Viaarxiv icon

Beyond Randomness: Understand the Order of the Noise in Diffusion

Add code
Nov 11, 2025
Figure 1 for Beyond Randomness: Understand the Order of the Noise in Diffusion
Figure 2 for Beyond Randomness: Understand the Order of the Noise in Diffusion
Figure 3 for Beyond Randomness: Understand the Order of the Noise in Diffusion
Figure 4 for Beyond Randomness: Understand the Order of the Noise in Diffusion
Viaarxiv icon

Whole-Body Control With Terrain Estimation of A 6-DoF Wheeled Bipedal Robot

Add code
Nov 09, 2025
Figure 1 for Whole-Body Control With Terrain Estimation of A 6-DoF Wheeled Bipedal Robot
Figure 2 for Whole-Body Control With Terrain Estimation of A 6-DoF Wheeled Bipedal Robot
Figure 3 for Whole-Body Control With Terrain Estimation of A 6-DoF Wheeled Bipedal Robot
Figure 4 for Whole-Body Control With Terrain Estimation of A 6-DoF Wheeled Bipedal Robot
Viaarxiv icon

MapSAM2: Adapting SAM2 for Automatic Segmentation of Historical Map Images and Time Series

Add code
Oct 31, 2025
Figure 1 for MapSAM2: Adapting SAM2 for Automatic Segmentation of Historical Map Images and Time Series
Figure 2 for MapSAM2: Adapting SAM2 for Automatic Segmentation of Historical Map Images and Time Series
Figure 3 for MapSAM2: Adapting SAM2 for Automatic Segmentation of Historical Map Images and Time Series
Figure 4 for MapSAM2: Adapting SAM2 for Automatic Segmentation of Historical Map Images and Time Series
Viaarxiv icon

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Add code
Oct 23, 2025
Viaarxiv icon

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Add code
Oct 22, 2025
Viaarxiv icon

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

Add code
Oct 21, 2025
Figure 1 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 2 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 3 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Figure 4 for Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model
Viaarxiv icon