Ziyuan Huang

Sequential Strategic Classification with Multi-Stage Selective Classifiers

May 05, 2026

Perceptual Flow Network for Visually Grounded Reasoning

May 04, 2026

TC-AE: Unlocking Token Capacity for Deep Compression Autoencoders

Apr 08, 2026

AgentVLN: Towards Agentic Vision-and-Language Navigation

Mar 18, 2026

Multi-Level Strategic Classification: Incentivizing Improvement through Promotion and Relegation Dynamics

Feb 11, 2026

ARGenSeg: Image Segmentation with Autoregressive Image Generation Model

Oct 23, 2025

Vision-Centric Activation and Coordination for Multimodal Large Language Models

Oct 16, 2025

Ming-Omni: A Unified Multimodal Model for Perception and Generation

Jun 11, 2025

Advancing 3D Medical Image Segmentation: Unleashing the Potential of Planarian Neural Networks in Artificial Intelligence

May 07, 2025

Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction

May 05, 2025