Picture for Zeyi Huang

Zeyi Huang

DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation

Add code
Dec 10, 2025
Viaarxiv icon

Multi-Joint Physics-Informed Deep Learning Framework for Time-Efficient Inverse Dynamics

Add code
Nov 14, 2025
Viaarxiv icon

MultiCrafter: High-Fidelity Multi-Subject Generation via Spatially Disentangled Attention and Identity-Aware Reinforcement Learning

Add code
Sep 26, 2025
Viaarxiv icon

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

Add code
May 26, 2025
Figure 1 for VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Figure 2 for VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Figure 3 for VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Figure 4 for VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection
Viaarxiv icon

T2I-ConBench: Text-to-Image Benchmark for Continual Post-training

Add code
May 22, 2025
Viaarxiv icon

Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Add code
Apr 22, 2025
Viaarxiv icon

HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting

Add code
Mar 25, 2025
Figure 1 for HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting
Figure 2 for HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting
Figure 3 for HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting
Figure 4 for HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting
Viaarxiv icon

Do Vision Models Develop Human-Like Progressive Difficulty Understanding?

Add code
Mar 17, 2025
Viaarxiv icon

IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents

Add code
Feb 25, 2025
Figure 1 for IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents
Figure 2 for IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents
Figure 3 for IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents
Figure 4 for IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents
Viaarxiv icon

Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs

Add code
Jan 08, 2025
Figure 1 for Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Figure 2 for Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Figure 3 for Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Figure 4 for Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
Viaarxiv icon