Picture for Zeyi Huang

Zeyi Huang

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

Add code
May 26, 2025
Viaarxiv icon

T2I-ConBench: Text-to-Image Benchmark for Continual Post-training

Add code
May 22, 2025
Viaarxiv icon

Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts

Add code
Apr 22, 2025
Viaarxiv icon

HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting

Add code
Mar 25, 2025
Viaarxiv icon

Do Vision Models Develop Human-Like Progressive Difficulty Understanding?

Add code
Mar 17, 2025
Viaarxiv icon

IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Agents

Add code
Feb 25, 2025
Viaarxiv icon

Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs

Add code
Jan 08, 2025
Viaarxiv icon

Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation

Add code
Dec 03, 2024
Figure 1 for Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Figure 2 for Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Figure 3 for Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Figure 4 for Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
Viaarxiv icon

Ascend HiFloat8 Format for Deep Learning

Add code
Sep 26, 2024
Figure 1 for Ascend HiFloat8 Format for Deep Learning
Figure 2 for Ascend HiFloat8 Format for Deep Learning
Figure 3 for Ascend HiFloat8 Format for Deep Learning
Figure 4 for Ascend HiFloat8 Format for Deep Learning
Viaarxiv icon

PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion

Add code
Dec 29, 2023
Viaarxiv icon