Picture for Yuyin Zhou

Yuyin Zhou

Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation

Add code
Mar 17, 2026
Viaarxiv icon

MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild

Add code
Mar 17, 2026
Viaarxiv icon

VecGlypher: Unified Vector Glyph Generation with Language Models

Add code
Feb 25, 2026
Viaarxiv icon

EntropyPrune: Matrix Entropy Guided Visual Token Pruning for Multimodal Large Language Models

Add code
Feb 19, 2026
Viaarxiv icon

What if Agents Could Imagine? Reinforcing Open-Vocabulary HOI Comprehension through Generation

Add code
Feb 12, 2026
Viaarxiv icon

Controllable Layered Image Generation for Real-World Editing

Add code
Jan 21, 2026
Viaarxiv icon

OpenVision 3: A Family of Unified Visual Encoder for Both Understanding and Generation

Add code
Jan 21, 2026
Viaarxiv icon

All You Need Are Random Visual Tokens? Demystifying Token Pruning in VLLMs

Add code
Dec 08, 2025
Viaarxiv icon

MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

Add code
Oct 29, 2025
Viaarxiv icon

GauSSmart: Enhanced 3D Reconstruction through 2D Foundation Models and Geometric Filtering

Add code
Oct 16, 2025
Viaarxiv icon