Picture for Yi Zhang

Yi Zhang

Carnegie Mellon University

MM-Food-100K: A 100,000-Sample Multimodal Food Intelligence Dataset with Verifiable Provenance

Add code
Aug 14, 2025
Viaarxiv icon

Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset

Add code
Jun 23, 2025
Viaarxiv icon

Towards Seamless Borders: A Method for Mitigating Inconsistencies in Image Inpainting and Outpainting

Add code
Jun 14, 2025
Viaarxiv icon

SlotPi: Physics-informed Object-centric Reasoning Models

Add code
Jun 12, 2025
Viaarxiv icon

DiffPR: Diffusion-Based Phase Reconstruction via Frequency-Decoupled Learning

Add code
Jun 12, 2025
Viaarxiv icon

3DGeoDet: General-purpose Geometry-aware Image-based 3D Object Detection

Add code
Jun 11, 2025
Viaarxiv icon

On Reasoning Strength Planning in Large Reasoning Models

Add code
Jun 10, 2025
Viaarxiv icon

GenIR: Generative Visual Feedback for Mental Image Retrieval

Add code
Jun 06, 2025
Viaarxiv icon

Towards provable probabilistic safety for scalable embodied AI systems

Add code
Jun 05, 2025
Viaarxiv icon

Beyond the LUMIR challenge: The pathway to foundational registration models

Add code
May 30, 2025
Viaarxiv icon