Picture for Chi Zhang

Chi Zhang

Department of Computer Science and Engineering, University of Gothenburg, Sweden

NFIG: Autoregressive Image Generation with Next-Frequency Prediction

Add code
Mar 10, 2025
Figure 1 for NFIG: Autoregressive Image Generation with Next-Frequency Prediction
Figure 2 for NFIG: Autoregressive Image Generation with Next-Frequency Prediction
Figure 3 for NFIG: Autoregressive Image Generation with Next-Frequency Prediction
Figure 4 for NFIG: Autoregressive Image Generation with Next-Frequency Prediction
Viaarxiv icon

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Add code
Mar 09, 2025
Viaarxiv icon

Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 Challenge

Add code
Mar 05, 2025
Viaarxiv icon

AppAgentX: Evolving GUI Agents as Proficient Smartphone Users

Add code
Mar 04, 2025
Figure 1 for AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Figure 2 for AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Figure 3 for AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Figure 4 for AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Viaarxiv icon

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation

Add code
Feb 26, 2025
Figure 1 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 2 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 3 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 4 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Viaarxiv icon

Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator

Add code
Feb 26, 2025
Viaarxiv icon

Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion

Add code
Feb 20, 2025
Viaarxiv icon

Neural Force Field: Learning Generalized Physical Representation from a Few Examples

Add code
Feb 13, 2025
Figure 1 for Neural Force Field: Learning Generalized Physical Representation from a Few Examples
Figure 2 for Neural Force Field: Learning Generalized Physical Representation from a Few Examples
Figure 3 for Neural Force Field: Learning Generalized Physical Representation from a Few Examples
Figure 4 for Neural Force Field: Learning Generalized Physical Representation from a Few Examples
Viaarxiv icon

UniForm: A Unified Diffusion Transformer for Audio-Video Generation

Add code
Feb 08, 2025
Figure 1 for UniForm: A Unified Diffusion Transformer for Audio-Video Generation
Figure 2 for UniForm: A Unified Diffusion Transformer for Audio-Video Generation
Figure 3 for UniForm: A Unified Diffusion Transformer for Audio-Video Generation
Figure 4 for UniForm: A Unified Diffusion Transformer for Audio-Video Generation
Viaarxiv icon

PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression

Add code
Feb 07, 2025
Viaarxiv icon