Picture for Chi Zhang

Chi Zhang

Department of Computer Science and Engineering, University of Gothenburg, Sweden

AppAgentX: Evolving GUI Agents as Proficient Smartphone Users

Add code
Mar 04, 2025
Figure 1 for AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Figure 2 for AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Figure 3 for AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Figure 4 for AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
Viaarxiv icon

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation

Add code
Feb 26, 2025
Figure 1 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 2 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 3 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 4 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Viaarxiv icon

Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator

Add code
Feb 26, 2025
Viaarxiv icon

Monocular Depth Estimation and Segmentation for Transparent Object with Iterative Semantic and Geometric Fusion

Add code
Feb 20, 2025
Viaarxiv icon

Neural Force Field: Learning Generalized Physical Representation from a Few Examples

Add code
Feb 13, 2025
Figure 1 for Neural Force Field: Learning Generalized Physical Representation from a Few Examples
Figure 2 for Neural Force Field: Learning Generalized Physical Representation from a Few Examples
Figure 3 for Neural Force Field: Learning Generalized Physical Representation from a Few Examples
Figure 4 for Neural Force Field: Learning Generalized Physical Representation from a Few Examples
Viaarxiv icon

UniForm: A Unified Diffusion Transformer for Audio-Video Generation

Add code
Feb 08, 2025
Figure 1 for UniForm: A Unified Diffusion Transformer for Audio-Video Generation
Figure 2 for UniForm: A Unified Diffusion Transformer for Audio-Video Generation
Figure 3 for UniForm: A Unified Diffusion Transformer for Audio-Video Generation
Figure 4 for UniForm: A Unified Diffusion Transformer for Audio-Video Generation
Viaarxiv icon

PoI: Pixel of Interest for Novel View Synthesis Assisted Scene Coordinate Regression

Add code
Feb 07, 2025
Viaarxiv icon

MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent

Add code
Feb 05, 2025
Figure 1 for MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent
Figure 2 for MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent
Figure 3 for MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent
Figure 4 for MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent
Viaarxiv icon

Learning to Plan with Personalized Preferences

Add code
Feb 02, 2025
Figure 1 for Learning to Plan with Personalized Preferences
Figure 2 for Learning to Plan with Personalized Preferences
Figure 3 for Learning to Plan with Personalized Preferences
Figure 4 for Learning to Plan with Personalized Preferences
Viaarxiv icon

Behavior Modeling Space Reconstruction for E-Commerce Search

Add code
Jan 30, 2025
Figure 1 for Behavior Modeling Space Reconstruction for E-Commerce Search
Figure 2 for Behavior Modeling Space Reconstruction for E-Commerce Search
Figure 3 for Behavior Modeling Space Reconstruction for E-Commerce Search
Figure 4 for Behavior Modeling Space Reconstruction for E-Commerce Search
Viaarxiv icon