Mingyang Zhang

RoboStereo: Dual-Tower 4D Embodied World Models for Unified Policy Optimization

Mar 13, 2026
Via arXiv

Learning Personalized Agents from Human Feedback

Feb 18, 2026
Via arXiv

A Spatial-Spectral-Frequency Interactive Network for Multimodal Remote Sensing Classification

Oct 06, 2025
Via arXiv

Consistency Trajectory Matching for One-Step Generative Super-Resolution

Mar 27, 2025
Via arXiv

Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation

Jan 07, 2025
Via arXiv

Channel Merging: Preserving Specialization for Merged Experts

Dec 18, 2024
Via arXiv

Disentangling the Prosody and Semantic Information with Pre-trained Model for In-Context Learning based Zero-Shot Voice Conversion

Sep 10, 2024
Via arXiv

DTFormer: A Transformer-Based Method for Discrete-Time Dynamic Graph Representation Learning

Jul 26, 2024
Via arXiv

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach

Jul 23, 2024
Via arXiv

RefXVC: Cross-Lingual Voice Conversion with Enhanced Reference Leveraging

Jun 24, 2024
Via arXiv