Picture for Yang Ye

Yang Ye

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Add code
May 28, 2025
Viaarxiv icon

ImgEdit: A Unified Image Editing Dataset and Benchmark

Add code
May 26, 2025
Viaarxiv icon

Tactile-based Reinforcement Learning for Adaptive Grasping under Observation Uncertainties

Add code
May 22, 2025
Viaarxiv icon

HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding

Add code
May 19, 2025
Viaarxiv icon

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

Add code
Apr 11, 2025
Viaarxiv icon

Fast Autoregressive Video Generation with Diagonal Decoding

Add code
Mar 18, 2025
Viaarxiv icon

Force-Based Robotic Imitation Learning: A Two-Phase Approach for Construction Assembly Tasks

Add code
Jan 24, 2025
Viaarxiv icon

Open-Sora Plan: Open-Source Large Video Generation Model

Add code
Nov 28, 2024
Figure 1 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 2 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 3 for Open-Sora Plan: Open-Source Large Video Generation Model
Figure 4 for Open-Sora Plan: Open-Source Large Video Generation Model
Viaarxiv icon

WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model

Add code
Nov 26, 2024
Viaarxiv icon

Apple Intelligence Foundation Language Models

Add code
Jul 29, 2024
Figure 1 for Apple Intelligence Foundation Language Models
Figure 2 for Apple Intelligence Foundation Language Models
Figure 3 for Apple Intelligence Foundation Language Models
Figure 4 for Apple Intelligence Foundation Language Models
Viaarxiv icon