Picture for Gen Li

Gen Li

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Add code
Dec 23, 2025
Viaarxiv icon

Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment

Add code
Nov 06, 2025
Viaarxiv icon

Scaffolding Metacognition in Programming Education: Understanding Student-AI Interactions and Design Implications

Add code
Nov 06, 2025
Viaarxiv icon

Optimal Convergence Analysis of DDPM for General Distributions

Add code
Oct 31, 2025
Viaarxiv icon

Towards Efficient Online Exploration for Reinforcement Learning with Human Feedback

Add code
Sep 26, 2025
Viaarxiv icon

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Add code
Sep 10, 2025
Viaarxiv icon

Connections between reinforcement learning with feedback,test-time scaling, and diffusion guidance: An anthology

Add code
Sep 04, 2025
Viaarxiv icon

Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty

Add code
Aug 26, 2025
Figure 1 for Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty
Figure 2 for Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty
Figure 3 for Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty
Figure 4 for Can Structured Templates Facilitate LLMs in Tackling Harder Tasks? : An Exploration of Scaling Laws by Difficulty
Viaarxiv icon

Dual Enhancement on 3D Vision-Language Perception for Monocular 3D Visual Grounding

Add code
Aug 26, 2025
Viaarxiv icon

Hydra-Bench: A Benchmark for Multi-Modal Leaf Wetness Sensing

Add code
Jul 30, 2025
Viaarxiv icon