Picture for Jiayu Wang

Jiayu Wang

Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models

Add code
Jun 21, 2024
Figure 1 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 2 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 3 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 4 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Viaarxiv icon

Lean Workbook: A large-scale Lean problem set formalized from natural language math problems

Add code
Jun 07, 2024
Viaarxiv icon

UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation

Add code
Jun 03, 2024
Figure 1 for UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Figure 2 for UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Figure 3 for UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Figure 4 for UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation
Viaarxiv icon

Grammar-Aligned Decoding

Add code
May 31, 2024
Figure 1 for Grammar-Aligned Decoding
Figure 2 for Grammar-Aligned Decoding
Figure 3 for Grammar-Aligned Decoding
Figure 4 for Grammar-Aligned Decoding
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Add code
Feb 09, 2024
Viaarxiv icon

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Add code
Dec 15, 2023
Figure 1 for DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Figure 2 for DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Figure 3 for DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Figure 4 for DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Viaarxiv icon

Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation

Add code
Dec 07, 2023
Figure 1 for Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Figure 2 for Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Figure 3 for Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Figure 4 for Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Viaarxiv icon

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Add code
Nov 07, 2023
Viaarxiv icon