Tianyu He

VidTok: A Versatile and Open-Source Video Tokenizer

Dec 17, 2024
Via arXiv

Compositional 3D-aware Video Generation with LLM Director

Aug 31, 2024
Via arXiv

A Generic Review of Integrating Artificial Intelligence in Cognitive Behavioral Therapy

Jul 28, 2024
Via arXiv

Cheems: Wonderful Matrices More Efficient and More Effective Architecture

Jul 25, 2024
Via arXiv

Video In-context Learning

Jul 10, 2024
Via arXiv

GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors

Jun 14, 2024
Via arXiv

Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement

Jun 12, 2024
Via arXiv

Grokking Modular Polynomials

Jun 05, 2024
Via arXiv

Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks

Jun 04, 2024
Via arXiv

InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation

May 24, 2024
Via arXiv