Picture for Chen Sun

Chen Sun

MotiF: Making Text Count in Image Animation with Motion Focal Loss

Add code
Dec 20, 2024
Viaarxiv icon

Motion Prompting: Controlling Video Generation with Motion Trajectories

Add code
Dec 03, 2024
Viaarxiv icon

$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources

Add code
Oct 30, 2024
Figure 1 for $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
Figure 2 for $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
Figure 3 for $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
Figure 4 for $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
Viaarxiv icon

An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion

Add code
Oct 29, 2024
Figure 1 for An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion
Figure 2 for An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion
Figure 3 for An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion
Figure 4 for An Efficient Approach to Generate Safe Drivable Space by LiDAR-Camera-HDmap Fusion
Viaarxiv icon

Fourier Head: Helping Large Language Models Learn Complex Probability Distributions

Add code
Oct 29, 2024
Viaarxiv icon

Learning and Unlearning of Fabricated Knowledge in Language Models

Add code
Oct 29, 2024
Figure 1 for Learning and Unlearning of Fabricated Knowledge in Language Models
Figure 2 for Learning and Unlearning of Fabricated Knowledge in Language Models
Figure 3 for Learning and Unlearning of Fabricated Knowledge in Language Models
Figure 4 for Learning and Unlearning of Fabricated Knowledge in Language Models
Viaarxiv icon

Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens

Add code
Oct 17, 2024
Figure 1 for Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
Figure 2 for Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
Figure 3 for Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
Figure 4 for Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens
Viaarxiv icon

Do Music Generation Models Encode Music Theory?

Add code
Oct 01, 2024
Figure 1 for Do Music Generation Models Encode Music Theory?
Figure 2 for Do Music Generation Models Encode Music Theory?
Figure 3 for Do Music Generation Models Encode Music Theory?
Figure 4 for Do Music Generation Models Encode Music Theory?
Viaarxiv icon

Do Pre-trained Vision-Language Models Encode Object States?

Add code
Sep 16, 2024
Figure 1 for Do Pre-trained Vision-Language Models Encode Object States?
Figure 2 for Do Pre-trained Vision-Language Models Encode Object States?
Figure 3 for Do Pre-trained Vision-Language Models Encode Object States?
Figure 4 for Do Pre-trained Vision-Language Models Encode Object States?
Viaarxiv icon

EPO: Hierarchical LLM Agents with Environment Preference Optimization

Add code
Aug 28, 2024
Viaarxiv icon