Picture for Linchao Zhu

Linchao Zhu

DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion

Add code
Sep 04, 2023
Figure 1 for DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion
Figure 2 for DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion
Figure 3 for DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion
Figure 4 for DiverseMotion: Towards Diverse Human Motion Generation via Discrete Diffusion
Viaarxiv icon

Tachikuma: Understading Complex Interactions with Multi-Character and Novel Objects by Large Language Models

Add code
Jul 24, 2023
Figure 1 for Tachikuma: Understading Complex Interactions with Multi-Character and Novel Objects by Large Language Models
Figure 2 for Tachikuma: Understading Complex Interactions with Multi-Character and Novel Objects by Large Language Models
Figure 3 for Tachikuma: Understading Complex Interactions with Multi-Character and Novel Objects by Large Language Models
Figure 4 for Tachikuma: Understading Complex Interactions with Multi-Character and Novel Objects by Large Language Models
Viaarxiv icon

Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition

Add code
Jul 03, 2023
Viaarxiv icon

Whitening-based Contrastive Learning of Sentence Embeddings

Add code
Jun 08, 2023
Viaarxiv icon

Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models

Add code
May 29, 2023
Viaarxiv icon

CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model

Add code
May 23, 2023
Viaarxiv icon

Gloss-Free End-to-End Sign Language Translation

Add code
May 22, 2023
Figure 1 for Gloss-Free End-to-End Sign Language Translation
Figure 2 for Gloss-Free End-to-End Sign Language Translation
Figure 3 for Gloss-Free End-to-End Sign Language Translation
Figure 4 for Gloss-Free End-to-End Sign Language Translation
Viaarxiv icon

Efficient Multimodal Fusion via Interactive Prompting

Add code
Apr 13, 2023
Viaarxiv icon

DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training

Add code
Mar 06, 2023
Figure 1 for DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Figure 2 for DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Figure 3 for DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Figure 4 for DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training
Viaarxiv icon

Variational Cross-Graph Reasoning and Adaptive Structured Semantics Learning for Compositional Temporal Grounding

Add code
Jan 22, 2023
Viaarxiv icon