Picture for Oncel Tuzel

Oncel Tuzel

MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Add code
Jul 12, 2024
Viaarxiv icon

Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions

Add code
Jul 09, 2024
Viaarxiv icon

Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum

Add code
May 21, 2024
Figure 1 for Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum
Figure 2 for Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum
Figure 3 for Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum
Figure 4 for Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum
Viaarxiv icon

CLIP with Quality Captions: A Strong Pretraining for Vision Tasks

Add code
May 14, 2024
Viaarxiv icon

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Add code
Apr 24, 2024
Viaarxiv icon

Weight subcloning: direct initialization of transformers using larger pretrained ones

Add code
Dec 14, 2023
Figure 1 for Weight subcloning: direct initialization of transformers using larger pretrained ones
Figure 2 for Weight subcloning: direct initialization of transformers using larger pretrained ones
Figure 3 for Weight subcloning: direct initialization of transformers using larger pretrained ones
Figure 4 for Weight subcloning: direct initialization of transformers using larger pretrained ones
Viaarxiv icon

Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications

Add code
Nov 30, 2023
Figure 1 for Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications
Figure 2 for Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications
Figure 3 for Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications
Figure 4 for Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications
Viaarxiv icon

Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models

Add code
Nov 30, 2023
Figure 1 for Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models
Figure 2 for Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models
Figure 3 for Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models
Figure 4 for Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models
Viaarxiv icon

HUGS: Human Gaussian Splats

Add code
Nov 29, 2023
Figure 1 for HUGS: Human Gaussian Splats
Figure 2 for HUGS: Human Gaussian Splats
Figure 3 for HUGS: Human Gaussian Splats
Figure 4 for HUGS: Human Gaussian Splats
Viaarxiv icon

MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training

Add code
Nov 28, 2023
Viaarxiv icon