Alert button
Picture for Oncel Tuzel

Oncel Tuzel

Alert button

CLIP with Quality Captions: A Strong Pretraining for Vision Tasks

Add code
Bookmark button
Alert button
May 14, 2024
Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Oncel Tuzel

Viaarxiv icon

CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

Add code
Bookmark button
Alert button
Apr 24, 2024
Sachin Mehta, Maxwell Horton, Fartash Faghri, Mohammad Hossein Sekhavat, Mahyar Najibi, Mehrdad Farajtabar, Oncel Tuzel, Mohammad Rastegari

Viaarxiv icon

Weight subcloning: direct initialization of transformers using larger pretrained ones

Add code
Bookmark button
Alert button
Dec 14, 2023
Mohammad Samragh, Mehrdad Farajtabar, Sachin Mehta, Raviteja Vemulapalli, Fartash Faghri, Devang Naik, Oncel Tuzel, Mohammad Rastegari

Figure 1 for Weight subcloning: direct initialization of transformers using larger pretrained ones
Figure 2 for Weight subcloning: direct initialization of transformers using larger pretrained ones
Figure 3 for Weight subcloning: direct initialization of transformers using larger pretrained ones
Figure 4 for Weight subcloning: direct initialization of transformers using larger pretrained ones
Viaarxiv icon

Probabilistic Speech-Driven 3D Facial Motion Synthesis: New Benchmarks, Methods, and Applications

Add code
Bookmark button
Alert button
Nov 30, 2023
Karren D. Yang, Anurag Ranjan, Jen-Hao Rick Chang, Raviteja Vemulapalli, Oncel Tuzel

Viaarxiv icon

Label-efficient Training of Small Task-specific Models by Leveraging Vision Foundation Models

Add code
Bookmark button
Alert button
Nov 30, 2023
Raviteja Vemulapalli, Hadi Pouransari, Fartash Faghri, Sachin Mehta, Mehrdad Farajtabar, Mohammad Rastegari, Oncel Tuzel

Viaarxiv icon

HUGS: Human Gaussian Splats

Add code
Bookmark button
Alert button
Nov 29, 2023
Muhammed Kocabas, Jen-Hao Rick Chang, James Gabriel, Oncel Tuzel, Anurag Ranjan

Figure 1 for HUGS: Human Gaussian Splats
Figure 2 for HUGS: Human Gaussian Splats
Figure 3 for HUGS: Human Gaussian Splats
Figure 4 for HUGS: Human Gaussian Splats
Viaarxiv icon

MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training

Add code
Bookmark button
Alert button
Nov 28, 2023
Pavan Kumar Anasosalu Vasu, Hadi Pouransari, Fartash Faghri, Raviteja Vemulapalli, Oncel Tuzel

Viaarxiv icon

TiC-CLIP: Continual Training of CLIP Models

Add code
Bookmark button
Alert button
Oct 24, 2023
Saurabh Garg, Mehrdad Farajtabar, Hadi Pouransari, Raviteja Vemulapalli, Sachin Mehta, Oncel Tuzel, Vaishaal Shankar, Fartash Faghri

Viaarxiv icon

Novel-View Acoustic Synthesis from 3D Reconstructed Rooms

Add code
Bookmark button
Alert button
Oct 23, 2023
Byeongjoo Ahn, Karren Yang, Brian Hamilton, Jonathan Sheaffer, Anurag Ranjan, Miguel Sarabia, Oncel Tuzel, Jen-Hao Rick Chang

Viaarxiv icon

SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding

Add code
Bookmark button
Alert button
Oct 23, 2023
Haoxiang Wang, Pavan Kumar Anasosalu Vasu, Fartash Faghri, Raviteja Vemulapalli, Mehrdad Farajtabar, Sachin Mehta, Mohammad Rastegari, Oncel Tuzel, Hadi Pouransari

Figure 1 for SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Figure 2 for SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Figure 3 for SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Figure 4 for SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Viaarxiv icon