Silvio Savarese

ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding

May 18, 2023
Le Xue, Ning Yu, Shu Zhang, Junnan Li, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

May 18, 2023
Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu

ULIP-2: Towards Scalable Multimodal Pre-training For 3D Understanding

May 14, 2023
Le Xue, Ning Yu, Shu Zhang, Junnan Li, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese

CodeGen2: Lessons for Training LLMs on Programming and Natural Languages

May 03, 2023
Erik Nijkamp, Hiroaki Hayashi, Caiming Xiong, Silvio Savarese, Yingbo Zhou

Procedure-Aware Pretraining for Instructional Video Understanding

Mar 31, 2023
Honglu Zhou, Roberto Martín-Martín, Mubbasir Kapadia, Silvio Savarese, Juan Carlos Niebles

HIVE: Harnessing Human Feedback for Instructional Visual Editing

Mar 16, 2023
Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

Jan 30, 2023
Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi

ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding

Dec 10, 2022
Le Xue, Mingfei Gao, Chen Xing, Roberto Martín-Martín, Jiajun Wu, Caiming Xiong, Ran Xu, Juan Carlos Niebles, Silvio Savarese

Best-$k$ Search Algorithm for Neural Text Generation

Nov 22, 2022
Jiacheng Xu, Caiming Xiong, Silvio Savarese, Yingbo Zhou

Online Distribution Shift Detection via Recency Prediction

Nov 17, 2022
Rachel Luo, Rohan Sinha, Ali Hindy, Shengjia Zhao, Silvio Savarese, Edward Schmerling, Marco Pavone
