Alert button
Picture for Saining Xie

Saining Xie

Alert button

V-IRL: Grounding Virtual Intelligence in Real Life

Add code
Bookmark button
Alert button
Feb 05, 2024
Jihan Yang, Runyu Ding, Ellis Brown, Xiaojuan Qi, Saining Xie

Viaarxiv icon

Deconstructing Denoising Diffusion Models for Self-Supervised Learning

Add code
Bookmark button
Alert button
Jan 25, 2024
Xinlei Chen, Zhuang Liu, Saining Xie, Kaiming He

Viaarxiv icon

SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers

Add code
Bookmark button
Alert button
Jan 16, 2024
Nanye Ma, Mark Goldstein, Michael S. Albergo, Nicholas M. Boffi, Eric Vanden-Eijnden, Saining Xie

Viaarxiv icon

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

Add code
Bookmark button
Alert button
Jan 11, 2024
Shengbang Tong, Zhuang Liu, Yuexiang Zhai, Yi Ma, Yann LeCun, Saining Xie

Viaarxiv icon

Image Sculpting: Precise Object Editing with 3D Geometry Control

Add code
Bookmark button
Alert button
Jan 02, 2024
Jiraphon Yenphraphai, Xichen Pan, Sainan Liu, Daniele Panozzo, Saining Xie

Viaarxiv icon

V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs

Add code
Bookmark button
Alert button
Dec 26, 2023
Penghao Wu, Saining Xie

Viaarxiv icon

Demystifying CLIP Data

Add code
Bookmark button
Alert button
Oct 02, 2023
Hu Xu, Saining Xie, Xiaoqing Ellen Tan, Po-Yao Huang, Russell Howes, Vasu Sharma, Shang-Wen Li, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer

Figure 1 for Demystifying CLIP Data
Figure 2 for Demystifying CLIP Data
Figure 3 for Demystifying CLIP Data
Figure 4 for Demystifying CLIP Data
Viaarxiv icon

Going Denser with Open-Vocabulary Part Segmentation

Add code
Bookmark button
Alert button
May 18, 2023
Peize Sun, Shoufa Chen, Chenchen Zhu, Fanyi Xiao, Ping Luo, Saining Xie, Zhicheng Yan

Figure 1 for Going Denser with Open-Vocabulary Part Segmentation
Figure 2 for Going Denser with Open-Vocabulary Part Segmentation
Figure 3 for Going Denser with Open-Vocabulary Part Segmentation
Figure 4 for Going Denser with Open-Vocabulary Part Segmentation
Viaarxiv icon

CiT: Curation in Training for Effective Vision-Language Data

Add code
Bookmark button
Alert button
Jan 05, 2023
Hu Xu, Saining Xie, Po-Yao Huang, Licheng Yu, Russell Howes, Gargi Ghosh, Luke Zettlemoyer, Christoph Feichtenhofer

Figure 1 for CiT: Curation in Training for Effective Vision-Language Data
Figure 2 for CiT: Curation in Training for Effective Vision-Language Data
Figure 3 for CiT: Curation in Training for Effective Vision-Language Data
Figure 4 for CiT: Curation in Training for Effective Vision-Language Data
Viaarxiv icon

ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders

Add code
Bookmark button
Alert button
Jan 02, 2023
Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, Saining Xie

Figure 1 for ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Figure 2 for ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Figure 3 for ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Figure 4 for ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders
Viaarxiv icon