Omid Poursaeed

Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs

Apr 11, 2024
Kanchana Ranasinghe, Satya Narayan Shukla, Omid Poursaeed, Michael S. Ryoo, Tsung-Yu Lin

Universal Pyramid Adversarial Training for Improved ViT Performance

Dec 26, 2023
Ping-yeh Chiang, Yipin Zhou, Omid Poursaeed, Satya Narayan Shukla, Ashish Shah, Tom Goldstein, Ser-Nam Lim

Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding

Sep 20, 2023
Mohamed Afham, Satya Narayan Shukla, Omid Poursaeed, Pengchuan Zhang, Ashish Shah, Ser-Nam Lim

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles

Jun 01, 2023
Chaitanya Ryali, Yuan-Ting Hu, Daniel Bolya, Chen Wei, Haoqi Fan, Po-Yao Huang, Vaibhav Aggarwal, Arkabandhu Chowdhury, Omid Poursaeed, Judy Hoffman, Jitendra Malik, Yanghao Li, Christoph Feichtenhofer

Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning

Dec 09, 2022
Jishnu Mukhoti, Tsung-Yu Lin, Omid Poursaeed, Rui Wang, Ashish Shah, Philip H. S. Torr, Ser-Nam Lim

A Unified Model for Tracking and Image-Video Detection Has More Power

Nov 20, 2022
Peirong Liu, Rui Wang, Pengchuan Zhang, Omid Poursaeed, Yipin Zhou, Xuefei Cao, Sreya Dutta Roy, Ashish Shah, Ser-Nam Lim

Robustness and Generalization via Generative Adversarial Training

Sep 06, 2021
Omid Poursaeed, Tianxing Jiang, Harry Yang, Serge Belongie, Ser-Nam Lim

Augmentation-Interpolative AutoEncoders for Unsupervised Few-Shot Image Generation

Nov 25, 2020
Davis Wertheimer, Omid Poursaeed, Bharath Hariharan

Self-supervised Learning of Point Clouds via Orientation Estimation

Aug 01, 2020
Omid Poursaeed, Tianxing Jiang, Quintessa Qiao, Nayun Xu, Vladimir G. Kim

Coupling Explicit and Implicit Surface Representations for Generative 3D Modeling

Jul 20, 2020
Omid Poursaeed, Matthew Fisher, Noam Aigerman, Vladimir G. Kim
