Irfan Essa

3D Semantic MapNet: Building Maps for Multi-Object Re-Identification in 3D

Mar 19, 2024
Vincent Cartillier, Neha Jain, Irfan Essa

Via arXiv

On the Efficacy of Text-Based Input Modalities for Action Anticipation

Jan 23, 2024
Apoorva Beedu, Karan Samel, Irfan Essa

Via arXiv

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

Jan 11, 2024
Seung Hyun Lee, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu, Qifei Wang, Fei Deng, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang

Via arXiv

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Dec 21, 2023
Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Rachel Hornung, Hartwig Adam, Hassan Akbari, Yair Alon, Vighnesh Birodkar, Yong Cheng, Ming-Chang Chiu, Josh Dillon, Irfan Essa, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, David Ross, Grant Schindler, Mikhail Sirotenko, Kihyuk Sohn, Krishna Somandepalli, Huisheng Wang, Jimmy Yan, Ming-Hsuan Yang, Xuan Yang, Bryan Seybold, Lu Jiang

Via arXiv

Photorealistic Video Generation with Diffusion Models

Dec 11, 2023
Agrim Gupta, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, José Lezama

Via arXiv

Text and Click inputs for unambiguous open vocabulary instance segmentation

Nov 24, 2023
Nikolai Warner, Meera Hahn, Jonathan Huang, Irfan Essa, Vighnesh Birodkar

Via arXiv

BayRnTune: Adaptive Bayesian Domain Randomization via Strategic Fine-tuning

Oct 16, 2023
Tianle Huang, Nitish Sontakke, K. Niranjan Kumar, Irfan Essa, Stefanos Nikolaidis, Dennis W. Hong, Sehoon Ha

Via arXiv

Words into Action: Learning Diverse Humanoid Robot Behaviors using Language Guided Iterative Motion Refinement

Oct 10, 2023
K. Niranjan Kumar, Irfan Essa, Sehoon Ha

Via arXiv

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Oct 09, 2023
Lijun Yu, José Lezama, Nitesh B. Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang

Via arXiv

Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition

Sep 03, 2023
Hyeongju Choi, Apoorva Beedu, Irfan Essa

Via arXiv