Alert button
Picture for Kihyuk Sohn

Kihyuk Sohn

Alert button

Robust Disaster Assessment from Aerial Imagery Using Text-to-Image Synthetic Data

Add code
Bookmark button
Alert button
May 22, 2024
Tarun Kalluri, Jihyeon Lee, Kihyuk Sohn, Sahil Singla, Manmohan Chandraker, Joseph Xu, Jeremiah Liu

Viaarxiv icon

Text Prompting for Multi-Concept Video Customization by Autoregressive Generation

Add code
Bookmark button
Alert button
May 22, 2024
Divya Kothandaraman, Kihyuk Sohn, Ruben Villegas, Paul Voigtlaender, Dinesh Manocha, Mohammad Babaeizadeh

Viaarxiv icon

DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow

Add code
Bookmark button
Alert button
Mar 22, 2024
Kyungmin Lee, Kihyuk Sohn, Jinwoo Shin

Viaarxiv icon

Direct Consistency Optimization for Compositional Text-to-Image Personalization

Add code
Bookmark button
Alert button
Feb 19, 2024
Kyungmin Lee, Sangkyung Kwak, Kihyuk Sohn, Jinwoo Shin

Viaarxiv icon

Unsupervised LLM Adaptation for Question Answering

Add code
Bookmark button
Alert button
Feb 16, 2024
Kuniaki Saito, Kihyuk Sohn, Chen-Yu Lee, Yoshitaka Ushiku

Viaarxiv icon

Instruct-Imagen: Image Generation with Multi-modal Instruction

Add code
Bookmark button
Alert button
Jan 03, 2024
Hexiang Hu, Kelvin C. K. Chan, Yu-Chuan Su, Wenhu Chen, Yandong Li, Kihyuk Sohn, Yang Zhao, Xue Ben, Boqing Gong, William Cohen, Ming-Wei Chang, Xuhui Jia

Viaarxiv icon

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Add code
Bookmark button
Alert button
Dec 21, 2023
Dan Kondratyuk, Lijun Yu, Xiuye Gu, José Lezama, Jonathan Huang, Rachel Hornung, Hartwig Adam, Hassan Akbari, Yair Alon, Vighnesh Birodkar, Yong Cheng, Ming-Chang Chiu, Josh Dillon, Irfan Essa, Agrim Gupta, Meera Hahn, Anja Hauth, David Hendon, Alonso Martinez, David Minnen, David Ross, Grant Schindler, Mikhail Sirotenko, Kihyuk Sohn, Krishna Somandepalli, Huisheng Wang, Jimmy Yan, Ming-Hsuan Yang, Xuan Yang, Bryan Seybold, Lu Jiang

Figure 1 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 2 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 3 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 4 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Viaarxiv icon

Photorealistic Video Generation with Diffusion Models

Add code
Bookmark button
Alert button
Dec 11, 2023
Agrim Gupta, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, José Lezama

Viaarxiv icon

Improve Supervised Representation Learning with Masked Image Modeling

Add code
Bookmark button
Alert button
Dec 01, 2023
Kaifeng Chen, Daniel Salz, Huiwen Chang, Kihyuk Sohn, Dilip Krishnan, Mojtaba Seyedhosseini

Viaarxiv icon

Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation

Add code
Bookmark button
Alert button
Oct 09, 2023
Lijun Yu, José Lezama, Nitesh B. Gundavarapu, Luca Versari, Kihyuk Sohn, David Minnen, Yong Cheng, Agrim Gupta, Xiuye Gu, Alexander G. Hauptmann, Boqing Gong, Ming-Hsuan Yang, Irfan Essa, David A. Ross, Lu Jiang

Figure 1 for Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Figure 2 for Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Figure 3 for Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Figure 4 for Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
Viaarxiv icon