Guan Pang

Animated Stickers: Bringing Stickers to Life with Video Diffusion

Feb 08, 2024
David Yan, Winnie Zhang, Luxin Zhang, Anmol Kalia, Dingkang Wang, Ankit Ramchandani, Miao Liu, Albert Pumarola, Edgar Schoenfeld, Elliot Blanchard, Krishna Narni, Yaqiao Luo, Lawrence Chen, Guan Pang, Ali Thabet, Peter Vajda, Amy Bearman, Licheng Yu

LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning

Dec 06, 2023
Bolin Lai, Xiaoliang Dai, Lawrence Chen, Guan Pang, James M. Rehg, Miao Liu

DISGO: Automatic End-to-End Evaluation for Scene Text OCR

Aug 25, 2023
Mei-Yuh Hwang, Yangyang Shi, Ankit Ramchandani, Guan Pang, Praveen Krishnan, Lucas Kabela, Frank Seide, Samyak Datta, Jun Liu

Text-Conditional Contextualized Avatars For Zero-Shot Personalization

Apr 14, 2023
Samaneh Azadi, Thomas Hayes, Akbar Shah, Guan Pang, Devi Parikh, Sonal Gupta

MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration

Apr 28, 2022
Thomas Hayes, Songyang Zhang, Xi Yin, Guan Pang, Sasha Sheng, Harry Yang, Songwei Ge, Qiyuan Hu, Devi Parikh

Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer

Apr 07, 2022
Songwei Ge, Thomas Hayes, Harry Yang, Xi Yin, Guan Pang, David Jacobs, Jia-Bin Huang, Devi Parikh

TextStyleBrush: Transfer of Text Aesthetics from a Single Example

Jun 15, 2021
Praveen Krishnan, Rama Kovvuri, Guan Pang, Boris Vassilev, Tal Hassner

TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text

May 12, 2021
Amanpreet Singh, Guan Pang, Mandy Toh, Jing Huang, Wojciech Galuba, Tal Hassner

A Multiplexed Network for End-to-End, Multilingual OCR

Mar 29, 2021
Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J Liang, Praveen Krishnan, Xi Yin, Tal Hassner
