Alert button
Picture for Ming-Hsuan Yang

Ming-Hsuan Yang

Alert button

Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Feb 29, 2024
Tsai-Shien Chen, Aliaksandr Siarohin, Willi Menapace, Ekaterina Deyneka, Hsiang-wei Chao, Byung Eun Jeon, Yuwei Fang, Hsin-Ying Lee, Jian Ren, Ming-Hsuan Yang, Sergey Tulyakov

Viaarxiv icon

Interactive Multi-Head Self-Attention with Linear Complexity

Feb 27, 2024
Hankyul Kang, Ming-Hsuan Yang, Jongbin Ryu

Viaarxiv icon

Scene Prior Filtering for Depth Map Super-Resolution

Feb 23, 2024
Zhengxue Wang, Zhiqiang Yan, Ming-Hsuan Yang, Jinshan Pan, Jian Yang, Ying Tai, Guangwei Gao

Viaarxiv icon

StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing

Feb 21, 2024
Gaoxiang Cong, Yuankai Qi, Liang Li, Amin Beheshti, Zhedong Zhang, Anton van den Hengel, Ming-Hsuan Yang, Chenggang Yan, Qingming Huang

Viaarxiv icon

VideoPrism: A Foundational Visual Encoder for Video Understanding

Feb 20, 2024
Long Zhao, Nitesh B. Gundavarapu, Liangzhe Yuan, Hao Zhou, Shen Yan, Jennifer J. Sun, Luke Friedman, Rui Qian, Tobias Weyand, Yue Zhao, Rachel Hornung, Florian Schroff, Ming-Hsuan Yang, David A. Ross, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Ting Liu, Boqing Gong

Viaarxiv icon

Training Class-Imbalanced Diffusion Model Via Overlap Optimization

Feb 16, 2024
Divin Yan, Lu Qi, Vincent Tao Hu, Ming-Hsuan Yang, Meng Tang

Viaarxiv icon

GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting

Feb 11, 2024
Xiaoyu Zhou, Xingjian Ran, Yajiao Xiong, Jinlin He, Zhiwei Lin, Yongtao Wang, Deqing Sun, Ming-Hsuan Yang

Viaarxiv icon

Generalizable Entity Grounding via Assistance of Large Language Model

Feb 04, 2024
Lu Qi, Yi-Wen Chen, Lehan Yang, Tiancheng Shen, Xiangtai Li, Weidong Guo, Yu Xu, Ming-Hsuan Yang

Viaarxiv icon

PromptRR: Diffusion Models as Prompt Generators for Single Image Reflection Removal

Feb 04, 2024
Tao Wang, Wanglong Lu, Kaihao Zhang, Wenhan Luo, Tae-Kyun Kim, Tong Lu, Hongdong Li, Ming-Hsuan Yang

Viaarxiv icon

RAP-SAM: Towards Real-Time All-Purpose Segment Anything

Jan 18, 2024
Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang

Viaarxiv icon