Chong Luo

Streaming Video Model

Mar 30, 2023
Yucheng Zhao, Chong Luo, Chuanxin Tang, Dongdong Chen, Noel Codella, Zheng-Jun Zha

Universal Few-shot Learning of Dense Prediction Tasks with Visual Token Matching

Mar 27, 2023
Donggyun Kim, Jinwoo Kim, Seongwoong Cho, Chong Luo, Seunghoon Hong

OmniTracker: Unifying Object Tracking by Tracking-with-Detection

Mar 21, 2023
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang

Look Before You Match: Instance Understanding Matters in Video Object Segmentation

Dec 13, 2022
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang

TridentSE: Guiding Speech Enhancement with 32 Global Tokens

Oct 24, 2022
Dacheng Yin, Zhiyuan Zhao, Chuanxin Tang, Zhiwei Xiong, Chong Luo

OmniVL: One Foundation Model for Image-Language and Video-Language Tasks

Sep 15, 2022
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Luowei Zhou, Yucheng Zhao, Yujia Xie, Ce Liu, Yu-Gang Jiang, Lu Yuan

An Anchor-Free Detector for Continuous Speech Keyword Spotting

Aug 09, 2022
Zhiyuan Zhao, Chuanxin Tang, Chengdong Yao, Chong Luo

RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion

Jun 28, 2022
Dacheng Yin, Chuanxin Tang, Yanqing Liu, Xiaoqiang Wang, Zhiyuan Zhao, Yucheng Zhao, Zhiwei Xiong, Sheng Zhao, Chong Luo

Peripheral Vision Transformer

Jun 14, 2022
Juhong Min, Yucheng Zhao, Chong Luo, Minsu Cho

Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph

Feb 24, 2022
Dacheng Yin, Xuanchi Ren, Chong Luo, Yuwang Wang, Zhiwei Xiong, Wenjun Zeng
