Alert button

"Image": models, code, and papers
Alert button

Flooding Regularization for Stable Training of Generative Adversarial Networks

Nov 01, 2023
Iu Yahiro, Takashi Ishida, Naoto Yokoya

Viaarxiv icon

The Development of LLMs for Embodied Navigation

Add code
Bookmark button
Alert button
Nov 01, 2023
Jinzhou Lin, Han Gao, Rongtao Xu, Changwei Wang, Li Guo, Shibiao Xu

Viaarxiv icon

Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection

Add code
Bookmark button
Alert button
Oct 18, 2023
Lingchen Meng, Xiyang Dai, Jianwei Yang, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Yi-Ling Chen, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Figure 1 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 2 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 3 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Figure 4 for Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
Viaarxiv icon

Cross-modal Active Complementary Learning with Self-refining Correspondence

Add code
Bookmark button
Alert button
Oct 26, 2023
Yang Qin, Yuan Sun, Dezhong Peng, Joey Tianyi Zhou, Xi Peng, Peng Hu

Viaarxiv icon

Entity Embeddings : Perspectives Towards an Omni-Modality Era for Large Language Models

Oct 27, 2023
Eren Unlu, Unver Ciftci

Viaarxiv icon

DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks

Add code
Bookmark button
Alert button
Sep 14, 2023
Zipeng Qi, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang

Figure 1 for DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks
Figure 2 for DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks
Figure 3 for DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks
Figure 4 for DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks
Viaarxiv icon

Group Testing for Accurate and Efficient Range-Based Near Neighbor Search : An Adaptive Binary Splitting Approach

Add code
Bookmark button
Alert button
Nov 05, 2023
Kashish Mittal, Harsh Shah, Ajit Rajwade

Viaarxiv icon

Handwritten image augmentation

Aug 26, 2023
Mahendran N

Figure 1 for Handwritten image augmentation
Figure 2 for Handwritten image augmentation
Figure 3 for Handwritten image augmentation
Figure 4 for Handwritten image augmentation
Viaarxiv icon

ContextRef: Evaluating Referenceless Metrics For Image Description Generation

Add code
Bookmark button
Alert button
Sep 21, 2023
Elisa Kreiss, Eric Zelikman, Christopher Potts, Nick Haber

Figure 1 for ContextRef: Evaluating Referenceless Metrics For Image Description Generation
Figure 2 for ContextRef: Evaluating Referenceless Metrics For Image Description Generation
Figure 3 for ContextRef: Evaluating Referenceless Metrics For Image Description Generation
Figure 4 for ContextRef: Evaluating Referenceless Metrics For Image Description Generation
Viaarxiv icon

Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond

Add code
Bookmark button
Alert button
Oct 31, 2023
Zhecan Wang, Long Chen, Haoxuan You, Keyang Xu, Yicheng He, Wenhao Li, Noel Codella, Kai-Wei Chang, Shih-Fu Chang

Figure 1 for Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Figure 2 for Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Figure 3 for Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Figure 4 for Dataset Bias Mitigation in Multiple-Choice Visual Question Answering and Beyond
Viaarxiv icon