Alert button
Picture for Soravit Changpinyo

Soravit Changpinyo

Alert button

Towards Multi-Lingual Visual Question Answering

Add code
Bookmark button
Alert button
Sep 12, 2022
Soravit Changpinyo, Linting Xue, Idan Szpektor, Ashish V. Thapliyal, Julien Amelot, Xi Chen, Radu Soricut

Figure 1 for Towards Multi-Lingual Visual Question Answering
Figure 2 for Towards Multi-Lingual Visual Question Answering
Figure 3 for Towards Multi-Lingual Visual Question Answering
Figure 4 for Towards Multi-Lingual Visual Question Answering
Viaarxiv icon

All You May Need for VQA are Image Captions

Add code
Bookmark button
Alert button
May 04, 2022
Soravit Changpinyo, Doron Kukliansky, Idan Szpektor, Xi Chen, Nan Ding, Radu Soricut

Figure 1 for All You May Need for VQA are Image Captions
Figure 2 for All You May Need for VQA are Image Captions
Figure 3 for All You May Need for VQA are Image Captions
Figure 4 for All You May Need for VQA are Image Captions
Viaarxiv icon

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation

Add code
Bookmark button
Alert button
Jul 05, 2021
Tai-Yu Pan, Cheng Zhang, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao

Figure 1 for On Model Calibration for Long-Tailed Object Detection and Instance Segmentation
Figure 2 for On Model Calibration for Long-Tailed Object Detection and Instance Segmentation
Figure 3 for On Model Calibration for Long-Tailed Object Detection and Instance Segmentation
Figure 4 for On Model Calibration for Long-Tailed Object Detection and Instance Segmentation
Viaarxiv icon

2.5D Visual Relationship Detection

Add code
Bookmark button
Alert button
Apr 26, 2021
Yu-Chuan Su, Soravit Changpinyo, Xiangning Chen, Sathish Thoppay, Cho-Jui Hsieh, Lior Shapira, Radu Soricut, Hartwig Adam, Matthew Brown, Ming-Hsuan Yang, Boqing Gong

Figure 1 for 2.5D Visual Relationship Detection
Figure 2 for 2.5D Visual Relationship Detection
Figure 3 for 2.5D Visual Relationship Detection
Figure 4 for 2.5D Visual Relationship Detection
Viaarxiv icon

Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts

Add code
Bookmark button
Alert button
Feb 17, 2021
Soravit Changpinyo, Piyush Sharma, Nan Ding, Radu Soricut

Figure 1 for Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Figure 2 for Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Figure 3 for Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Figure 4 for Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Viaarxiv icon

A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection

Add code
Bookmark button
Alert button
Feb 17, 2021
Cheng Zhang, Tai-Yu Pan, Yandong Li, Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao

Figure 1 for A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
Figure 2 for A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
Figure 3 for A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
Figure 4 for A Simple and Effective Use of Object-Centric Images for Long-Tailed Object Detection
Viaarxiv icon

Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval

Add code
Bookmark button
Alert button
Feb 09, 2021
Soravit Changpinyo, Jordi Pont-Tuset, Vittorio Ferrari, Radu Soricut

Figure 1 for Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval
Figure 2 for Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval
Figure 3 for Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval
Figure 4 for Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval
Viaarxiv icon

Weakly Supervised Content Selection for Improved Image Captioning

Add code
Bookmark button
Alert button
Sep 10, 2020
Khyathi Raghavi Chandu, Piyush Sharma, Soravit Changpinyo, Ashish Thapliyal, Radu Soricut

Figure 1 for Weakly Supervised Content Selection for Improved Image Captioning
Figure 2 for Weakly Supervised Content Selection for Improved Image Captioning
Figure 3 for Weakly Supervised Content Selection for Improved Image Captioning
Figure 4 for Weakly Supervised Content Selection for Improved Image Captioning
Viaarxiv icon

Connecting Vision and Language with Localized Narratives

Add code
Bookmark button
Alert button
Dec 06, 2019
Jordi Pont-Tuset, Jasper Uijlings, Soravit Changpinyo, Radu Soricut, Vittorio Ferrari

Figure 1 for Connecting Vision and Language with Localized Narratives
Figure 2 for Connecting Vision and Language with Localized Narratives
Figure 3 for Connecting Vision and Language with Localized Narratives
Figure 4 for Connecting Vision and Language with Localized Narratives
Viaarxiv icon

Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering

Add code
Bookmark button
Alert button
Sep 04, 2019
Soravit Changpinyo, Bo Pang, Piyush Sharma, Radu Soricut

Figure 1 for Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Figure 2 for Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Figure 3 for Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Figure 4 for Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering
Viaarxiv icon